When Two Tech Giants Agree on Something, You Should Probably Listen

1,000×

compute demand growth — Huang, per workload type

30×

workflow improvement with full AI agent reimagination

1K+

new AI server clients added by Dell in a single quarter

I remember the exact moment I realized AI had genuinely crossed over from “cool demo” to “actually useful.” I was watching my team’s junior developer pair with an AI coding agent — not just tab-completing lines, but orchestrating a full debugging cycle: pulling logs, running tests, rewriting a broken function, pushing a fix. The whole loop. In under four minutes.

That was a few months ago. This week, Jensen Huang and Michael Dell stood on stage at Dell Technologies World in Las Vegas and told the world: yeah, that’s just the beginning. Buckle up.

Their joint interview with Bloomberg’s Ed Ludlow wasn’t your typical PR puff piece. This was two people who’ve been building the physical backbone of AI for decades — chips, servers, data centers — telling us exactly what they’re seeing from the inside. And honestly? It was one of the more clarifying tech conversations I’ve followed in a while.

“We’ve arrived at the era of useful AI”

That’s a direct Huang quote, and the line that stuck with me most. For years, AI felt like this perpetually impressive but slightly impractical thing. Great at generating images, decent at writing emails, occasionally wrong in ways you couldn’t predict.

But Huang’s framing at Dell Technologies World was different. He described how agentic AI — AI that doesn’t just respond but actually acts — has fundamentally changed what computing even means. Old model: human asks question, AI answers. New model: human sets a goal, AI figures out the steps, uses tools, accesses memory, and gets it done.

Architecture breakdown

There’s a large language model — the “brain.” Massive, computationally intense. Then there’s a harness around it — what lets the agent actually do things: access memory, call APIs, run code in a sandbox, interact with databases.

Nvidia’s reference harness is NemoClaw, running on their new Vera CPU. The sandbox layer — governing what agents can and can’t touch — is the open-source NemoShell, already standard across the industry.

If you’ve ever set up an AI agent pipeline — LangChain, AutoGPT, newer enterprise tooling — this maps directly to what you’ve experienced. The model alone isn’t the agent. The scaffolding around it is what makes it actually work.

✦

The memory problem nobody was talking about loudly enough

Here’s the part I found most interesting — and honestly a bit alarming for anyone planning infrastructure budgets.

“Demand for memory is outpacing supply — not slightly, significantly”

— Jensen Huang & Michael Dell · Bloomberg TV · May 2026

The reason is agentic AI. When an agent is running — pinging databases, checking its own memory for context, executing tools, running inference — it’s hitting memory constantly and fast. Huang noted that agent frameworks require far more CPUs than anything we built for before, because agents use tools constantly compared to how slowly human users interact.

3×

memory bandwidth of Vera CPU vs. conventional processors

10×

lower cost-per-token: Vera Rubin NVL72 vs. Blackwell

50%

faster agentic workloads on Vera vs. x86 processors

1.2 TB/s

memory bandwidth on NVIDIA Vera CPU

Dell put a number on the shift: companies that fully reimagine their workflows with AI agents aren’t seeing 10–30% efficiency gains. They’re seeing 10x to 30x improvements. Ten to thirty times. That kind of jump changes what hardware you need, how much of it, and how fast.

I’ve run into this personally at a smaller scale. Testing local AI agents on my MacBook earlier this year, I kept hitting walls — not because my GPU was maxed, but because I’d run out of fast memory for the context window. Scale that to an enterprise running thousands of agents simultaneously, and you understand why memory is now the central hardware conversation.

✦

“Demand is going parabolic. Utterly parabolic.”

Huang wasn’t being hyperbolic for effect. He said computation demand for agentic AI has grown 100x to 1,000x depending on the workload — not since five years ago, but in the recent past.

The supply chain is already strained. Dell added 1,000 new AI server clients in a single quarter. Huang acknowledged the semiconductor supply chain is growing — but demand is growing faster. Michael Dell described a chain so intricate it almost sounds like an engineering marvel in itself: silicon photonics, advanced CPUs, GPUs, high-bandwidth memory, all needing to be sourced and assembled at a pace the industry has never sustained before.

The Dell AI Factory with NVIDIA

Their joint platform spans the entire stack: from the new Dell Deskside Agentic AI workstation (local processing, no cloud, break-even in ~3 months) up to the PowerEdge XE9812 built on Nvidia Vera Rubin NVL72 — 10x lower cost-per-token than Blackwell.

Five thousand enterprises — Lilly, Samsung, Honeywell — are already running production workloads on Dell AI Factories with Nvidia. This is no longer a pilot-stage conversation.

The jump from Blackwell to Rubin is the kind of cost inflection that changes what’s economically viable. Things that cost too much to run six months ago might be deployable now.

✦

The China question: more nuanced than the headlines

Huang spoke days after joining President Trump’s summit in China. His statement — that he expects China’s market will eventually open up to US AI chips — grabbed headlines. But the actual quote is more careful:

“The Chinese government has to decide how much of their local market they want to protect”

— Jensen Huang · Bloomberg TV · Dell Technologies World 2026

That’s not triumphalism. That’s reading the situation honestly. Huang confirmed Nvidia is committed to complying with existing regulations — he’s not predicting policy change because he wants it, he’s observing that China’s AI sector needs high-performance chips, and protectionism has its own costs.

Michael Dell, who recently returned from China himself, added that the US administration wants to ensure American leadership in AI — meaning export restrictions exist for a reason, but so does market access. It’s a tension that doesn’t resolve cleanly. Dell’s emphasis on Taiwan as a vital manufacturing partner tells you this is a multi-year geopolitical chess game, not a single move.

✦

Personal AI: the part that got underplayed

Huang’s concept of “personal AI” deserves more attention — the shift from centralized cloud AI to AI running at the point of context: on your laptop, inside a factory machine, inside a vehicle.

Michael Dell backed this up practically: Dell is embedding the ability to run small local models directly inside PCs. Not as a gimmick — but because for knowledge work, the PC is still where people actually do things. If your agent needs to access sensitive company data, you don’t want it routing through a public cloud. Confidential computing on-device is the answer.

I’ve been testing this with a smaller team at work — using a local LLM for tasks involving proprietary data. The performance gap between cloud and local has closed more than most people realize. Not for everything, but for a surprising range of tasks.

✦

What mistakes are people making right now?

Both CEOs were speaking broadly about enterprise adoption. Here’s what I’ve seen from the trenches — because there are real mistakes companies make as they rush into agentic AI.

Treating AI agents like better chatbots

They’re not. An agent needs a specific workflow. “Make an AI that helps our sales team” is a chatbot prompt. “Build an agent that pulls CRM data, drafts follow-ups, checks calendar availability, and logs outreach” is an agentic use case. Be specific.

Ignoring the memory architecture

Most teams deploy AI tools without thinking about how the agent maintains context across sessions. Without persistent memory, your agent is amnesiac. Vector databases like Pinecone, Weaviate, or pgvector exist for exactly this reason.

Underestimating hardware requirements

If your agents are slow or expensive on shared cloud compute, you’re probably under-resourced on memory bandwidth — not raw GPU count. Huang and Dell made this point explicitly. The bottleneck is different now.

Skipping the sandbox

Agents that can take actions — write code, send emails, query databases — need governed environments. Nvidia’s open-source NemoShell sandbox is the industry reference for a reason. Don’t give agents unconstrained access to production systems.

✦

What actually matters from all this

Strip away the keynote language and product announcements: the infrastructure for AI is being purpose-built right now, fast, at enormous cost, because the demand curve isn’t slowing. Memory is the near-term bottleneck. Agentic AI is driving enterprise adoption from “pilot” to “production.” China is a complicated, ongoing situation. And the shift to local, on-device AI is more real and more soon than most people think.

Whether you’re a developer building with agents, a business evaluating infrastructure investment, or just someone trying to understand where this is all going — this conversation between Huang and Dell is worth your time. Find the Bloomberg interview on YouTube. Watch the whole thing.

It’s one of those rare tech moments where two people who actually build the things that make AI run sit down and tell you exactly what they see. No hype tax required.

When Two Tech Giants Agree on Something, You Should Probably Listen

“We’ve arrived at the era of useful AI”

The memory problem nobody was talking about loudly enough

“Demand is going parabolic. Utterly parabolic.”

The China question: more nuanced than the headlines

Personal AI: the part that got underplayed

What mistakes are people making right now?

What actually matters from all this

Leave a Comment Cancel Reply

GF Track APK

“We’ve arrived at the era of useful AI”

The memory problem nobody was talking about loudly enough

“Demand is going parabolic. Utterly parabolic.”

The China question: more nuanced than the headlines

Personal AI: the part that got underplayed

What mistakes are people making right now?

What actually matters from all this

Related Posts

Leave a Comment Cancel Reply

GF Track APK