Array
(
    [content] => 
    [params] => Array
        (
            [0] => /forum/threads/jensen-huang-on-strategy-and-vision-nvidias-future-physical-ai-rise-of-the-agent-inference-explosion-ai-pr-crisis.24792/
        )

    [addOns] => Array
        (
            [DL6/MLTP] => 13
            [Hampel/TimeZoneDebug] => 1000070
            [SV/ChangePostDate] => 2010200
            [SemiWiki/Newsletter] => 1000010
            [SemiWiki/WPMenu] => 1000010
            [SemiWiki/XPressExtend] => 1000010
            [ThemeHouse/XLink] => 1000970
            [ThemeHouse/XPress] => 1010570
            [XF] => 2030970
            [XFI] => 1060170
        )

    [wordpress] => /var/www/html
)

Jensen Huang on strategy and vision: Nvidia's Future, Physical AI, Rise of the Agent, Inference Explosion, AI PR Crisis

user nl

Well-known member
Interesting podcast with Jensen Huang at GTC 2026:

Special episode this week. We've preempted the weekly show and there's only three people we preempt the show for. President Trump, Jesus, and Jensen. And uh I'll let you pick which order we do that. Uh but what an amazing run you've had and a great event. Uh every industry is here. Every tech company is here. Every AI company is here. Incredible.


Mar 19, 2026 All-In Podcast

(0:00) Jensen Huang joins the show!
(1:00) Acquiring Groq and the inference explosion
(9:27) Decision making at the world's most valuable company
(11:22) Physical AI's $50T market, OpenClaw's future, the new operating system for modern AI computing
(17:12) AI's PR crisis, refuting doomer narratives, Anthropic's comms mistakes
(21:22) Revenue capacity, token allocation for employees, Karpathy's autoresearch, agentic future
(31:24) Open source, global diffusion, Iran/Taiwan supply chain impact
(40:19) Self-driving platform, facing competition from active customers, responding to growth slowdown predictions
(48:06) Datacenters in space, AI healthcare, Robotics
(56:44) OpenAI/Anthropic revenue potential, how to build an AI moat
(59:38) Advice to young people on excelling in the AI era
 
(1:00) Acquiring Groq and the inference explosion

Thought this section was great - very interesting key bullet points:

Disaggregated Inference and Groq's Role
* AI data centers are turning into huge "computers" with masses of heterogeneous processors, connectivity and storage racks
* All to service disaggregated inference at the far lowest cost per token
* Managed by a data-center (token factory) OS, Dynamo
* Groq plays a key role as agents grow in usage for fast, low latency decode and tokens - Jensen advocating adding 25% Groq to existing and future AI data centers to better support agents
* Fast TTFT/TPS tokens are more valuable than slower ones. Pareto curves of cost/power per token vs interactivity (TTFT, TPS) are critical to understanding benefits of NVIDIA and best cost curves.

Three New Basic Flavors of Compute
* Data center training (and inference)
* Physical (physics-based AI)
* Edge AI

One caveat - Jensen says NVIDIA "invented" disaggregated inference, but I think the timeline was more complicated.
* 2023 - Researchers at Peking University and UCSD develop Distserve, the first disaggregated inference system
* Late 2024 - DeepSeek delivers fast commercial disaggregated inference that also uses multi-headed latent attention
* March 2025 - NVIDA rolls out more general open source disaggregation data center "OS" called Dynamo, that will work with the whole NVIDIA ecosystem and beyond.
 
Last edited:
regarding OpenClaw, i saw it bust out on x.com where everyone is talking about. its a good first step toward Jarvis from Iron man. we went from simple chatbot to agent.

we will need more chip if this next phase of AI pop off. golden age of semis industry?
 
we will need more chip if this next phase of AI pop off. golden age of semis industry?

If you listen to Jensen, Cerebras and SambaNova, agents are going to demand a lot more low latency, high memory bandwidth (decode) processors so agents can use models that "move beyond conversation-speed interaction toward speed of thought computing. "

 
while there is no question NV has dominate position, and a lot of are due to the vision and execution. Jensen does have a tendency to re-write part of the history. He is no alone. As they say, winners get to write history.

Would be exciting to see a viable eco-system compete with NV at scale across the stack. The customized silicon for different applications might be the key to drive a lot more design work down the road
 
Would be exciting to see a viable eco-system compete with NV at scale across the stack. The customized silicon for different applications might be the key to drive a lot more design work down the road
I see two interesting developments here.
* Even though NVIDIA is doing vertical development of rack level data center AI products, they are also building out a hardware and software ecosystem to fill out their data center product offerings. On the hardware side, a bunch of storage companies including Vast and HPE, are building KV mass storage products for the KV cache tiering. And pretty much all the open source model serving frameworks, like llm-d, SGLang, vLLM, and LLMcache have been enhanced and optimized to run leveraging all the benefits of NVIDIA’s AI factory open source “operating system”, Dynamo. NVIDA has really created an offering that is far different than the closed AI rack-level environments at Amazon, Google, etc.
* I’m wondering if/when we will see other rack-level AI data center hardware system providers, who are leveraging disaggregated inference, like Amazon/Cerebras or some of the Chinese CDPs, adopt Dynamo, kind of like every server environment uses Linux today.

If that happened, we could issue in a new era of hardware diversity and interoperability for AI data center.
 
Last edited:
Hmm....I thought that part of the interview could benefit from some "Domain-specific RL-training", but having only 3 weeks since the start perhaps even for NVIDIA's Vera Rubin too short a period ;)


Gemini March 2026:

1) As of 2026, it is estimated that over 30,000 to 50,000 books have been written specifically focusing on the origins, causes, and "outbreak" of World War I.

2) Origin-Specific Books: There are approximately 20,000 to 25,000 books dedicated primarily to the "Road to War" (1919–1939).

3) As of March 2026, historians and bibliographers estimate that more than 30,000 books have been written about the Vietnam War.

4) As of March 22, 2026, bibliographers estimate that between 8,000 and 12,000 unique books have been written about the 2003 Iraq War (Operation Iraqi Freedom). While this is fewer than the "classics" of the world wars, the literature on Iraq is unique because of its extreme concentration on political decision-making and personal tactical memoirs. The books on the 2003 Iraq War generally fall into four distinct "piles": The "Decision-Making" Pile (~2,500 books): These are the most famous works, focusing on how the war started (WMDs, intelligence, and the Bush administration).

5)
As of March 2026, these are the most highly regarded books for understanding the origins of conflict.


1. The "Big Picture" (Theoretical Classics)​

These books explain the structural reasons why the world is prone to violence, regardless of the specific leaders in power.

  • "Man, the State, and War" by Kenneth Waltz (1959): The "Bible" of IR theory. Waltz breaks down the causes of war into three "images": the individual (human nature), the state (internal government), and the international system (anarchy).
  • "The Origins of War and the Preservation of Peace" by Donald Kagan (1995): A brilliant comparative study. Kagan argues that wars are rarely about just "resources" or "land," but are driven by Honor, Fear, and Interest.
  • "Causes of War" by Stephen Van Evera (1999): Focuses on "offense-defense theory." He argues that war is most likely when leaders perceive that attacking is easier than defending, often leading to "preventative" strikes.

2. The "Human Nature" & Psychology Perspective​

These books look at the biological and cognitive reasons why humans, as a species, resort to organized violence.

  • "Why War?" by Richard Overy (2024): A recent masterpiece that explores the psychological and sociological roots of conflict, questioning if war is an inescapable part of the human condition.
  • "Behave: The Biology of Humans at Our Best and Worst" by Robert Sapolsky (2017): While not strictly a "war book," it is essential for 2026 readers. It explains how our brains categorize the world into "Us vs. Them," which is the neurological foundation for almost every war.
  • "War: How Conflict Shaped Us" by Margaret MacMillan (2020): MacMillan argues that war is not an aberration of civilization, but is actually woven into the very fabric of our languages, technologies, and social structures.

3. Modern & "2026-Relevant" Drivers​

As we enter the mid-2020s, new drivers like AI, resource scarcity, and economic "chokepoints" have changed the calculation of war.

  • "Chokepoints: How the Global Economy Became a Weapon of War" by Edward Fishman (2025): A critical read for understanding 2026 geopolitics. It explains how modern "war" often starts as economic strangulation (sanctions, tech bans) that eventually boils over into kinetic military action.
  • "Why Nations Fight" by Richard Ned Lebow (2010): A data-heavy study that analyzes every war since 1648. He finds that "Security" and "Material Interest" are actually minor motives; most wars are fought for "Standing" (Prestige) and "Revenge."
  • "The End of Everything" by Victor Davis Hanson (2024): A sobering look at "total war" and how prosperous societies can descend into barbarism. It serves as a stark warning in the current 2026 climate of fragile alliances.

............
 
Last edited:
Back
Top