The Shift to Agentic AI Requires a 10,000x Increase in Computation

Related Insights

The Entire History of Deep Learning Is a Story of Scaling Compute

The progression from early neural networks to today's massive models is fundamentally driven by the exponential increase in available computational power, from the initial move to GPUs to today's million-fold increases in training capacity on a single model.

After LLMs: Spatial Intelligence and World Models — Fei-Fei Li & Justin Johnson, World Labs

Latent Space: The AI Engineer Podcast·5 months ago

AI Scaling Laws Dictate a 10x Compute Increase Yields Only a 2x Capability Boost

The relationship between computing power and AI model capability is not linear. According to established 'scaling laws,' a tenfold increase in the compute used for training large language models (LLMs) results in roughly a doubling of the model's capabilities, highlighting the immense resources required for incremental progress.

AI’s Tangible Wins and Disruption

Thoughts on the Market·2 months ago

AI's 'Thinking Time' Boost Was a Costly, One-Off Trick, Not a Sustainable Trend

Over two-thirds of reasoning models' performance gains came from massively increasing their 'thinking time' (inference scaling). This was a one-time jump from a zero baseline. Further gains are prohibitively expensive due to compute limitations, meaning this is not a repeatable source of progress.

What the hell happened with AGI timelines in 2025?

80,000 Hours Podcast·3 months ago

Visual AI Models (VLMs) Will Require Up to 1000x More Compute Than Today's LLMs

Today's AI is largely text-based (LLMs). The next phase involves Visual Language Models (VLMs) that interpret and interact with the physical world for robotics and surgery. This transition requires an exponential, 50-1000x increase in compute power, underwriting the long-term AI infrastructure build-out.

AI Is Ushering in an Entirely New Economic Paradigm | Jordi Visser

Forward Guidance·5 months ago

Agentic AI Will Cause an Explosion in Inference Demand

The shift from simple chatbots (one user request, one API call) to agentic AI systems will decouple inference requests from direct user actions. A single user request could trigger hundreds or thousands of automated model calls, leading to an exponential increase in compute demand and cost.

Inference engineering and the real-world deployment of LLMs, with Philip Kiely

Complex Systems with Patrick McKenzie (patio11)·2 months ago

AI's Primary Constraint Has Shifted from Software Capabilities to Physical Infrastructure

The focus in AI has evolved from rapid software capability gains to the physical constraints of its adoption. The demand for compute power is expected to significantly outstrip supply, making infrastructure—not algorithms—the defining bottleneck for future growth.

Four Key Themes Shaping Markets in 2026

Thoughts on the Market·3 months ago

AI Inference Is Getting Harder Due to Scale, Diversity, and Agentic Workloads

Contrary to the idea that infrastructure problems get commoditized, AI inference is growing more complex. This is driven by three factors: (1) increasing model scale (multi-trillion parameters), (2) greater diversity in model architectures and hardware, and (3) the shift to agentic systems that require managing long-lived, unpredictable state.

Inferact: Building the Infrastructure That Runs Modern AI

The a16z Show·3 months ago

NVIDIA CEO: AI Compute Demand Is Driven by Three Compounding Scaling Laws, Not One

AI's computational needs are not just from initial training. They compound exponentially due to post-training (reinforcement learning) and inference (multi-step reasoning), creating a much larger demand profile than previously understood and driving a billion-X increase in compute.

NVIDIA: OpenAI, Future of Compute, and the American Dream | BG2 w/ Bill Gurley and Brad Gerstner

BG2Pod with Brad Gerstner and Bill Gurley·7 months ago

NVIDIA's Jensen Huang Predicts a Billion-Fold Decrease in AI Token Generation Cost Over 10 Years

Countering the narrative of insurmountable training costs, Jensen Huang argues that architectural, algorithmic, and computing stack innovations are driving down AI costs far faster than Moore's Law. He predicts a billion-fold cost reduction for token generation within a decade.

NVIDIA’s Jensen Huang on Reasoning Models, Robotics, and Refuting the “AI Bubble” Narrative

No Priors: Artificial Intelligence | Technology | Startups·4 months ago

'Agentic AI' That Executes Tasks Is the Next Transformational Leap Beyond ChatGPT

The next wave of AI is 'agentic,' meaning it can control a computer to execute commands and complete tasks, not just generate responses to prompts. This profound shift automates workflows like coding and administrative tasks, freeing humans for high-level creative and strategic work.

MacroVoices #523 Jim Bianco: Energy, FED & Economy in the wake of Iran conflict

Macro Voices·2 months ago

Get your free personalized podcast brief

Related Insights