We scan new podcasts and send you the top 5 insights daily.
The current AI boom focuses on GPUs for "thinking" (Gen AI). The next phase, "Agentic AI" for "doing," will rely heavily on CPUs for task orchestration and memory for context, creating new investment opportunities in this previously overshadowed hardware.
Meta's multi-billion dollar deal to rent Amazon's Graviton 5 CPUs, not just GPUs, signals a potential architectural shift for AI. This move suggests that CPU architecture could be more efficient or cost-effective for agentic workloads, challenging the conventional wisdom that GPUs are the only viable hardware for scaling AI applications.
While GPUs dominate AI hardware discussions, the proliferation of AI agents is causing a significant, often overlooked, CPU shortage. Agents rely on CPUs for web queries, data processing, and other tasks needed to feed GPUs, straining existing infrastructure and driving new demand for companies like Arm and Intel.
AI's evolution from training-heavy (GPU-dominant) to inference- and agent-heavy (CPU-intensive) workflows could invert the traditional data center chip ratio. This represents a seismic shift, creating a massive tailwind for CPU manufacturers like Intel.
The focus on GPUs for AI overlooks a critical bottleneck: CPU shortages. AI agents require massive CPU power for non-GPU tasks like web queries and data prep. This demand is straining existing infrastructure and creating new market opportunities for CPU makers like ARM.
While GPUs are key for model training, the next AI wave of autonomous agents relies more on CPUs. The task of controlling and orchestrating multiple agents and tool calls is fundamentally a CPU-based process. This is creating a new hardware bottleneck and shifting focus to CPU manufacturers.
The focus on GPUs for AI overlooks a critical bottleneck: a growing CPU shortage. AI agents rely heavily on CPUs for orchestration tasks like tool calls, database queries, and web searches. This hidden demand is causing hyperscalers to lock in multi-year CPU supply contracts.
The AI compute narrative is shifting from GPUs for training to CPUs for agentic workflows. This creates a massive new demand for processors to orchestrate tasks, manage inference, and coordinate data centers, directly fueling Intel's comeback and flipping the expected CPU-to-GPU ratio.
The AI narrative has focused on GPUs for training, but the proliferation of AI agents for task execution is creating a massive, overlooked demand for CPUs. This shift to inference and orchestration is reversing Intel's recent decline.
After the current memory crunch, the next AI infrastructure bottleneck will be CPU and networking. The complex orchestration required for emerging agentic AI systems will strain these resources, a trend already visible in companies like Fastly seeing demand spikes just for workload orchestration.
While GPUs get the headlines, AI expert Tae Kim warns of a major coming CPU shortage. The complex orchestration, tool calls, and database queries required by AI agents are creating huge demand for CPU cores, a trend confirmed by major chipmakers and hyperscalers.