We scan new podcasts and send you the top 5 insights daily.
The current compute crunch isn't just a supply issue. It's because new AI models are so much more capable that they unlock a total addressable market (TAM) of valuable tasks that grows exponentially, far outpacing the linear or geometric growth of compute supply.
The demand for AI tokens is growing faster than the supply of GPU infrastructure. This profound imbalance creates a market where not just top-tier AI labs, but also second and third-tier players will likely sell out their capacity. Superior models will command better margins, but the overall resource constraint means even lesser models will find customers.
The industry is fixated on the GPU shortage, but the proliferation of AI agents will create massive demand for general-purpose compute, leading to a CPU bottleneck. As millions of agents perform tasks, the availability of CPU cores—not just specialized processors—will become the primary constraint on growth for compute providers.
AI model capabilities follow a predictable, non-linear scaling law: increasing training compute by 10x roughly doubles a model's capabilities. This exponential relationship, rather than an incremental one, is what will drive underappreciated and disruptive advancements across many industries.
Counter-intuitively, as AI models become more efficient, the total consumption of compute resources will rise. This economic principle, Jevons Paradox, states that increased efficiency lowers costs, which in turn unlocks more applications and drives greater overall demand.
The relationship between computing power and AI model capability is not linear. According to established 'scaling laws,' a tenfold increase in the compute used for training large language models (LLMs) results in roughly a doubling of the model's capabilities, highlighting the immense resources required for incremental progress.
AI software models advance every few months, creating exponential demand. However, the hardware infrastructure like chip fabs operates on two-to-four-year development cycles. This timeline disconnect between software's rapid pace and hardware's slow build-out creates a persistent supply crunch that money alone cannot instantly solve.
Despite massive infrastructure investments, Greg Brockman believes demand for AI will consistently outstrip supply, leading to a long-term state of "compute scarcity." As AI tackles bigger problems like curing diseases, the appetite for computation will prove effectively infinite, making it a chronically scarce resource.
The focus in AI has evolved from rapid software capability gains to the physical constraints of its adoption. The demand for compute power is expected to significantly outstrip supply, making infrastructure—not algorithms—the defining bottleneck for future growth.
The value unlocked by frontier AI models is expanding so rapidly that there isn't enough hardware to meet demand. This scarcity ensures that not just the top lab (like OpenAI), but also second and third-tier competitors, will operate at full capacity with strong margins.
AI's computational needs are not just from initial training. They compound exponentially due to post-training (reinforcement learning) and inference (multi-step reasoning), creating a much larger demand profile than previously understood and driving a billion-X increase in compute.