We scan new podcasts and send you the top 5 insights daily.
The rental prices for older NVIDIA GPUs, like the Hopper family and A100s, are increasing. This counterintuitive trend shows demand for AI compute is so far outstripping total supply that even previous-generation hardware is becoming more valuable, highlighting the severity of the GPU crunch.
The demand for AI tokens is growing faster than the supply of GPU infrastructure. This profound imbalance creates a market where not just top-tier AI labs, but also second and third-tier players will likely sell out their capacity. Superior models will command better margins, but the overall resource constraint means even lesser models will find customers.
Unlike typical computer hardware that depreciates rapidly, H100 GPUs are trading above their launch price in secondary markets. This market anomaly, driven by the extreme and sustained compute shortage for AI, completely inverts traditional financial models for hardware assets.
AI software is improving so rapidly that older hardware, like a three-year-old NVIDIA inference chip, is now more profitable than it was when new. This phenomenon, where software advancements outpace hardware depreciation, is unprecedented and makes existing infrastructure increasingly valuable.
The current AI moment is unique because demand outstrips supply so dramatically that even previous-generation chips and models remain valuable. They are perfectly suited for running smaller models for simpler, high-volume applications like voice transcription, creating a broad-based boom across the entire hardware and model stack.
Despite the rapid pace of hardware innovation, the value of older NVIDIA GPUs like the H100 is holding strong. Cloud provider CoreWeave reports these chips are retaining 90-95% of their pricing power over a 5-6 year lifespan because compute demand far outstrips supply.
Contrary to typical hardware depreciation, GPUs like NVIDIA's H100 are becoming more valuable over time. This is because newer, more efficient AI models can generate significantly more output and value on the same hardware, tying the GPU's worth to its utility rather than its age.
Andreessen highlights a unique economic phenomenon: the pace of AI software improvement outstrips hardware depreciation. This means a three-year-old NVIDIA inference chip can generate more revenue today than when it was new, a complete reversal of typical tech hardware value cycles.
Contrary to the assumption that customers only want the latest chips, Nvidia's older H200s are still being heavily purchased. This is because they fit the power profile of older data centers that cannot support the massive energy draw of newer systems, making them a more practical and immediately profitable choice for many operators.
Countering the narrative of rapid burnout, CoreWeave cites historical data showing a nearly 10-year service life for older NVIDIA GPUs (K80) in major clouds. Older chips remain valuable for less intensive tasks, creating a tiered system where new chips handle frontier models and older ones serve established workloads.
Contrary to expectations of easing supply, the GPU shortage has intensified since 2023. With clearer AI business models, mega-customers like OpenAI and Anthropic are spending even more aggressively, creating a fierce bidding war that pushes startups out.