We scan new podcasts and send you the top 5 insights daily.
Trading compute futures requires more than tracking hardware supply. Software advancements, like model compression and optimization, can dramatically alter the utility and demand for older chips. A trader must understand how the software layer can make legacy hardware more capable over time, fundamentally changing supply-demand dynamics.
Separating inference into "prefill" (memory-bound) and "decode" (bandwidth-bound) tasks is a game-changer for hardware longevity. It allows older GPUs to be used for prefill tasks indefinitely, extending their useful economic life from 3-4 years to 10-15 years, a boon for data centers and their financiers.
AI software is improving so rapidly that older hardware, like a three-year-old NVIDIA inference chip, is now more profitable than it was when new. This phenomenon, where software advancements outpace hardware depreciation, is unprecedented and makes existing infrastructure increasingly valuable.
New AI models are designed to perform well on available, dominant hardware like NVIDIA's GPUs. This creates a self-reinforcing cycle where the incumbent hardware dictates which model architectures succeed, making it difficult for superior but incompatible chip designs to gain traction.
Despite the rapid pace of hardware innovation, the value of older NVIDIA GPUs like the H100 is holding strong. Cloud provider CoreWeave reports these chips are retaining 90-95% of their pricing power over a 5-6 year lifespan because compute demand far outstrips supply.
Contrary to typical hardware depreciation, GPUs like NVIDIA's H100 are becoming more valuable over time. This is because newer, more efficient AI models can generate significantly more output and value on the same hardware, tying the GPU's worth to its utility rather than its age.
While the industry standard is a six-year depreciation for data center hardware, analyst Dylan Patel warns this is risky for GPUs. Rapid annual performance gains from new models could render older chips economically useless long before they physically fail.
Andreessen highlights a unique economic phenomenon: the pace of AI software improvement outstrips hardware depreciation. This means a three-year-old NVIDIA inference chip can generate more revenue today than when it was new, a complete reversal of typical tech hardware value cycles.
Contrary to the belief that AI hardware becomes obsolete quickly, older GPUs like A100s will have a long depreciable life. As companies optimize costs, they'll use model routing to send simple queries to older, cheaper hardware, extending its utility for six to eight years.
Contrary to the assumption that customers only want the latest chips, Nvidia's older H200s are still being heavily purchased. This is because they fit the power profile of older data centers that cannot support the massive energy draw of newer systems, making them a more practical and immediately profitable choice for many operators.
A futures market for GPU compute is not viable yet because the product isn't fungible. The performance of an identical H100 chip varies significantly between cloud providers based on their proprietary software stack and operational excellence, measured by metrics like "goodput" and "MFUs."