We scan new podcasts and send you the top 5 insights daily.
A futures market for GPU compute is not viable yet because the product isn't fungible. The performance of an identical H100 chip varies significantly between cloud providers based on their proprietary software stack and operational excellence, measured by metrics like "goodput" and "MFUs."
The demand for AI tokens is growing faster than the supply of GPU infrastructure. This profound imbalance creates a market where not just top-tier AI labs, but also second and third-tier players will likely sell out their capacity. Superior models will command better margins, but the overall resource constraint means even lesser models will find customers.
The potential for a futures market in any asset, from onions to AI compute, depends on two factors. The product must be homogenous enough to standardize into a contract, and its price must be volatile enough to create demand for hedging from both producers and consumers.
Unlike typical computer hardware that depreciates rapidly, H100 GPUs are trading above their launch price in secondary markets. This market anomaly, driven by the extreme and sustained compute shortage for AI, completely inverts traditional financial models for hardware assets.
The widely discussed GPU supply crunch is only half the problem. There's a severe shortage of suppliers who can operate data centers with the high reliability and SLAs required for mission-critical inference. Out of many providers, only a handful meet the "gold tier" for operational excellence.
Despite the rapid pace of hardware innovation, the value of older NVIDIA GPUs like the H100 is holding strong. Cloud provider CoreWeave reports these chips are retaining 90-95% of their pricing power over a 5-6 year lifespan because compute demand far outstrips supply.
Previous attempts at tech futures like DRAM failed because prices only moved in one predictable direction: down. In contrast, the market for GPU compute will experience cycles of high demand and excess supply. This two-way volatility creates genuine hedging needs, making a futures market viable and necessary.
Contrary to typical hardware depreciation, GPUs like NVIDIA's H100 are becoming more valuable over time. This is because newer, more efficient AI models can generate significantly more output and value on the same hardware, tying the GPU's worth to its utility rather than its age.
The head of AI at Hudson River Trading highlights a practical barrier to creating a financial market for compute. For serious training, the minimum "lot size" is thousands of GPUs, not a small, fungible unit. This makes it difficult to standardize a contract and create liquidity, unlike commodities with smaller, interchangeable units.
A liquid futures market for GPU compute would create price transparency, threatening the business models of hyperscale cloud providers. These giants benefit from opaque, bundled pricing and controlling supply. They will naturally resist the standardization and transparency that an open futures market would bring.
Anthropic mitigates supply chain risk and optimizes cost by investing heavily in the ability to use NVIDIA, Google, and Amazon chips interchangeably for model development, internal use, and customer service. This orchestration layer is a key competitive advantage.