Contrary to the assumption that customers only want the latest chips, NVIDIA's older H200s are still being heavily purchased. This is because they fit the power profile of older data centers that cannot support the massive energy draw of newer systems, making them a more practical and immediately profitable choice for many operators.
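To see why the power profile matters, here is a rough back-of-the-envelope comparison. The figures (a ~10 kW H200 server, a ~120 kW Blackwell NVL72 rack, a ~15 kW legacy rack budget) are approximations used for illustration, not quoted specs.

```python
# Back-of-the-envelope rack power comparison (rough public approximations,
# not vendor-confirmed specs).

H200_SERVER_KW = 10.0        # ~8x 700W H200 GPUs + CPUs, NICs, fans per server
GB200_NVL72_RACK_KW = 120.0  # commonly cited draw of a full Blackwell NVL72 rack
LEGACY_RACK_BUDGET_KW = 15.0 # typical power budget per rack in an older facility

# How many H200 servers fit one legacy rack's power budget?
h200_servers_per_rack = LEGACY_RACK_BUDGET_KW // H200_SERVER_KW
print(f"H200 servers per legacy rack: {int(h200_servers_per_rack)}")

# A full GB200 rack overshoots the legacy budget by a wide margin.
print(f"GB200 rack vs legacy budget: {GB200_NVL72_RACK_KW / LEGACY_RACK_BUDGET_KW:.0f}x over")
```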
The standard for measuring large compute deals has shifted from number of GPUs to gigawatts of power. This provides a normalized, apples-to-apples comparison across different chip generations and manufacturers, acknowledging that energy is the primary bottleneck for building AI data centers.
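A minimal sketch of that normalization, assuming illustrative per-GPU "all-in" power and PUE values (none of these figures come from the episode):

```python
# Minimal sketch: normalizing a GPU deal to facility power (GW).
# Per-GPU "all-in" watts and PUE are illustrative assumptions, not quoted figures.

def deal_to_gigawatts(num_gpus: int, all_in_watts_per_gpu: float, pue: float = 1.2) -> float:
    """IT load per GPU (chip + host share + networking) scaled by facility PUE."""
    return num_gpus * all_in_watts_per_gpu * pue / 1e9

# Two hypothetical deals that look different in GPU count but similar in power:
print(f"{deal_to_gigawatts(700_000, 1_200):.2f} GW")  # older-generation GPUs
print(f"{deal_to_gigawatts(350_000, 2_400):.2f} GW")  # newer, hungrier GPUs
```

Both hypothetical deals land at roughly 1 GW, which is exactly why power has become the common denominator across chip generations.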
The performance gains from NVIDIA's Hopper to Blackwell GPUs come from increased size and power, not efficiency. This signals a potential scaling limit, creating an opportunity for radically new hardware primitives and neural network architectures beyond today's matrix-multiplication-centric models.
When power (watts) is the primary constraint for data centers, the total cost of compute becomes secondary. The crucial metric is performance-per-watt. This gives a massive pricing advantage to the most efficient chipmakers, as customers will pay anything for hardware that maximizes output from their limited power budget.
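Here is a toy calculation of that perf-per-watt lens, with hypothetical chips and numbers, showing how a fixed power budget makes the more efficient part win on total output even at a higher unit price:

```python
# Minimal sketch of the perf-per-watt lens: with a fixed site power budget,
# throughput per watt (not sticker price) determines total output.
# Chip names and numbers are hypothetical.

SITE_BUDGET_MW = 100.0  # fixed power envelope for the data center

chips = {
    # name: (tokens/sec per chip, watts per chip, price per chip in $)
    "chip_a": (500.0, 700.0, 30_000),
    "chip_b": (900.0, 1_200.0, 45_000),
}

for name, (tps, watts, price) in chips.items():
    n_chips = SITE_BUDGET_MW * 1e6 / watts  # how many chips the budget can power
    site_throughput = n_chips * tps         # total tokens/sec from the site
    print(f"{name}: {tps / watts:.2f} tok/s/W -> {site_throughput / 1e6:.1f}M tok/s "
          f"for ${n_chips * price / 1e9:.1f}B in chips")
```

In this toy example the pricier, more efficient chip still delivers more site-wide throughput for less total spend, which is the pricing power the episode describes.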
The massive investment in data centers isn't just a bet on today's models. As AI becomes more efficient, smaller yet powerful models will be deployed on older hardware. This extends the serviceable life and economic return of current infrastructure, ensuring today's data centers will still generate value years from now.
Hyperscalers face a strategic challenge: building massive data centers with current chips (e.g., H100) risks rapid depreciation as far more efficient chips (e.g., GB200) are imminent. This creates a 'pause' as they balance fulfilling current demand against future-proofing their costly infrastructure.
The intense power demands of AI inference will push data centers to adopt the "heterogeneous compute" model from mobile phones. Instead of a single GPU architecture, data centers will use disaggregated, specialized chips for different tasks to maximize power efficiency, creating a post-GPU era.
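As a rough illustration of what that disaggregation could look like, here is a toy dispatcher that routes inference phases to whichever specialized part is cheapest in joules per token; the accelerator categories and energy numbers are invented for the sketch:

```python
# Toy sketch of the heterogeneous-compute idea: route each inference phase
# to the specialized accelerator with the lowest energy cost per token.
# Accelerator types and energy figures are hypothetical.

ENERGY_PER_TOKEN_J = {
    # phase -> {accelerator: joules per token}
    "prefill":   {"general_gpu": 0.8, "matmul_asic": 0.3},
    "decode":    {"general_gpu": 0.5, "memory_optimized": 0.2},
    "embedding": {"general_gpu": 0.4, "low_power_npu": 0.1},
}

def cheapest_accelerator(phase: str) -> tuple[str, float]:
    """Pick the accelerator with the lowest energy cost for a given phase."""
    options = ENERGY_PER_TOKEN_J[phase]
    best = min(options, key=options.get)
    return best, options[best]

for phase in ENERGY_PER_TOKEN_J:
    acc, joules = cheapest_accelerator(phase)
    print(f"{phase}: route to {acc} ({joules} J/token)")
```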
Countering the narrative of rapid burnout, CoreWeave cites historical data showing a nearly 10-year service life for older NVIDIA GPUs (K80) in major clouds. Older chips remain valuable for less intensive tasks, creating a tiered system where new chips handle frontier models and older ones serve established workloads.
Crusoe Cloud's CEO warns of an impending power density crisis. Today's racks are ~130kW, but NVIDIA's future "Vera Rubin Ultra" chips will demand 600kW per rack—the power of a small town. This massive leap will necessitate fundamental changes in cooling and electrical engineering for all AI infrastructure.
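Some quick arithmetic on that jump (the ~1.2 kW average-household figure is an assumption used for scale, not something quoted in the episode):

```python
# Quick arithmetic on the rack-density jump described above.
# The average US household draw is an assumption for scale, not a quoted stat.

TODAY_RACK_KW = 130.0
RUBIN_ULTRA_RACK_KW = 600.0
AVG_US_HOME_KW = 1.2  # rough average continuous household draw (assumption)

print(f"Per-rack jump: {RUBIN_ULTRA_RACK_KW / TODAY_RACK_KW:.1f}x")
print(f"One 600 kW rack ~= {RUBIN_ULTRA_RACK_KW / AVG_US_HOME_KW:.0f} homes of continuous demand")

# Nearly all of that power leaves the rack as heat, so cooling must scale with it.
racks = 1_000  # a hypothetical single hall
print(f"A {racks}-rack hall: {racks * RUBIN_ULTRA_RACK_KW / 1e3:.0f} MW of heat to reject")
```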
Efficiency gains in new chips like NVIDIA's H200 don't lower overall energy use. Instead, developers leverage the added performance to build larger, more complex models. This "ambition creep" negates chip-level savings by increasing training times and data movement, ultimately driving total system power consumption higher.
Accusations that hyperscalers "cook the books" by extending GPU depreciation misunderstand hardware lifecycles. Older chips remain at full utilization for less demanding tasks. High operational costs (power, cooling) provide a natural economic incentive to retire genuinely unprofitable hardware, invalidating claims of artificial earnings boosts.
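To make the accounting question concrete, here is a minimal straight-line depreciation sketch on a hypothetical $10B fleet, comparing a 4-year and a 6-year useful-life assumption:

```python
# Minimal sketch of why depreciation schedules matter: straight-line expense on a
# hypothetical $10B GPU fleet under a 4-year vs a 6-year useful-life assumption.

FLEET_COST_B = 10.0  # hypothetical fleet purchase price, $ billions

def annual_depreciation(cost_b: float, useful_life_years: int) -> float:
    """Straight-line annual depreciation expense, $ billions."""
    return cost_b / useful_life_years

short = annual_depreciation(FLEET_COST_B, 4)  # expense under a 4-year life
long = annual_depreciation(FLEET_COST_B, 6)   # expense under a 6-year life

# Extending useful life defers expense (boosting near-term earnings); the real
# question is whether the hardware keeps earning revenue for those extra years.
print(f"4-year life: ${short:.2f}B/yr; 6-year life: ${long:.2f}B/yr; "
      f"near-term earnings delta: ${short - long:.2f}B/yr")
```

The "cooked books" charge only holds if the extra years of assumed life are fictional; if older GPUs genuinely stay busy and profitable, the longer schedule simply reflects reality.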