Get your free personalized podcast brief

We scan new podcasts and send you the top 5 insights daily.

In the current supply-constrained market, the most critical question from customers is immediate availability. This allows new chip startups to gain market traction by designing architectures that avoid common bottlenecks like HBM and advanced packaging, even if it means sacrificing peak performance for speed to market.

Related Insights

Specialized AI cloud providers like CoreWeave face a unique business reality where customer demand is robust and assured for the near future. Their primary business challenge and gating factor is not sales or marketing, but their ability to secure the physical supply of high-demand GPUs and other AI chips to service that demand.

While focus is on massive supercomputers for training next-gen models, the real supply chain constraint will be 'inference' chips—the GPUs needed to run models for billions of users. As adoption goes mainstream, demand for everyday AI use will far outstrip the supply of available hardware.

Startups can make big bets on emerging workloads, like LLMs before they were proven. This is a product risk. In contrast, incumbents like Google or NVIDIA must ensure their next chip serves a wide range of existing customers, forcing them to be more conservative and avoid disruptive product bets.

AI software models advance every few months, creating exponential demand. However, the hardware infrastructure like chip fabs operates on two-to-four-year development cycles. This timeline disconnect between software's rapid pace and hardware's slow build-out creates a persistent supply crunch that money alone cannot instantly solve.

While energy supply is a concern, the primary constraint for the AI buildout may be semiconductor fabrication. TSMC, the leading manufacturer, is hesitant to build new fabs to meet the massive demand from hyperscalers, creating a significant bottleneck that could slow down the entire industry.

For a hyperscaler, the main benefit of designing a custom AI chip isn't necessarily superior performance, but gaining control. It allows them to escape the supply allocations dictated by NVIDIA and chart their own course, even if their chip is slightly less performant or more expensive to deploy.

The demand for AI processing power so vastly outstrips supply that it creates a "compute deficit." This forces major AI players to adopt any viable chip solution they can find, including from AMD. It's not about being better than NVIDIA; it's about being available, ensuring a market for second and third-tier suppliers.

The most significant aspect of OpenAI's Jalapeno chip isn't its performance but its rapid nine-month 'tape out' time. This demonstrates that using AI models to design hardware can dramatically shorten development cycles, creating a new competitive advantage based on iteration speed.

While training has been the focus, user experience and revenue happen at inference. OpenAI's massive deal with chip startup Cerebrus is for faster inference, showing that response time is a critical competitive vector that determines if AI becomes utility infrastructure or remains a novelty.

The current 2-3 year chip design cycle is a major bottleneck for AI progress, as hardware is always chasing outdated software needs. By using AI to slash this timeline, companies can enable a massive expansion of custom chips, optimizing performance for many at-scale software workloads.

Immediate Chip Availability Trumps Performance for New AI Hardware Startups | RiffOn