Get your free personalized podcast brief

We scan new podcasts and send you the top 5 insights daily.

Huawei is shifting from shrinking transistors (Moore's Law) to optimizing data flow via advanced chip stacking and interconnects. This "tau scaling law" is an innovative workaround to physical limits, aiming to create competitive AI compute power without access to the most advanced manufacturing processes.

Related Insights

Jensen Huang emphasizes that Moore's Law is dead as a primary performance driver. The 50x gain from Hopper to Blackwell came overwhelmingly from architecture and computer science breakthroughs, with raw transistor improvements providing only marginal benefit.

The next wave of AI silicon may pivot from today's compute-heavy architectures to memory-centric ones optimized for inference. This fundamental shift would allow high-performance chips to be produced on older, more accessible 7-14nm manufacturing nodes, disrupting the current dependency on cutting-edge fabs.

Faced with restrictions on advanced NVIDIA chips, China is leveraging its electricity advantage to run vast numbers of older-generation GPUs in parallel. This hardware constraint forces a focus on software, with Chinese labs developing sophisticated algorithms and compute methods to leapfrog the hardware deficit.

While NVIDIA's GPUs have been the primary AI constraint, the bottleneck is now moving to other essential subsystems. Memory, networking interconnects, and power management are emerging as the next critical choke points, signaling a new wave of investment opportunities in the hardware stack beyond core compute.

With Moore's Law over, computing progress now depends on networking vast numbers of chips. Lightmatter's photonic interconnects overcome the distance limits of copper cables, allowing thousands of GPUs kilometers apart to function as a single, cohesive supercomputer. This creates a new scaling vector for AI performance.

Faced with limited access to top-tier hardware, Chinese AI companies have been forced to innovate on model architecture to compete. They've developed superior techniques in memory management and multi-token prediction, making their models highly efficient and formidable competitors despite hardware constraints.

China is compensating for its deficit in cutting-edge semiconductors by pursuing an asymmetric strategy. It focuses on massive 'superclusters' of less advanced domestic chips and creating hyper-efficient, open-source AI models. This approach prioritizes widespread, low-cost adoption over chasing the absolute peak of performance like the US.

Unlike GPUs using slow, dense memory, Cerebras's wafer-sized chip leverages its vast surface area to accommodate faster, less-dense memory. This design sidesteps memory bottlenecks, achieving speeds up to 15 times faster than the fastest GPUs for AI tasks.

The long-term ability to scale AI compute is not constrained by power or data centers, but by the production of advanced semiconductors. The ultimate chokepoint is ASML, the world's only manufacturer of EUV lithography tools, which can only produce just over 100 units annually by 2030.

As AI models become commodities, the underlying hardware's speed and efficiency for inference is the true differentiator. The company that powers the fastest AI experiences will win, similar to how Google won with fast search, because there is no market for slow AI.