The massive spike in demand for AI tokens is a direct result of the shift from users performing simple, assisted tasks to deploying autonomous agents. A single individual can now consume billions of tokens via agents running on their behalf, overwhelming the current supply of compute.
Beyond simple security concerns, the US government is poised to use its control over frontier AI model deployment to pursue broader strategic interests. Access could be withheld from allies to gain leverage in unrelated negotiations, such as trade deals, turning AI into a tool of foreign policy.
Frontier AI labs are restricting API access not just for security, but to prevent competitors from using 'distillation' to create cheap copies of their models. This practice makes it impossible to recoup massive R&D investments, forcing a move towards more restrictive, geopolitically motivated access.
Politicians aiming for equitable AI distribution by proposing moratoriums on data center construction would ironically increase inequality. This policy would create more compute scarcity, drive up costs, and ration access, ensuring only wealthy individuals and large corporations could afford frontier AI.
The tech industry wrongly compares AI to software, which has near-zero marginal costs for new users. In reality, providing access to frontier AI models is a zero-sum game during compute crunches because of immense computational requirements. Servicing another user is expensive, leading to rationed access.
To avoid being cut off from frontier AI, non-US countries can offer US hyperscalers incentives like subsidized energy for building data centers locally. In return, they can demand contractual guarantees for frontier model access, creating leverage against future US government-imposed restrictions.
