Currently, 80% of AI usage is human-initiated, but a crossover is expected this year where automated, background agentic tasks will dominate token consumption. This shift will decouple AI usage from human attention and create truly unbounded demand for inference, fundamentally changing the market.
The shift from human-in-the-loop AI use to autonomous agents is causing an explosion in API calls. An agent can hit an API over 100 times a day for a single task, compared with about 10 calls for a human, contributing to a reported 3000% increase in token consumption and massive revenue growth for AI providers.
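The figures in this insight can be sanity-checked with a little arithmetic. The sketch below is illustrative only: it assumes "over 100" calls is taken as exactly 100, that a 3000% increase means 31 times the baseline, and that tokens per call is the only other factor in play. The function and variable names are hypothetical, not from the source.

```python
def percent_increase(old: float, new: float) -> float:
    """Percentage increase when going from `old` to `new`."""
    return (new - old) / old * 100.0


def multiplier_from_percent(pct: float) -> float:
    """Convert a percentage increase into a 'times the baseline' multiplier."""
    return 1.0 + pct / 100.0


# Call counts quoted in the insight (assumption: "over 100" treated as 100).
human_calls_per_day = 10
agent_calls_per_day = 100

# Growth explained by call volume alone.
call_multiplier = agent_calls_per_day / human_calls_per_day
call_increase_pct = percent_increase(human_calls_per_day, agent_calls_per_day)

# Growth implied by the quoted 3000% increase in token consumption.
claimed_multiplier = multiplier_from_percent(3000)

# Whatever call volume does not explain must come from tokens per call
# (assumption: no other factors contribute).
implied_tokens_per_call_ratio = claimed_multiplier / call_multiplier

print(f"Calls alone: {call_multiplier:.0f}x ({call_increase_pct:.0f}% increase)")
print(f"Quoted 3000% increase: {claimed_multiplier:.0f}x the baseline")
print(f"Implied tokens-per-call ratio: {implied_tokens_per_call_ratio:.1f}x")
```

Under these assumptions, call volume alone accounts for a 10x (900%) rise, so the quoted 3000% figure would also require each agent call to use roughly three times as many tokens as a human call.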
Contrary to expectations of falling AI costs, the move from simple chatbots to complex, multi-step agentic systems is causing an explosion in token usage. A single user can trigger hundreds of agents, making expensive frontier models economically unsustainable for many application-layer companies.
Contrary to the view that AI token intensity will drop after the initial coding boom, the move from simple queries to autonomous 'agentic' workflows will cause an order-of-magnitude (10x) increase in token usage per task. This applies across all knowledge-based jobs, ensuring sustained and explosive demand for compute.
The fundamental model of AI use is changing. It's moving from 'assisted' AI, which helps humans with their tasks, to 'agentic' AI, where autonomous systems perform the tasks themselves. This paradigm shift requires new methods for adoption and management, and for measuring success in 'tokens' rather than 'seats'.
Features like Codex's '/goal' create a new paradigm of persistent, autonomous agents that can work on a task for days. This shift from active human prompting to unattended 24/7 AI work is expected to cause an exponential increase in token consumption and compute demand, reinforcing the infrastructure boom.
The shift from simple chatbots (one user request, one API call) to agentic AI systems will decouple inference requests from direct user actions. A single user request could trigger hundreds or thousands of automated model calls, leading to an exponential increase in compute demand and cost.
The massive spike in demand for AI tokens is a direct result of the shift from users performing simple, assisted tasks to deploying autonomous agents. A single individual can now consume billions of tokens via agents running on their behalf, overwhelming the current supply of compute.
The next wave of AI adoption involves 'agentic' workflows, where AI performs complex tasks autonomously. This shift from simple queries to agentic use is expected to increase token consumption by approximately 10x per task. This will drive a massive explosion in compute demand across all knowledge-work industries, not just coding.
The largest driver of future energy consumption for AI won't be human-initiated queries on chatbots. Instead, it will be the massive, continuous "machine-to-machine" traffic generated by autonomous AI agents performing tasks, which will ultimately swamp human-AI interaction and create a runaway demand for compute power.
The transition from chatbots to autonomous 'agentic' AI represents a fundamental step-change. These agents, which execute complex tasks independently, have already increased the demand for computational power by 1000x, creating a massive, ongoing need for new infrastructure and hardware.