We scan new podcasts and send you the top 5 insights daily.
The massive growth in AI token consumption isn't a sign of waste but of ambition. While the cost per "unit of intelligence" is decreasing, companies are immediately applying that efficiency to solve exponentially harder problems. Our appetite for more capable AI is growing faster than the cost is falling, leading to sustained, exponential spending.
While the cost-per-token is decreasing as models become more efficient, this efficiency gain drives a massive increase in new use cases and overall consumption. This economic principle, Jevons Paradox, explains why total enterprise spending on model inference is skyrocketing, even as the unit cost falls.
While the unit cost of AI inference has plummeted 50x, overall spending on AI is surging. This is a textbook example of Jevons paradox, where radical efficiency gains lead to increased consumption and higher total expenditure as new applications become economically viable.
While the cost per AI query drops, companies find more complex, compute-intensive uses for it. This elasticity of demand means total AI spending becomes a significant and variable operational expense, similar to a utility bill, rather than a predictable software cost.
A paradox exists where the cost for a fixed level of AI capability (e.g., GPT-4 level) has dropped 100-1000x. However, overall enterprise spend is increasing because applications now use frontier models with massive contexts and multi-step agentic workflows, creating huge multipliers on token usage that drive up total costs.
While the per-unit cost of using AI has plummeted, total enterprise spending has soared. This is a classic example of the Jevons paradox: efficiency gains and lower prices are unlocking entirely new use cases that were previously uneconomical, leading to a net increase in overall consumption and total expenditure.
The cost of AI, priced in "tokens by the drink," is falling dramatically. All inputs are on a downward cost curve, leading to a hyper-deflationary effect on the price of intelligence. This, in turn, fuels massive demand elasticity as more use cases become economically viable.
While the cost to achieve a fixed capability level (e.g., GPT-4 at launch) has dropped over 100x, overall enterprise spending is increasing. This paradox is explained by powerful multipliers: demand for frontier models, longer reasoning chains, and multi-step agentic workflows that consume exponentially more tokens.
Even as enterprises optimize AI spending for better ROI, overall spend will continue to grow rapidly. The adoption curve for new use cases and new enterprises is so steep that it overwhelms any efficiency gains from optimization, ensuring continued growth for model providers.
While the cost for GPT-4 level intelligence has dropped over 100x, total enterprise AI spend is rising. This is driven by multipliers: using larger frontier models for harder tasks, reasoning-heavy workflows that consume more tokens, and complex, multi-turn agentic systems.
While hardware gets cheaper (Moore's Law), the competitive pressure to release superior AI models leads to exponentially larger and more complex systems. This results in a higher number of "tokens burned" per query, making the cost of delivering a useful answer actually increase with each new generation.