The cost for a given level of AI capability has decreased by a factor of 100 in just one year. This radical deflation in the price of intelligence requires a complete rethinking of business models and future strategies, as intelligence becomes an abundant, cheap commodity.
The massive capital expenditure by hyperscalers on AI will likely create an oversupply of capacity. This will crash prices, creating a golden opportunity for a new generation of companies to build innovative applications on cheap AI, much like Amazon utilized the cheap bandwidth left after the dot-com bust.
The cost for a given level of AI performance halves every 3.5 months—a rate 10 times faster than Moore's Law. This exponential improvement means entrepreneurs should pursue ideas that seem financially or computationally unfeasible today, as they will likely become practical within 12-24 months.
A primary risk for major AI infrastructure investments is not just competition, but rapidly falling inference costs. As models become efficient enough to run on cheaper hardware, the economic justification for massive, multi-billion dollar investments in complex, high-end GPU clusters could be undermined, stranding capital.
Models like Gemini 3 Flash show a key trend: making frontier intelligence faster, cheaper, and more efficient. The trajectory is for today's state-of-the-art models to become 10x cheaper within a year, enabling widespread, low-latency, and on-device deployment.
Contrary to typical corporate fears, Microsoft's AI lead views the rapid commoditization of AI models and resulting price wars as a positive outcome for humanity. The ultimate goal is to make intelligence abundant and near-zero cost, with Microsoft's business model focused on value-added software integrations.
While the per-unit cost of using AI has plummeted, total enterprise spending has soared. This is a classic example of the Jevons paradox: efficiency gains and lower prices are unlocking entirely new use cases that were previously uneconomical, leading to a net increase in overall consumption and total expenditure.
Even for complex, multi-hour tasks requiring millions of tokens, current AI agents are at least an order of magnitude cheaper than paying a human with relevant expertise. This significant cost advantage suggests that economic viability will not be a near-term bottleneck for deploying AI on increasingly sophisticated tasks.
The internet leveled the playing field by making information accessible. AI will do the same for intelligence, making expertise a commodity. The new human differentiator will be the creativity and ability to define and solve novel, previously un-articulable problems.
As AI gets exponentially smarter, it will solve major problems in power, chip efficiency, and labor, driving down costs across the economy. This extreme efficiency creates a powerful deflationary force, which is a greater long-term macroeconomic risk than the current AI investment bubble popping.
Arvind Krishna forecasts a 1000x drop in AI compute costs over five years. This won't just come from better chips (a 10x gain). It will be compounded by new processor architectures (another 10x) and major software optimizations like model compression and quantization (a final 10x).