Get your free personalized podcast brief

We scan new podcasts and send you the top 5 insights daily.

OpenAI's CFO highlights a key dynamic: the cost of raw compute inputs (power, memory) is rising, but the cost to produce a unit of intelligence is falling dramatically, citing a 97% cost reduction from GPT-4 to 5.4. This deflationary curve is central to their financial modeling, allowing them to price future capacity and value creation more aggressively.

Related Insights

AI companies operate under the assumption that LLM prices will trend towards zero. This strategic bet means they intentionally de-prioritize heavy investment in cost optimization today, focusing instead on capturing the market and building features, confident that future, cheaper models will solve their margin problems for them.

The cost for a given level of AI capability has decreased by a factor of 100 in just one year. This radical deflation in the price of intelligence requires a complete rethinking of business models and future strategies, as intelligence becomes an abundant, cheap commodity.

A paradox exists where the cost for a fixed level of AI capability (e.g., GPT-4 level) has dropped 100-1000x. However, overall enterprise spend is increasing because applications now use frontier models with massive contexts and multi-step agentic workflows, creating huge multipliers on token usage that drive up total costs.

OpenAI's CFO argues that revenue growth has a nearly 1-to-1 correlation with compute expansion. This narrative frames fundraising not as covering losses, but as unlocking capped demand, positioning capital injection as a direct path to predictable revenue growth for investors.

The cost of AI, priced in "tokens by the drink," is falling dramatically. All inputs are on a downward cost curve, leading to a hyper-deflationary effect on the price of intelligence. This, in turn, fuels massive demand elasticity as more use cases become economically viable.

While the cost to achieve a fixed capability level (e.g., GPT-4 at launch) has dropped over 100x, overall enterprise spending is increasing. This paradox is explained by powerful multipliers: demand for frontier models, longer reasoning chains, and multi-step agentic workflows that consume exponentially more tokens.

OpenAI's GPT-5.5 is more expensive per token, but a new evaluation framework is emerging. The key metric isn't raw cost, but the model's efficiency in solving a problem. This 'intelligence per dollar' reframes cost analysis around performance and compute, where more expensive models can be cheaper overall if they solve tasks more efficiently.

Instead of viewing compute as a cost center, OpenAI treats it as a revenue generator, analogous to hiring salespeople. The core belief is that demand for AI capabilities is so vast that they can never build compute fast enough to satisfy it, justifying massive, forward-looking infrastructure investments.

While the cost for GPT-4 level intelligence has dropped over 100x, total enterprise AI spend is rising. This is driven by multipliers: using larger frontier models for harder tasks, reasoning-heavy workflows that consume more tokens, and complex, multi-turn agentic systems.

While cutting-edge AI is extremely expensive, its cost drops dramatically fast. A reasoning benchmark that cost OpenAI $4,500 per question in late 2024 cost only $11 a year later. This steep deflation curve means even the most advanced capabilities quickly become accessible to the mass market.