Get your free personalized podcast brief

We scan new podcasts and send you the top 5 insights daily.

The current model of paying per AI token is a temporary phase. Drawing a parallel to computing history, any resource constraint that requires payment eventually moves to the user's local device and becomes free. On-device AI processing will follow this pattern, ultimately eliminating token costs.

Related Insights

Metering AI usage by tokens is becoming unmanageable for non-technical departments like marketing, sales, and HR. The complexity of tracking usage and tying it to value will likely force a market shift toward flat-fee, unlimited usage plans priced on outcomes or per-employee value instead.

For years, flat-rate AI subscriptions heavily subsidized power users, masking the true cost of token consumption. As providers shift to usage-based billing, this subsidy is ending. Enterprises now face "sticker shock" and must justify AI spend with clear ROI, moving from rampant experimentation to cost-conscious implementation.

Relying on third-party APIs for AI is becoming unsustainable due to high token costs and the inherent security risk of uploading sensitive data. This will force a market shift toward powerful local hardware for running private, cost-effective models.

Intense demand for AI tokens is outstripping compute supply, making flat-rate SaaS pricing unsustainable. Companies like GitHub are now shifting to usage-based billing to cover escalating inference costs, marking a fundamental change in how AI products are sold and signaling a broader industry trend.

As more companies integrate AI, their costs are tied to variable usage (e.g., tokens, inference). This is causing a profound, economy-wide transformation away from predictable seat-based subscriptions towards more dynamic usage-based models to align costs with revenue.

Current AI pricing models, which pass on expensive LLM costs to users, are temporary. As LLM costs inevitably collapse and become commoditized, the winning companies will be those who have already evolved their monetization to be based on the value their product delivers.

The primary short-term risk for the AI sector isn't capital expenditure but the high cost of token generation. For AI applications to become ubiquitous, the unit economics must improve. If running a single query remains prohibitively expensive for businesses, widespread, sustainable adoption will be impossible, threatening the entire investment thesis.

The business model for AI is pivoting away from SaaS-style subscriptions. Enterprise-focused labs like Anthropic see massive revenue not from adding users, but from the immense token consumption of API power users. A single developer can be 100x more valuable than a subscriber, forcing a shift to consumption-based pricing.

The current affordability of AI tokens is not sustainable; it's propped up by venture capital funding AI companies operating at a loss. Businesses should treat this as a temporary window for aggressive learning and experimentation before prices inevitably rise to reflect true operational costs.

The evolution of AI towards complex, autonomous "agents" makes relying solely on the cloud slow and expensive, as users burn through token budgets. Nvidia's bet is that running these agents locally on powerful new PC chips will be faster and cheaper for consumers, driving a major hardware shift away from pure cloud computing.