Get your free personalized podcast brief

We scan new podcasts and send you the top 5 insights daily.

Baidu forgoes rigid policies allocating AI compute 'tokens' to employees based on seniority or title. The CFO argues the unit cost of compute drops so fast that such policies become obsolete in weeks. They prefer empowering talent with ample resources, trusting them to prioritize tasks efficiently in a nimble environment.

Related Insights

Current AI models are priced too cheaply, leading to inefficient consumption like using powerful models for simple tasks. As prices rise to reflect true costs, companies will need to optimize usage. This may create a new role, the 'Chief Token Officer,' responsible for allocating AI compute resources versus human capital.

As AI token costs become a significant line item, companies will shift from headcount-based budgets to dollar-based budgets. This will force managers to trade B-player employees in roles like QA or customer success to fund unlimited token access for their A-player engineers.

Traditional software budgeting fails for generative AI, where costs are variable and tied to tokens and usage. A CFO noted a team's daily per-person cost jumped 50% in one week. Companies must accept this volatility, run pilots to establish baseline costs, and then determine ROI, rather than trying to set a fixed budget upfront.

The most heated topic among Fortune 500 CIOs is no longer which AI model is most powerful, but how to manage unpredictable and soaring token costs. Companies are struggling to find the right strategies—from workload prioritization to user-based access tiers—to create a predictable cost model in a rapidly evolving tech landscape.

An anecdote about an engineer spending $100M in a month on AI tokens reveals a core enterprise issue. For Lenovo's CFO, the problem isn't the amount but its lack of planning and clear ROI. This signals a shift from predictable software subscriptions to volatile, usage-based AI compute costs.

To foster breakthrough ideas, companies should initially provide engineers with unrestricted access to the most powerful AI models, ignoring costs. Optimization should only happen after an idea proves its value at scale, as early cost-cutting stifles creativity.

Jensen Huang reframes AI compute as a productivity investment, not a cost. He would be "deeply alarmed" if a $500,000 engineer used less than $250,000 in tokens, comparing it to a chip designer refusing to use CAD tools. This sets a radical new benchmark for leveraging AI in high-skilled roles.

Lenovo's CFO notes a strategic divide. One school of thought uses tight constraints to see who innovates most efficiently. The other, common at US tech firms, gives high caps to let employees "go to town," believing this is the fastest way to discover high-ROI use cases and talent.

Jensen Huang argues that elite AI engineers should not be constrained by compute costs. He proposes a heuristic: if a $500k engineer isn't consuming at least $250k in tokens annually, their talent isn't being leveraged effectively. This reframes compute from a cost center to a critical force multiplier.

Giving teams a 'token budget' is flawed because it incentivizes generating low-value output to hit a quota, similar to bad hiring quotas. Instead, companies must tie token consumption directly to business KPIs. This reframes AI spend as a value-creating investment, not a cost to be managed.