BloombergGPT's Failure Shows Fine-Tuning Is Usually an Expensive Trap

Related Insights

Prioritize System-Level AI Memory Over Brittle Fine-Tuning for Enterprise Applications

Fine-tuning creates model-specific optimizations that quickly become obsolete. Blitzy favors developing sophisticated, system-level "memory" that captures enterprise-specific context and preferences. This approach is model-agnostic and more durable as base models improve, unlike fine-tuning which requires constant rework.

Infinite Code Context: AI Coding at Enterprise Scale w/ Blitzy CEO Brian Elliott & CTO Sid Pardeshi

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·5 months ago

AI Startups Create Value in the Application Layer, Not by Fine-Tuning Models

Early-stage AI startups should resist spending heavily on fine-tuning foundational models. With base models improving so rapidly, the defensible value lies in building the application layer, workflow integrations, and enterprise-grade software that makes the AI useful, allowing the startup to ride the wave of general model improvement.

20VC: From Only OpenAI to Die-Hard Anthropic: The Downfall of OpenAI in Enterprise | Harvey vs Legora: Legal AI is a Winner Take All | $7M ARR in a Single Day and Raising $200M Across 3 Rounds with No Deck with Max Junestrand, CEO @ Legora

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch·5 months ago

Building Proprietary AI Tools Risks Creating an Obsolete, High-Cost System

The opportunity cost of building custom internal AI can be massive. By the time a multi-million dollar project is complete, off-the-shelf tools like ChatGPT are often far more capable, dynamic, and cost-effective, rendering the custom solution outdated on arrival.

#194: Agentic AI Timelines, Generalists vs. Specialists, Resume Tips, AI Learning Ownership, & Handling Model Updates

The Artificial Intelligence Show·5 months ago

AI Product Scaffolding Gets Eaten by More Advanced Models

The "bitter lesson" of AI applies to product development: complex scaffolding built around model limitations (like early vector stores or agent frameworks) will inevitably become obsolete as the models themselves get smarter and absorb those functions. Don't over-engineer solutions that a future model will solve natively.

“Engineers are becoming sorcerers” | The future of software development with OpenAI’s Sherwin Wu

Lenny's Podcast: Product | Career | Growth·5 months ago

OpenAI Prefers Prompt Optimization Over Fine-Tuning Due to Infrastructure Complexity

OpenAI favors "zero gradient" prompt optimization because serving thousands of unique, fine-tuned model snapshots is operationally very difficult. Prompt-based adjustments allow performance gains without the immense infrastructure burden, making it a more practical and scalable approach for both OpenAI and developers.

DevDay 2025: Apps SDK, Agent Kit, MCP, Codex and why Prompting is More Important than Ever

Latent Space: The AI Engineer Podcast·9 months ago

The 'Build vs. Buy' Framework Shifts Toward Buying Core AI Intelligence

The traditional wisdom to "build what's core" to your business is becoming obsolete for AI. The immense cost and rapid advancement of foundational models by major labs mean most companies are better off buying or partnering for core AI capabilities rather than attempting to build them in-house.

#199: AI Answers - Do Custom GPTs Still Matter? AI Output Validation, 2026 Job Disruption, Preventing Burnout, and Build vs. Buy

The Artificial Intelligence Show·4 months ago

Fine-Tuning Is a Niche Optimization, Not an Enterprise Starting Point

Fine-tuning remains relevant but is not the primary path for most enterprise use cases. It's a specialized tool for situations with unique data unseen by foundation models or when strict cost and throughput requirements for a high-volume task justify the investment. Most should start with RAG.

Bringing AI to Data: Agent Design, Text-2-SQL, RAG, & more, w- Snowflake VP of AI Baris Gultekin

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·6 months ago

Building Scaffolding Around LLMs Is a Losing Strategy Due to the "Bitter Lesson"

Richard Sutton's "Bitter Lesson" suggests general compute always wins. Applied to LLMs, building complex workflows or fine-tuning yields only temporary gains that the next-generation general model will erase. Always bet on the more general model.

Head of Claude Code: What happens after coding is solved | Boris Cherny

Lenny's Podcast: Product | Career | Growth·5 months ago

Longbeard CEO Finds Fine-Tuning Fails for Vertical AI Requiring Deep Theological Alignment

For use cases demanding strict fidelity to a complex knowledge domain like Catholic theology, fine-tuning existing models proves inadequate over the long tail of user queries. This necessitates the more expensive path of training a model from scratch.

What is Catholic AI? Technology Meets Theology, with Matthew Harvey Sanders, CEO of Longbeard

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·8 months ago

Enterprises Rarely Switch LLMs Due to High Re-Optimization Costs

Despite constant new model releases, enterprises don't frequently switch LLMs. Prompts and workflows become highly optimized for a specific model's behavior, creating significant switching costs. Performance gains of a new model must be substantial to justify this re-engineering effort.

Bringing AI to Data: Agent Design, Text-2-SQL, RAG, & more, w- Snowflake VP of AI Baris Gultekin

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·6 months ago

Get your free personalized podcast brief

Related Insights