Instead of expensive, static pre-training on proprietary data, enterprises prefer RAG. This approach is cheaper, allows for easy updates as data changes, and benefits from continuous improvements in foundation models, making it a more practical and dynamic solution.
While consumer AI tolerates some inaccuracy, enterprise systems like customer service chatbots require near-perfect reliability. Teams get frustrated because out-of-the-box RAG templates don't meet this high bar. Achieving business-acceptable accuracy requires deep, iterative engineering, not just a vanilla implementation.
For specialized, high-stakes tasks like insurance underwriting, enterprises will favor smaller, on-prem models fine-tuned on proprietary data. These models can be faster, more accurate, and more secure than general-purpose frontier models, creating a lasting market for custom AI solutions.
The key for enterprises isn't integrating general AI like ChatGPT but creating "proprietary intelligence." This involves fine-tuning smaller, custom models on their unique internal data and workflows, creating a competitive moat that off-the-shelf solutions cannot replicate.
According to IBM's AI Platform VP, Retrieval-Augmented Generation (RAG) was the killer app for enterprises in the first year after ChatGPT's release. RAG allows companies to connect LLMs to their proprietary structured and unstructured data, unlocking immense value from existing knowledge bases and proving to be the most powerful initial methodology.
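To ground that description, here is a minimal sketch of the RAG pattern, assuming a toy bag-of-words embed() and a placeholder call_llm() in place of a real embedding model and provider API (both names are illustrative): retrieve the most relevant proprietary chunks, then have the model answer from them.

```python
# Minimal RAG sketch: retrieve the top-k proprietary chunks by similarity,
# then ground the model's answer in them. embed() is a toy bag-of-words
# stand-in for an embedding model; call_llm() is a placeholder for any
# provider's completion API.
from collections import Counter
import math

def embed(text: str) -> Counter:
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    q = embed(query)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]

def call_llm(prompt: str) -> str:
    # Placeholder: a real system would send the prompt to an LLM here.
    return f"[model answer grounded in {len(prompt)} chars of context]"

def answer(query: str, chunks: list[str]) -> str:
    context = "\n".join(retrieve(query, chunks))
    return call_llm(f"Answer using only this context:\n{context}\n\nQuestion: {query}")

knowledge_base = [
    "Refunds are processed within 14 business days.",
    "Premium support is available to enterprise customers only.",
    "The public API rate limit is 600 requests per minute.",
]
print(answer("How long do refunds take?", knowledge_base))
```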
Basic supervised fine-tuning (SFT) only adjusts a model's style. The real unlock for enterprises is reinforcement fine-tuning (RFT), which leverages proprietary datasets to create state-of-the-art models for specific, high-value tasks, moving beyond mere "tone improvements."
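To make the contrast concrete, here is a hedged sketch of the piece RFT adds on top of SFT: a task-specific grader over proprietary labels that scores each sampled completion, producing the reward a policy-gradient update would optimize. The grade() function and the underwriting-style examples are illustrative, not any vendor's API.

```python
# Sketch of the grading step that distinguishes RFT from SFT: instead of
# imitating reference text token by token, each sampled completion is scored
# by a task-specific grader against proprietary labels, and that reward is
# what the policy update optimizes. grade() and the examples are illustrative.

def grade(completion: str, gold: str) -> float:
    """Reward in [0, 1]; here, exact match on the decision the task cares about."""
    return 1.0 if completion.strip().lower() == gold.strip().lower() else 0.0

def batch_rewards(samples: list[tuple[str, str]]) -> list[float]:
    """Score (completion, gold) pairs; a real trainer feeds these rewards
    into a policy-gradient update rather than a next-token imitation loss."""
    return [grade(completion, gold) for completion, gold in samples]

# Proprietary-style examples: model decisions graded against adjudicated outcomes.
batch = [
    ("approve with surcharge", "approve with surcharge"),  # reward 1.0
    ("decline", "approve with surcharge"),                 # reward 0.0
]
print(batch_rewards(batch))  # [1.0, 0.0]
```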
Teams often agonize over which vector database to use for their Retrieval-Augmented Generation (RAG) system. However, the most significant performance gains come from superior data preparation, such as optimizing chunking strategies, adding contextual metadata, and rewriting documents into a Q&A format.
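As an illustration of that kind of data preparation, here is a sketch of structure-aware chunking with contextual metadata, under the assumption that documents arrive with a title and section headings (the Chunk fields and size budget are illustrative): each chunk is prefixed with its title and section so the retriever has more to match on.

```python
# Data-preparation sketch: structure-aware chunking with contextual metadata.
# Prefixing each chunk with its document title and section heading, and
# carrying those fields as metadata, gives the retriever far more signal
# than fixed-size slices of raw text. Field names are illustrative.
from dataclasses import dataclass

@dataclass
class Chunk:
    text: str        # what gets embedded and retrieved
    doc_title: str   # contextual metadata carried with the chunk
    section: str

def chunk_document(title: str, sections: dict[str, str], max_chars: int = 800) -> list[Chunk]:
    chunks: list[Chunk] = []
    for heading, body in sections.items():
        sentences = [s if s.endswith(".") else s + "." for s in body.split(". ")]
        buf = ""
        for sentence in sentences:
            # Respect the size budget, but never split mid-sentence.
            if buf and len(buf) + len(sentence) > max_chars:
                chunks.append(Chunk(f"{title} / {heading}: {buf.strip()}", title, heading))
                buf = ""
            buf += sentence + " "
        if buf.strip():
            chunks.append(Chunk(f"{title} / {heading}: {buf.strip()}", title, heading))
    return chunks

doc = {
    "Eligibility": "Enterprise plans require an annual contract. Trials last 30 days.",
    "Refunds": "Refunds are processed within 14 business days of approval.",
}
for c in chunk_document("Billing Policy", doc):
    print(c.section, "->", c.text)
```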
Instead of relying on expensive, general-purpose frontier models, companies can build a reinforcement learning (RL) environment specific to their application (e.g., a code editor) and use it to train smaller, specialized open-source models that excel at that task, achieving better performance at a fraction of the cost.
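A hedged sketch of what such an application-specific RL environment could look like for the code-editor example: the observation is source code with a failing requirement, the action is a proposed edit, and the reward comes from running the project's own checks. The class and test are toy stand-ins, not a production harness.

```python
# Sketch of an application-specific RL environment for the code-editor case:
# the observation is source code with a failing requirement, the action is a
# proposed edit, and the reward is whether the project's own checks pass.
# Everything here is a toy stand-in (single-step episodes, exec-based tests).
class CodeEditEnv:
    def __init__(self, source: str, checks):
        self.source = source
        self.checks = checks            # callable: source code -> bool

    def reset(self) -> str:
        return self.source              # observation handed to the policy model

    def step(self, edited_source: str):
        reward = 1.0 if self.checks(edited_source) else 0.0
        return edited_source, reward, True   # (observation, reward, done)

# Toy task: the "test suite" checks that a wrong comparison operator is fixed.
buggy = "def is_adult(age): return age == 18"

def checks(src: str) -> bool:
    scope = {}
    exec(src, scope)
    return bool(scope["is_adult"](30) and scope["is_adult"](18))

env = CodeEditEnv(buggy, checks)
obs = env.reset()
proposed = obs.replace("==", ">=")      # stands in for the policy's sampled edit
_, reward, done = env.step(proposed)
print("reward:", reward)                # 1.0 once the checks pass
```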
Answer Engine Optimization (AEO) is not about getting into an LLM's training data, which is slow and difficult. Instead, it targets Retrieval-Augmented Generation (RAG): the live search the LLM performs for current information at answer time. This makes AEO a real-time, controllable marketing channel.
Fine-tuning remains relevant but is not the primary path for most enterprise use cases. It's a specialized tool for situations with unique data unseen by foundation models or when strict cost and throughput requirements for a high-volume task justify the investment. Most should start with RAG.
Classic RAG involves a single data retrieval step. Its evolution, "agentic retrieval," allows an AI to perform a series of conditional fetches from different sources (APIs, databases). This enables the handling of complex queries where each step informs the next, mimicking a research process.
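A sketch of the difference, assuming two hypothetical sources (a CRM lookup and a policy database): where classic RAG makes one retrieval call, the agentic loop lets the model decide each next fetch based on what the previous one returned, stopping when it has enough evidence to answer. plan_next_step() stands in for an LLM decision.

```python
# Agentic retrieval sketch: rather than one retrieval call, the model plans a
# sequence of fetches, and each result can change what it asks for next.
# plan_next_step() stands in for an LLM decision; fetch_crm() and
# fetch_policy() are hypothetical stand-ins for real APIs and databases.

def fetch_crm(customer_id: str) -> dict:
    return {"customer_id": customer_id, "plan": "enterprise"}

def fetch_policy(plan: str) -> dict:
    return {"plan": plan, "refund_window_days": 14}

SOURCES = {"crm": fetch_crm, "policy": fetch_policy}

def plan_next_step(question: str, evidence: list[dict]):
    """Stand-in for the model deciding what to fetch next, or to stop."""
    if not evidence:
        return ("crm", "cust-42")                  # step 1: look up the customer
    if "refund_window_days" not in evidence[-1]:
        return ("policy", evidence[-1]["plan"])    # step 2: depends on step 1's result
    return None                                    # enough evidence to answer

def agentic_retrieve(question: str) -> list[dict]:
    evidence = []
    while (step := plan_next_step(question, evidence)) is not None:
        source, arg = step
        evidence.append(SOURCES[source](arg))      # conditional fetch
    return evidence

print(agentic_retrieve("What is cust-42's refund window?"))
```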