RAG Is Evolving From Single-Shot Retrieval to Multi-Step Agentic Workflows

Related Insights

AI Coding Agents Are Abandoning Embedding-Based RAG for Simpler Agent-Based Search

Embedding-based RAG for code search is falling out of favor because its arbitrary chunking often fails to capture full semantic context. Simpler, more direct approaches like agent-based search using tools like `grep` are proving more reliable and scalable for retrieving relevant code without the maintenance overhead of embeddings.

⚡ [AIE CODE Preview] Inside Google Labs: Building The Gemini Coding Agent — Jed Borovik, Jules

Latent Space: The AI Engineer Podcast·3 months ago
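To make the agent-based search idea above concrete, here is a minimal sketch of a grep-like tool an agent could call in place of an embedding index. The function name `grep_tool` and its parameters are hypothetical, chosen only for illustration. They are not taken from any particular coding agent.

```python
import re
from pathlib import Path


def grep_tool(root: str, pattern: str, glob: str = "*.py",
              max_hits: int = 20) -> list[tuple[str, int, str]]:
    """Return (relative_path, line_number, stripped_line) for matching lines.

    Hypothetical tool: searches files on demand, so there is no index
    or chunking step to build and keep in sync with the code.
    """
    regex = re.compile(pattern)
    root_path = Path(root)
    hits: list[tuple[str, int, str]] = []
    for path in sorted(root_path.rglob(glob)):
        if not path.is_file():
            continue
        try:
            text = path.read_text(encoding="utf-8")
        except (UnicodeDecodeError, OSError):
            continue  # skip binary or unreadable files
        for lineno, line in enumerate(text.splitlines(), start=1):
            if regex.search(line):
                rel = path.relative_to(root_path).as_posix()
                hits.append((rel, lineno, line.strip()))
                if len(hits) >= max_hits:
                    return hits  # cap output so results fit in the model's context
    return hits
```

An agent would typically call such a tool several times, refining the pattern based on earlier hits, rather than relying on a single similarity lookup.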

LexisNexis Uses "Agentic AI" to Route Tasks to the Best-Performing LLM

Rather than relying on a single LLM, LexisNexis employs a "planning agent" that decomposes a complex legal query into sub-tasks. It then assigns each task (e.g., deep research, document drafting) to the specific LLM best suited for it, demonstrating a sophisticated, model-agnostic approach for enterprise AI.

LexisNexis CEO says the AI law era is already here

Decoder with Nilay Patel·4 months ago
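The planner-and-router pattern described above might look roughly like the sketch below. Everything in it is an assumption made for illustration: the `SubTask` type, the two stub "models", and the fixed decomposition in `plan` are hypothetical and do not reflect how LexisNexis implements its system.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class SubTask:
    kind: str    # e.g. "research" or "drafting"
    prompt: str


# Hypothetical stand-ins for calls to different LLM backends.
def research_model(prompt: str) -> str:
    return f"[research-model] {prompt}"


def drafting_model(prompt: str) -> str:
    return f"[drafting-model] {prompt}"


# Routing table: each sub-task kind maps to the model assumed to suit it best.
ROUTES: dict[str, Callable[[str], str]] = {
    "research": research_model,
    "drafting": drafting_model,
}


def plan(query: str) -> list[SubTask]:
    """Toy planner; a real system would ask an LLM to decompose the query."""
    return [
        SubTask("research", f"Find authorities relevant to: {query}"),
        SubTask("drafting", f"Draft a memo answering: {query}"),
    ]


def run(query: str) -> list[str]:
    """Decompose the query, then send each sub-task to its routed model."""
    return [ROUTES[task.kind](task.prompt) for task in plan(query)]
```

Keeping the routing table separate from the planner is what makes the approach model-agnostic: swapping a backend means changing one entry in `ROUTES`.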

Agentic AI Training Requires Simulated 'RL Environments,' Not Just Traditional RLHF

Training AI agents to execute multi-step business workflows demands a new data paradigm. Companies create reinforcement learning (RL) environments—mini world models of business processes—where agents learn by attempting tasks, a more advanced method than simple prompt-completion training (SFT/RLHF).

20VC: Scale, Surge, Turing, Mercor: Who Wins & Who Loses in Data Labelling | Is Revenue in Data Labelling Real or GMV? | Why 99% of Knowledge Work Will Go and What Happens Then? | Why SaaS is Dead in a World of AI with Jonathan Siddharth @ Turing

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch·3 months ago
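As a rough illustration of what an RL environment for a business workflow could look like, the sketch below models a three-step refund process with the familiar `reset`/`step` interface. The class name, step names, and reward values are all hypothetical. Real environments of this kind are far richer.

```python
class RefundWorkflowEnv:
    """Toy environment: the agent must perform the workflow steps in order."""

    STEPS = ("open_ticket", "verify_purchase", "issue_refund")

    def __init__(self, max_turns: int = 6):
        self.max_turns = max_turns
        self.reset()

    def reset(self) -> dict:
        """Start a new episode and return the initial observation."""
        self.progress = 0
        self.turns = 0
        return self._observe()

    def _observe(self) -> dict:
        return {
            "completed": list(self.STEPS[: self.progress]),
            "turns_left": self.max_turns - self.turns,
        }

    def step(self, action: str) -> tuple[dict, float, bool]:
        """Apply one action; return (observation, reward, done)."""
        self.turns += 1
        reward = 0.0
        if self.progress < len(self.STEPS) and action == self.STEPS[self.progress]:
            self.progress += 1
            if self.progress == len(self.STEPS):
                reward = 1.0   # whole workflow finished
        else:
            reward = -0.1      # wrong or out-of-order action
        done = self.progress == len(self.STEPS) or self.turns >= self.max_turns
        return self._observe(), reward, done
```

The reward depends on completing the task rather than on matching a reference completion, which is the contrast with prompt-completion training drawn above.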

90% of IBM's Enterprise Use Cases After ChatGPT Were RAG

According to IBM's AI Platform VP, Retrieval-Augmented Generation (RAG) was the killer app for enterprises in the first year after ChatGPT's release. RAG allows companies to connect LLMs to their proprietary structured and unstructured data, unlocking immense value from existing knowledge bases and proving to be the most powerful initial methodology.

AI Agents for PMs in 69 Minutes — Masterclass with IBM VP

Product Growth Podcast·5 months ago

Anthropic's Claude Code Ditched Vector Search for More Accurate "Agentic Search"

While vector search is a common approach for RAG, Anthropic found it difficult to maintain and risky from a security standpoint for enterprise codebases. They switched to "agentic search," where the AI model actively uses tools like `grep` or `find` to locate code, achieving similar accuracy with a cleaner deployment.

Inside Claude Code From the Engineers Who Built It

AI & I·4 months ago

True AI Breakthroughs Are No Longer About Better Chat, But About Agentic Capabilities

While language models are becoming incrementally better at conversation, the next significant leap in AI is defined by multimodal understanding and the ability to perform tasks, such as navigating websites. This shift from conversational prowess to agentic action marks the new frontier for a true "step change" in AI capabilities.

Google Gemini 3 reactions, Google Antigravity, Anthropic-Nvidia-Microsoft Deal | Diet TBPN

TBPN·3 months ago

Google's AI Search Uses "Query Fanout" to Run Dozens of Background Searches for a Single Prompt

Unlike chatbots that rely solely on their training data, Google's AI acts as a live researcher. For a single user query, the model executes a "query fanout": it runs multiple targeted background searches to gather, synthesize, and cite fresh information from across the web in real time.

Inside Google's AI turnaround: The rise of AI Mode, strategy behind AI Overviews, and their vision for AI-powered search | Robby Stein (VP of Product, Google Search)

Lenny's Podcast: Product | Career | Growth·4 months ago
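The fanout pattern described above can be sketched as: expand one prompt into several sub-queries, run them concurrently, then merge and deduplicate the results. The `FAKE_INDEX`, `search`, and `expand` names below are hypothetical stand-ins. This is not Google's implementation, and a real system would have a model generate the sub-queries and call a live search backend.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory "index" standing in for a live search backend.
FAKE_INDEX = {
    "best hiking boots waterproof": ["example.com/boots-review", "example.com/waterproofing"],
    "best hiking boots wide feet": ["example.com/wide-fit", "example.com/boots-review"],
}


def search(query: str) -> list[str]:
    """Stub search call; returns result URLs for one sub-query."""
    return FAKE_INDEX.get(query, [])


def expand(prompt: str) -> list[str]:
    """Toy expansion; a real system would have a model propose sub-queries."""
    return [f"{prompt} waterproof", f"{prompt} wide feet"]


def fanout(prompt: str) -> list[str]:
    """Run all sub-queries concurrently and merge results without duplicates."""
    sub_queries = expand(prompt)
    with ThreadPoolExecutor(max_workers=4) as pool:
        result_lists = list(pool.map(search, sub_queries))  # map preserves input order
    merged: list[str] = []
    seen: set[str] = set()
    for results in result_lists:
        for url in results:
            if url not in seen:
                seen.add(url)
                merged.append(url)
    return merged
```

The merged list would then be passed to the model as context for synthesis and citation.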

Accelerate AI Agent Development by Running Research and Scraping in Parallel

The agent development process can be significantly sped up by running multiple tasks concurrently. While one agent is engineering a prompt, other processes can be simultaneously scraping websites for a RAG database and conducting deep research on separate platforms. This parallel workflow is key to building complex systems quickly.

How to Build Multi-Agent AI Systems That Actually Work in Production | Tyler Fisk

Product Growth Podcast·4 months ago

Agentic Search Often Beats Complex Vector DBs for Code Retrieval

While complex RAG pipelines with vector stores are popular, leading code agents like Anthropic's Claude Code demonstrate that simple "agentic retrieval" using basic file tools can be superior. Giving an agent a manifest file (like `llms.txt`) and a tool to fetch files can outperform pre-indexed semantic search.

Context Engineering for Agents - Lance Martin, LangChain

Latent Space: The AI Engineer Podcast·5 months ago
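A minimal sketch of the manifest-plus-fetch pattern described above follows. The one-line-per-file manifest format (`path: description`) is a simplified assumption made for this example, and `choose_file` uses plain word overlap as a stand-in for the model's own decision about which file to read.

```python
from pathlib import Path


def load_manifest(manifest_path: str) -> dict[str, str]:
    """Parse lines of the assumed form 'relative/path: one-line description'."""
    entries: dict[str, str] = {}
    for line in Path(manifest_path).read_text(encoding="utf-8").splitlines():
        if ":" in line and not line.startswith("#"):
            path, description = line.split(":", 1)
            entries[path.strip()] = description.strip()
    return entries


def fetch_file(root: str, relative_path: str) -> str:
    """Tool exposed to the agent: return the contents of one listed file."""
    return (Path(root) / relative_path).read_text(encoding="utf-8")


def choose_file(manifest: dict[str, str], question: str) -> str:
    """Toy stand-in for the model's choice: most words shared with the question."""
    words = set(question.lower().split())
    return max(manifest, key=lambda p: len(words & set(manifest[p].lower().split())))
```

In an actual agent, the manifest text would be placed in the prompt and the model would call `fetch_file` itself, so the quality of the descriptions largely determines retrieval quality.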

Agentic AI Goes Beyond Chatbots to Provide "Augmented Digital Labor"

The next evolution of enterprise AI isn't conversational chatbots but "agentic" systems that act as augmented digital labor. These agents perform complex, multi-step tasks from natural language commands, such as creating a training quiz from a 700-page technical document.

Propel VP of Product Marketing on Building Products for High-Stakes Industries

Product Talk·2 months ago