/
© 2026 RiffOn. All rights reserved.

Get your free personalized podcast brief

We scan new podcasts and send you the top 5 insights daily.

  1. Latent Space: The AI Engineer Podcast
  2. Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer
Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer

Latent Space: The AI Engineer Podcast · Mar 12, 2026

Turbopuffer's Simon Eskildsen on building an S3-native search DB for AI. He discusses hybrid search, agent workloads, and hiring P99 engineers.

Turbopuffer Believes a Startup's Only True Moat Is Intense Focus

While the long-term vision for a major database is to support every query plan, the only sustainable advantage for a startup is focus. The founder explicitly states their biggest risk is overeagerness and that they will regret trying to do too much, not too little.

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer thumbnail

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer

Latent Space: The AI Engineer Podcast·3 days ago

Turbopuffer Bought Dark Fiber for a Customer to Avoid Architectural Compromise

To serve Notion on AWS while its core infra was on GCP, Turbopuffer bought dark fiber to reduce cross-cloud latency. This extreme measure was preferable to compromising their core architectural principle of avoiding a stateful consensus layer, showcasing deep product conviction.

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer thumbnail

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer

Latent Space: The AI Engineer Podcast·3 days ago

Elite Engineers Close the Gap Between 'Napkin Math' and Real-World System Performance

High-agency engineering is defined as the relentless process of making software perform closer to its theoretical limits, as calculated by first-principles "napkin math." Elite engineers systematically eliminate bottlenecks until the observed performance matches the theoretical maximum.

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer thumbnail

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer

Latent Space: The AI Engineer Podcast·3 days ago

Turbopuffer Was Born from a Feature Too Expensive for Readwise to Build

The idea for Turbopuffer originated when its founder calculated that adding an embedding-based feature to Readwise would cost $30k/month, a 6x increase in their total infra bill. This single data point revealed a clear market need for a drastically cheaper vector search solution.

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer thumbnail

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer

Latent Space: The AI Engineer Podcast·3 days ago

AI Agents Are Shifting RAG Workloads to Massive Parallel Searches

The nature of Retrieval-Augmented Generation (RAG) is evolving. Instead of a single search to populate an initial context window, AI agents are now performing numerous concurrent queries in a single turn. This allows them to explore diverse information paths simultaneously, driving new database requirements.

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer thumbnail

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer

Latent Space: The AI Engineer Podcast·3 days ago

Turbopuffer's Hiring Process Defaults to 'No' Unless an Interviewer Fights For the Candidate

To maintain an exceptionally high talent density, Turbopuffer's hiring process defaults to rejecting a candidate. An offer is only considered if at least one interviewer is willing to passionately "fight" for them, shifting the burden of proof from "why not hire" to "why we must hire."

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer thumbnail

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer

Latent Space: The AI Engineer Podcast·3 days ago

Recent Cloud Primitive Upgrades Enabled Turbopuffer's Radically Simple Architecture

Turbopuffer's design avoids a complex consensus layer (like Zookeeper) by relying on two recent cloud primitive upgrades: S3's strong consistency (post-2020) and a compare-and-swap feature for metadata updates. This creates a simpler, more robust, and stateless system.

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer thumbnail

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer

Latent Space: The AI Engineer Podcast·3 days ago

Turbopuffer Achieved Profitability by Running on its Founder's Credit Card

Early on, the founder ran Turbopuffer's cloud infrastructure on his personal credit card. When a large customer's usage bill skyrocketed, the immense financial pressure forced the team to optimize relentlessly, leading them to become profitable out of necessity rather than strategy.

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer thumbnail

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer

Latent Space: The AI Engineer Podcast·3 days ago

Turbopuffer's Founder Built Investor Trust by Promising to Return Funds If PMF Failed

Simon Eskildsen told his first investor that he'd return the money if the company didn't find product-market fit within a year. This extreme transparency, while unconventional, was seen as a sign of deep commitment and integrity, ultimately winning the investor's trust.

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer thumbnail

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer

Latent Space: The AI Engineer Podcast·3 days ago

Why Turbopuffer's Technical Team Chose a Generalist Investor Over Database Experts

Despite building a database, Turbopuffer chose a generalist investor over domain experts. The founders already had deep technical knowledge; they valued help with acquiring customers and candidates more, areas where a well-connected generalist provided more value than redundant technical advice.

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer thumbnail

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer

Latent Space: The AI Engineer Podcast·3 days ago

A Generational Database Company Needs a New Workload, Storage Architecture, and Query Ambition

Truly massive database companies only emerge every ~15 years when three conditions are met: a new ubiquitous workload (like AI), a new underlying storage architecture that predecessors can't adopt (like NVMe SSDs and S3), and a long-term roadmap to handle all possible data queries.

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer thumbnail

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer

Latent Space: The AI Engineer Podcast·3 days ago