For high-stakes operations like changing a flight, any AI hallucination is a catastrophic failure that could cost customers or trigger lawsuits. This necessity for 100% accuracy in a complex vertical like travel forced Navan to build its own proprietary, agentic AI platform rather than rely on external models alone.
The inconsistency and 'laziness' of base LLMs pose a major hurdle. The best application-layer companies differentiate themselves not just by wrapping a model, but by building a complex harness that ensures the right amount of intelligence is reliably applied to a specific user task, creating a defensible product.
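A minimal sketch of what such a harness might look like: validate the model's output against a schema, reject low-effort answers, and retry rather than surface a bad completion. The `call_model` client and the confidence threshold are hypothetical, not any particular company's implementation.

```python
import json

def call_model(prompt: str) -> str:
    """Hypothetical LLM client; swap in your provider's SDK."""
    raise NotImplementedError

def run_with_harness(task_prompt: str, max_retries: int = 3) -> dict:
    """Retry and validate until the model produces a usable answer,
    so an inconsistent or 'lazy' completion never reaches the user."""
    schema_hint = '\nRespond with JSON: {"answer": "...", "confidence": 0.0-1.0}'
    for _ in range(max_retries):
        raw = call_model(task_prompt + schema_hint)
        try:
            result = json.loads(raw)
        except json.JSONDecodeError:
            continue  # malformed output: retry rather than surface it
        # Reject low-effort or low-confidence answers instead of passing them on.
        if result.get("answer") and result.get("confidence", 0) >= 0.8:
            return result
    # Exhausted retries: fail loudly rather than guess.
    raise RuntimeError("Model output failed validation; escalate to a human.")
```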
Fully autonomous agents are not yet reliable for complex production use cases because errors compound across chained probabilistic steps: a ten-step chain at 95% per-step accuracy succeeds only about 60% of the time. Zapier's CEO recommends a hybrid "agentic workflow" approach: embed a single, decisive agent within an otherwise deterministic, structured workflow to ensure reliability while still leveraging LLM intelligence.
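A sketch of that pattern, assuming a hypothetical `llm_complete` client and stubbed business actions: the LLM makes exactly one decision, its output is constrained to a closed set, and everything around it is ordinary, testable code.

```python
def llm_complete(prompt: str) -> str:
    """Hypothetical LLM client."""
    raise NotImplementedError

def issue_refund(order_id: str) -> None: ...           # ordinary, testable code
def open_reschedule_ticket(order_id: str) -> None: ...

def handle_request(text: str, order_id: str) -> str:
    """Deterministic workflow with exactly one agentic step."""
    # The single probabilistic step: the LLM classifies the request.
    category = llm_complete(
        "Classify this support request as one of refund/reschedule/other. "
        f"Request: {text}"
    ).strip().lower()
    # Immediately constrain the model's output to a closed set.
    if category not in {"refund", "reschedule", "other"}:
        category = "other"
    # Everything downstream is plain branching -- no chained guesses.
    if category == "refund":
        issue_refund(order_id)
        return "refund_issued"
    if category == "reschedule":
        open_reschedule_ticket(order_id)
        return "ticket_opened"
    return "routed_to_human"
```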
For specialized, high-stakes tasks like insurance underwriting, enterprises will favor smaller, on-prem models fine-tuned on proprietary data. These models can be faster, more accurate, and more secure than general-purpose frontier models, creating a lasting market for custom AI solutions.
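For a sense of what on-prem deployment looks like in practice, here is a minimal sketch using Hugging Face transformers; the checkpoint path and prompt format are illustrative, and the point is simply that the fine-tuned weights and inference never leave local infrastructure.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Illustrative local path to a small model fine-tuned on proprietary
# underwriting data; weights and inference stay on-prem.
MODEL_PATH = "/models/underwriting-7b-finetuned"

tokenizer = AutoTokenizer.from_pretrained(MODEL_PATH)
model = AutoModelForCausalLM.from_pretrained(MODEL_PATH, device_map="auto")

def assess_risk(application_summary: str) -> str:
    prompt = f"Assess underwriting risk for:\n{application_summary}\nRisk level:"
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=50)
    # Decode only the newly generated tokens, not the echoed prompt.
    return tokenizer.decode(
        output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )
```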
Navan's CEO sees the debate over which LLM is best as unimportant because the infrastructure is becoming a commodity. The real value is created in the application layer. Navan's own agentic platform, Cognition, intelligently routes tasks to different models (OpenAI, Anthropic, Google) to get the best result for the job.
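The routing idea reduces to a simple dispatch table. The sketch below is hypothetical (the route table, model names, and `call_llm` client are illustrative, not Cognition's actual internals), but it shows the shape: the application layer owns the mapping from task to model, so the models themselves stay swappable commodities.

```python
# Illustrative route table -- not Navan Cognition's actual internals.
ROUTES = {
    "itinerary_change": "anthropic-model",  # careful multi-step tool use
    "policy_question": "openai-model",      # retrieval-heavy reasoning
    "quick_translation": "google-model",    # fast and cheap
}
DEFAULT_MODEL = "openai-model"

def call_llm(model: str, prompt: str) -> str:
    """Hypothetical unified client over several providers."""
    raise NotImplementedError

def run_task(task_type: str, prompt: str) -> str:
    # Pick the model best suited to this task; fall back to a default.
    model = ROUTES.get(task_type, DEFAULT_MODEL)
    return call_llm(model=model, prompt=prompt)
```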
For applications in banking, insurance, or healthcare, reliability is paramount. Startups that architect their systems from the ground up to prevent hallucinations will have a fundamental advantage over those trying to incrementally reduce errors in general-purpose models.
Anyone can build a simple "hackathon version" of an AI agent. The real, defensible moat comes from the painstaking engineering work to make the agent reliable enough for mission-critical enterprise use cases. This "schlep" of nailing the edge cases is a barrier that many, including big labs, are unmotivated to cross.
Instead of starting with simple generative AI tasks, Airbnb focused on the most difficult application: resolving urgent customer issues like lockouts. This high-stakes approach allowed them to build a robust agent that can now be applied to less critical, "up-funnel" use cases like travel planning.
Relying solely on natural language prompts like 'always do this' is unreliable for enterprise AI, because LLMs struggle with deterministic logic. Salesforce developed 'AgentForce Script,' a dedicated language that enforces rules for consistent, repeatable performance in critical business workflows while still blending in LLM reasoning.
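AgentForce Script itself is not shown here; below is a Python sketch of the underlying pattern it describes: hard business rules enforced in ordinary code that no prompt can talk around, with the LLM confined to the fuzzy judgment inside those bounds. The `llm_complete` client and the refund threshold are hypothetical.

```python
MAX_AUTO_REFUND = 500  # hypothetical business rule, enforced in code

def llm_complete(prompt: str) -> str:
    """Hypothetical LLM client."""
    raise NotImplementedError

def process_refund(amount: float, reason: str) -> str:
    # Deterministic guardrail runs first and cannot be talked around:
    # no prompt phrasing can push a large refund through automatically.
    if amount > MAX_AUTO_REFUND:
        return "escalate_to_manager"
    # The LLM handles only the fuzzy judgment within those bounds.
    judgment = llm_complete(
        f"Is this refund reason legitimate? Answer yes or no. Reason: {reason}"
    )
    return "approve" if judgment.strip().lower().startswith("yes") else "deny"
```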
AI agents are simply 'context and actions.' To prevent hallucination and failure, they must be grounded in rich context. This is best provided by a knowledge graph built from the unique data and metadata collected across a platform, creating a powerful, defensible moat.
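A toy illustration of that grounding pattern, assuming a hypothetical `llm_complete` client: facts about an entity are pulled from the graph and the agent is instructed to answer only from them. A production system would use a real graph store, but the context-then-action shape is the same.

```python
# Toy knowledge graph as (subject, relation) -> values entries.
GRAPH = {
    ("listing_42", "checkin_method"): ["smart_lock"],
    ("listing_42", "amenities"): ["wifi", "lockbox"],
    ("listing_42", "support_contact"): ["maria"],
}

def llm_complete(prompt: str) -> str:
    """Hypothetical LLM client."""
    raise NotImplementedError

def gather_context(entity: str) -> str:
    """Pull every known fact about an entity to ground the agent."""
    return "\n".join(
        f"{subject} {relation}: {', '.join(values)}"
        for (subject, relation), values in GRAPH.items()
        if subject == entity
    )

def grounded_answer(question: str, entity: str) -> str:
    # Context first, then action: the agent may only answer from facts.
    prompt = (
        f"Facts:\n{gather_context(entity)}\n\n"
        "Answer using ONLY these facts; reply 'unknown' if they don't cover it.\n"
        f"Question: {question}"
    )
    return llm_complete(prompt)
```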
To prevent AI agents from over-promising or inventing features, you must explicitly define negative constraints. Just as you train them on what your product can do, give them clear boundaries on what it does not do, so they stop inventing answers in an effort to be helpful.
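One simple way to encode this is to put the negative constraints in the system prompt right next to the capabilities. The capability lists below are illustrative, not any real product's feature set.

```python
# Illustrative capability lists for a travel-support agent.
CAPABILITIES = [
    "change or cancel flight bookings",
    "check refund status",
]
NOT_SUPPORTED = [
    "book hotels or rental cars",
    "issue cash refunds outside policy",
    "price-match competitor fares",
]

SYSTEM_PROMPT = (
    "You are a travel support agent.\n"
    "You CAN: " + "; ".join(CAPABILITIES) + ".\n"
    "You CANNOT: " + "; ".join(NOT_SUPPORTED) + ".\n"
    "If asked for anything on the CANNOT list, say so plainly and offer a "
    "supported alternative. Never invent a feature to be helpful."
)
```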