Build Reliable AI Systems Using Code for Rules and LLMs for Flexible Interpretation

Related Insights

Maximal AI Intelligence Means Using Reliable Tools, Not Re-learning Them

An LLM shouldn't do math internally any more than a human would. The most intelligent AI systems will be those that know when to call specialized, reliable tools—like a Python interpreter or a search API—instead of attempting to internalize every capability from first principles.

Meet Snowflake Intelligence: A Personalized Enterprise Intelligence Agent with Sridhar Ramaswamy

No Priors: Artificial Intelligence | Technology | Startups·6 months ago

LLM-First Approaches Shine in Demos But Fail in Production Without a Code-First Foundation

An 'LLM-first' approach, where the model handles core logic, creates impressive demos but lacks production reliability. A 'code-first' approach, using code for structure and LLMs for specific tasks, is less flashy but proves robust and debuggable in real-world applications.

Behind the Curtain: Why the Most Successful AI Apps are Actually Code-First.

Machine Learning Tech Brief By HackerNoon·4 days ago

ZocDoc Uses a 'Deterministic Orchestration Layer' to Safely Implement LLMs

To ensure reliability in healthcare, ZocDoc doesn't give LLMs free rein. It wraps them in a hybrid system where traditional, deterministic code orchestrates the AI's tasks, sets firm boundaries, and knows when to hand off to a human, preventing the 'praying for the best' approach common with direct LLM use.

Zocdoc CEO: "Dr. Google is going to be replaced by Dr. AI"

Decoder with Nilay Patel·7 months ago

Enterprise AI Requires Deterministic Guardrails on Probabilistic LLMs for High-Stakes Tasks

For critical enterprise functions like financial modeling, 99.9% accuracy from a probabilistic LLM is unacceptable. Platforms like Salesforce's Agent Force 360 solve this by layering deterministic logic and guardrails on top of the AI, ensuring compliance and preventing costly errors where even a 0.1% failure rate is too high.

984: Building AI Agents Where 99.9% Accuracy Isn't Good Enough, with Raju Malhotra

Super Data Science: ML & AI Podcast with Jon Krohn·a month ago

Use AI to Build Deterministic Tools, Not Just to Perform One-Off Tasks

Separate AI's role. Use an AI assistant to write reliable, deterministic code for structuring data (e.g., pulling Slack messages via API). Then, apply a live AI model only for the subjective task, like categorizing message urgency. This hybrid approach creates a more robust and controllable system.

I built a custom Slack inbox. It was easier than you’d think. | Yash Tekriwal (Clay)

How I AI·a month ago

High-Stakes Financial AI Agents Require Hybrid Systems, Not Just LLMs

Building reliable AI agents for finance, where accuracy is critical, requires moving beyond pure LLMs. Xero uses a hybrid system combining LLM-driven workflows with programmatic code and deep domain knowledge to ensure control and reliability that LLMs inherently lack.

Gemini Gem Masterclass From the Creator Lisa Huang

The Growth Podcast·2 months ago

Enterprise AI Agents Require Deterministic Scripting, Not Just Natural Language Prompts

Relying solely on natural language prompts like 'always do this' is unreliable for enterprise AI. LLMs struggle with deterministic logic. Salesforce developed 'AgentForce Script,' a dedicated language to enforce rules and ensure consistent, repeatable performance for critical business workflows, blending it with LLM reasoning.

956: From Agent Demo to Enterprise Product (with Ease!) feat. Salesforce’s Tyler Carlson

Super Data Science: ML & AI Podcast with Jon Krohn·4 months ago

Use Traditional Algorithms as 'Guardrails' to Ensure LLM Accuracy in Regulated Industries

To deploy LLMs in high-stakes environments like finance, combine them with deterministic checks. For example, use a traditional algorithm to calculate cash flow and only surface the LLM's answer if it falls within an acceptable range. This prevents hallucinations and ensures reliability.

Xero CPTO on Building an Agentic AI Platform to Manage Multiple Agents | Diya Jolly | E289

The Product Podcast·2 months ago

Isolate and Test AI Components to Mitigate 'Black Box' Risks in Complex Systems

Instead of treating a complex AI system like an LLM as a single black box, build it in a componentized way by separating functions like retrieval, analysis, and output. This allows for isolated testing of each part, limiting the surface area for bias and simplifying debugging.

Rerun: AI ethics advice from former White House technologist - Kasia Chmielinski (Co-Founder, The Data Nutrition Project)

The Product Experience·5 months ago

Get your free personalized podcast brief

Related Insights