Enterprise AI Agents Are Complex Systems, Not Just LLMs with a Wrapper

Related Insights

Real-World AI Agents Require Deterministic Workflows, Not Full Autonomy

Contrary to the vision of free-wheeling autonomous agents, most business automation relies on strict Standard Operating Procedures (SOPs). Products like OpenAI's Agent Builder succeed by providing deterministic, node-based workflows that enforce business logic, which is more valuable than pure autonomy.

How OpenAI Builds for 800 Million Weekly Users: Model Specialization and Fine-Tuning

a16z Podcast·3 months ago

Effective Enterprise AI Requires an "LLM Agnostic Orchestrator" to Deploy the Best Model

Recognizing there is no single "best" LLM, AlphaSense built a system to test and deploy various models for different tasks. This allows them to optimize for performance and even stylistic preferences, using different models for their buy-side finance clients versus their corporate users.

Jack Kokko – Building the Google of Finance at AlphaSense (EP.461)

Capital Allocators – Inside the Institutional Investment Industry·5 months ago

Salesforce Simulates Enterprise Workflows to Stress-Test AI Agents for Failure

To ensure AI reliability, Salesforce builds environments that mimic enterprise CRM workflows, not game worlds. They use synthetic data and introduce corner cases like background noise, accents, or conflicting user requests to find and fix agent failure points before deployment, closing the "reality gap."

How Salesforce Is Using AI to Power the Enterprise

AI & I·4 months ago

AI Tool Differentiation Now Lies in the 'Harness,' Not Just the Underlying LLM

Simply offering the latest model is no longer a competitive advantage. True value is created in the system built around the model—the system prompts, tools, and overall scaffolding. This 'harness' is what optimizes a model's performance for specific tasks and delivers a superior user experience.

Building the God Coding Agent

Latent Space: The AI Engineer Podcast·5 months ago

Lasting AI Innovation Comes from Building with "AI at the Core," Not the Edge

Don't just sprinkle AI features onto your existing product ('AI at the edge'). Transformative companies rethink workflows and shrink their old codebase, making the LLM a core part of the solution. This is about re-architecting the solution from the ground up, not just enhancing it.

How AI is reshaping the product role | Oji and Ezinne Udezue

Lenny's Podcast: Product | Career | Growth·5 months ago

Agentic AI is an Orchestration of Specialized 'Worker' Agents

True Agentic AI isn't a single, all-powerful bot. It's an orchestrated system of multiple, specialized agents, each performing a single task (e.g., qualifying, booking, analyzing). This 'division of labor,' mirroring software engineering principles, creates a more robust, scalable, and manageable automation pipeline.

How to use agentic AI to help modern selling? | Caroline Onyedinma - 1951

The Sales Evangelist·3 months ago

Agentic AI Training Requires Simulated 'RL Environments,' Not Just Traditional RLHF

Training AI agents to execute multi-step business workflows demands a new data paradigm. Companies create reinforcement learning (RL) environments—mini world models of business processes—where agents learn by attempting tasks, a more advanced method than simple prompt-completion training (SFT/RLHF).

20VC: Scale, Surge, Turing, Mercor: Who Wins & Who Loses in Data Labelling | Is Revenue in Data Labelling Real or GMV? | Why 99% of Knowledge Work Will Go and What Happens Then? | Why SaaS is Dead in a World of AI with Jonathan Siddharth @ Turing

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch·3 months ago

Complex AI Products Require a Multi-Agent System to Avoid Context Rot

When building Spiral, a single large language model trying to both interview the user and write content failed due to "context rot." The solution was a multi-agent system where an "interviewer" agent hands off the full context to a separate "writer" agent, improving performance and reliability.

Spiral: Designing an AI Ghostwriter With Taste

AI & I·4 months ago

AI Companies Must Become Full-Stack Partners for Enterprise Success

Enterprises struggle to get value from AI due to a lack of iterative, data-science expertise. The winning model for AI companies isn't just selling APIs, but embedding "forward deployment" teams of engineers and scientists to co-create solutions, closing the gap between prototype and production value.

Why data is the biggest AI bottleneck (feat. Arthur Mensch of Mistral AI) | E2212

This Week in Startups·3 months ago

Agentic AI's Most Valuable Use is Untangling Legacy Systems, Not Just Creating New Ones

Enterprises are trapped by decades of undocumented code. Rather than ripping and replacing, agentic AI can analyze and understand these complex systems. This enables redesign from the inside out and modernizes the core of the business, bridging the gap between business and IT.

#763: Pega CTO Don Schuerman on how AI can pay down tech debt and accelerate digital transformation

The Agile Brand with Greg Kihlström®: Expert Mode Marketing Technology, AI, & CX·3 months ago