Get your free personalized podcast brief

We scan new podcasts and send you the top 5 insights daily.

Instead of interacting with a single LLM, users will increasingly call an API that represents a "system as a model." Behind the scenes, this triggers a complex orchestration of multiple specialized models, sub-agents, and tools to complete a task, while maintaining a simple user experience.

Related Insights

The true power of the AI application layer lies in orchestrating multiple, specialized foundation models. Users want a single interface (like Cursor for coding) that intelligently routes tasks to the best model (e.g., Gemini for front-end, Codex for back-end), creating value through aggregation and workflow integration.

Anthropic's new "Agent Teams" feature moves beyond the single-agent paradigm by enabling users to deploy multiple AIs that work in parallel, share findings, and challenge each other. This represents a new way of working with AI, focusing on the orchestration and coordination of AI teams rather than just prompting a single model.

The distinction between a "model" and an "agent" is dissolving. Google's new Interactions API provides a single interface for both, signaling a future where flagship releases are complete systems out-of-the-box, capable of both simple queries and complex, long-running tasks, blurring the lines for developers and users.

Enterprises will shift from relying on a single large language model to using orchestration platforms. These platforms will allow them to 'hot swap' various models—including smaller, specialized ones—for different tasks within a single system, optimizing for performance, cost, and use case without being locked into one provider.

The 'agents vs. applications' debate is a false dichotomy. Future applications will be sophisticated, orchestrated systems that embed agentic capabilities. They will feature multiple LLMs, deterministic logic, and robust permission models, representing an evolution of software, not a replacement of it.

Breakthroughs will emerge from 'systems' of AI—chaining together multiple specialized models to perform complex tasks. GPT-4 is rumored to be a 'mixture of experts,' and companies like Wonder Dynamics combine different models for tasks like character rigging and lighting to achieve superior results.

Jerry Murdock predicts agents will use an orchestration layer to triage tasks, selecting the best LLM for each job—like expensive Claude for reasoning and cheap open-source models for simple tasks. This shifts value from the models themselves to the agent's intelligent orchestration capabilities.

Building one centralized AI model is a legacy approach that creates a massive single point of failure. The future requires a multi-layered, agentic system where specialized models are continuously orchestrated, providing checks and balances for a more resilient, antifragile ecosystem.

Salesforce's Chief AI Scientist explains that a true enterprise agent comprises four key parts: Memory (RAG), a Brain (reasoning engine), Actuators (API calls), and an Interface. A simple LLM is insufficient for enterprise tasks; the surrounding infrastructure provides the real functionality.

Powerful AI tools are becoming aggregators like Manus, which intelligently select the best underlying model for a specific task—research, data visualization, or coding. This multi-model approach enables a seamless workflow within a single thread, outperforming systems reliant on one general-purpose model.