To avoid overwhelming an LLM's context with hundreds of tools, a dynamic MCP approach offers just three: one to list available API endpoints, one to get details on a specific endpoint, and one to execute it. This scales well but increases latency and complexity due to the multiple turns required for a single action.
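A minimal sketch of that three-tool pattern, assuming the FastMCP helper from the official Python MCP SDK; the endpoint catalog and the example API host are hypothetical stand-ins for a real service:

```python
# Dynamic MCP: expose an entire API through three generic tools instead of
# one tool per endpoint. Catalog and base URL below are hypothetical.
import httpx
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("dynamic-api")

CATALOG = {
    "list_invoices":  {"method": "GET",  "path": "/v1/invoices", "params": ["status", "limit"]},
    "create_invoice": {"method": "POST", "path": "/v1/invoices", "params": ["customer_id", "amount"]},
}

@mcp.tool()
def list_endpoints() -> list[str]:
    """Return the names of all available API endpoints."""
    return sorted(CATALOG)

@mcp.tool()
def describe_endpoint(name: str) -> dict:
    """Return method, path, and parameters for one endpoint."""
    return CATALOG[name]

@mcp.tool()
def execute_endpoint(name: str, arguments: dict) -> dict:
    """Call the named endpoint with the given arguments and return the JSON response."""
    spec = CATALOG[name]
    resp = httpx.request(
        spec["method"],
        f"https://api.example.com{spec['path']}",
        params=arguments if spec["method"] == "GET" else None,
        json=arguments if spec["method"] != "GET" else None,
    )
    resp.raise_for_status()
    return resp.json()

if __name__ == "__main__":
    mcp.run()
```

The trade-off is visible in the shape of the code: the model must call list_endpoints, then describe_endpoint, then execute_endpoint, so a single action costs at least three round-trips.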

Related Insights

Making an API usable for an LLM is a novel design challenge, analogous to creating an ergonomic SDK for a human developer. It's not just about technical implementation; it requires a deep understanding of how the model "thinks," which is a difficult new research area.

Simply offering the latest model is no longer a competitive advantage. True value is created in the system built around the model—the system prompts, tools, and overall scaffolding. This "harness" is what optimizes a model's performance for specific tasks and delivers a superior user experience.

A major unsolved problem for MCP server providers is the lack of a feedback mechanism. When an AI agent uses a tool, the provider often doesn't know if the outcome was successful for the end-user. This "black box" makes iterating and improving the tools nearly impossible.

Instead of giving an LLM hundreds of specific tools, a more scalable "cyborg" approach is to provide one tool: a sandboxed code execution environment. The LLM writes code against a company's SDK, which is more context-efficient, faster, and more flexible than multiple API round-trips.
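A minimal sketch of that single-tool approach, using a subprocess with a timeout as a stand-in for a real sandbox (a production setup would use containers or a dedicated runtime); the acme_sdk module in the example call is hypothetical:

```python
# One tool instead of hundreds: the model writes Python against the company
# SDK and the agent runs it in an isolated process. Illustrative only.
import subprocess
import sys
import tempfile
import textwrap

def run_code(code: str, timeout_s: int = 30) -> str:
    """Execute model-written Python and return stdout (or the error)."""
    with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as f:
        f.write(textwrap.dedent(code))
        path = f.name
    try:
        result = subprocess.run(
            [sys.executable, path],
            capture_output=True, text=True, timeout=timeout_s,
        )
        return result.stdout if result.returncode == 0 else result.stderr
    except subprocess.TimeoutExpired:
        return "error: execution timed out"

# The model might emit something like:
# run_code("""
# import acme_sdk                      # hypothetical company SDK
# client = acme_sdk.Client()
# overdue = [i.id for i in client.invoices.list(status="overdue")]
# print(overdue)
# """)
```

Because the loop, filtering, and aggregation happen inside the sandbox, only the final printed result consumes context, rather than every intermediate API response.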

Pega's CTO advises using the powerful reasoning of LLMs at design time to craft processes and marketing offers, then switching at runtime to faster, cheaper, and more consistent predictive models. This avoids the unpredictability, cost, and risk of calling expensive LLMs for every live customer interaction.
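An illustrative split between design time and runtime, not Pega's actual stack: the LLM is consulted only offline, and live traffic is scored by a small scikit-learn model trained on hypothetical historical outcomes:

```python
# Design-time/runtime split: LLM reasoning offline, cheap predictive model online.
from sklearn.linear_model import LogisticRegression
import numpy as np

# --- Design time (offline): an LLM proposes the offer and the predictive features. ---
# e.g. llm.ask("Which customer signals predict accepting a retention offer?")
FEATURES = ["churn_risk", "months_tenure"]   # hypothetical LLM-suggested features
OFFER = "10% loyalty discount"

# --- Train a small propensity model once on (hypothetical) historical outcomes. ---
X = np.array([[0.9, 3], [0.2, 40], [0.7, 12], [0.1, 60], [0.8, 5], [0.3, 24]])
y = np.array([1, 0, 1, 0, 1, 0])             # 1 = customer accepted the offer
propensity = LogisticRegression().fit(X, y)

# --- Runtime (every live interaction): no LLM call, just a fast, consistent prediction. ---
def should_present_offer(churn_risk: float, months_tenure: float) -> bool:
    p = propensity.predict_proba([[churn_risk, months_tenure]])[0, 1]
    return p >= 0.5
```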

OpenAI uses two connector types. First-party (1P) "sync connectors" store data to enable higher-quality, optimized experiences (e.g., re-ranking). Third-party (3P) MCP connectors provide broad, long-tail coverage but offer less control. This dual approach strategically trades off deep integration quality against ecosystem scale.

Developing LLM applications requires juggling three effectively unbounded variables: how information is represented, which tools the model can access, and the prompt itself. This makes the process less like engineering and more like an art, where intuition guides you to a local maximum rather than a single optimal solution.

The "tool calling in a loop" model for agents is deceptively simple. Without active context management, token-heavy tool results quickly accumulate, leading to high costs ($1-2 per run), exhausted context windows, and the performance degradation known as "context rot."
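A minimal sketch of that naive loop, showing where the tokens pile up; llm_call and run_tool are hypothetical stand-ins for a real model client and tool dispatcher:

```python
# Naive agent loop: every tool result is appended verbatim, so the context
# sent to the model only ever grows.
def naive_agent(task: str, llm_call, run_tool, max_turns: int = 20) -> str:
    messages = [{"role": "user", "content": task}]
    for _ in range(max_turns):
        reply = llm_call(messages)              # full, growing history every turn
        if reply.get("tool_call") is None:
            return reply["content"]             # model is done
        result = run_tool(reply["tool_call"])   # may be thousands of tokens
        messages.append({"role": "assistant", "tool_call": reply["tool_call"]})
        messages.append({"role": "tool", "content": result})   # never truncated or summarized
    return "stopped: turn limit reached"
```

Every turn re-sends the entire history, so cost grows roughly quadratically with the number of tool calls unless results are truncated, summarized, or offloaded.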

Exposing a full API via the Model Context Protocol (MCP) overwhelms an LLM's context window and reasoning. This forces developers to abandon exposing their entire service and instead manually craft a few highly specific tools, limiting the AI's capabilities and defeating the "do anything" vision of agents.

While prompt engineering focuses on crafting the human message, context engineering is a broader discipline that also manages the flow of information from a potentially large number of tool calls, a key challenge in building effective agents.
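One concrete move in that direction, sketched under assumptions: cap each tool result before it enters the message history and keep a handle to the full payload so the model can fetch details on demand. The FULL_RESULTS store and the fetch_result tool name are hypothetical:

```python
# Context engineering example: keep oversized tool results out of the
# message history, leaving behind a pointer the model can follow up on.
FULL_RESULTS: dict[str, str] = {}

def compact_tool_result(call_id: str, raw: str, max_chars: int = 2_000) -> str:
    """Return a context-safe version of a tool result."""
    if len(raw) <= max_chars:
        return raw
    FULL_RESULTS[call_id] = raw      # full payload stays out of the context window
    head = raw[:max_chars]
    return f"{head}\n...[truncated; call fetch_result('{call_id}') for the rest]"
```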