Today's LLMs Can't Handle Full APIs, Forcing Hand-Crafted MCP Tools

Related Insights

Designing LLM-Friendly APIs Is a New Ergonomics Challenge, Not Just an Engineering One

Making an API usable for an LLM is a novel design challenge, analogous to creating an ergonomic SDK for a human developer. It's not just about technical implementation; it requires a deep understanding of how the model "thinks," which is a difficult new research area.

MCP Servers: Teaching AI to Use the Internet Like Humans

AI & I·5 months ago

Dynamic MCPs Use a "Browse-and-Execute" Model to Manage Large APIs

To avoid overwhelming an LLM's context with hundreds of tools, a dynamic MCP approach offers just three: one to list available API endpoints, one to get details on a specific endpoint, and one to execute it. This scales well but increases latency and complexity due to the multiple turns required for a single action.

MCP Servers: Teaching AI to Use the Internet Like Humans

AI & I·5 months ago

Complex AI Products Require a Multi-Agent System to Avoid Context Rot

When building Spiral, a single large language model trying to both interview the user and write content failed due to "context rot." The solution was a multi-agent system where an "interviewer" agent hands off the full context to a separate "writer" agent, improving performance and reliability.

Spiral: Designing an AI Ghostwriter With Taste

AI & I·4 months ago

'Context Engineering' Has Replaced Simple Prompt Engineering in AI Development

The early focus on crafting the perfect prompt is obsolete. Sophisticated AI interaction is now about 'context engineering': architecting the entire environment by providing models with the right tools, data, and retrieval mechanisms to guide their reasoning process effectively.

How OpenAI Builds for 800 Million Weekly Users: Model Specialization and Fine-Tuning

a16z Podcast·3 months ago

A Single Code Execution Tool Is More Scalable Than a Large Set of MCP Tools

Instead of giving an LLM hundreds of specific tools, a more scalable "cyborg" approach is to provide one tool: a sandboxed code execution environment. The LLM writes code against a company's SDK, which is more context-efficient, faster, and more flexible than multiple API round-trips.

MCP Servers: Teaching AI to Use the Internet Like Humans

AI & I·5 months ago

Composable Platforms Enable Faster, Lighter Development of Specialized AI Tools

Using a composable, 'plug and play' architecture allows teams to build specialized AI agents faster and with less overhead than integrating a monolithic third-party tool. This approach enables the creation of lightweight, tailored solutions for niche use cases without the complexity of external API integrations, containing the entire workflow within one platform.

#764: Closing the gap between brand promise and brand experience with Mark Wagner, Horizontal Digital

The Agile Brand with Greg Kihlström®: Expert Mode Marketing Technology, AI, & CX·3 months ago

Software Documentation Is Evolving into an API for AI Coding Agents

Documentation is shifting from a passive reference for humans to an active, queryable context for AI agents. Well-structured docs on internal APIs and class hierarchies become crucial for agent performance, reducing inefficient and slow context window stuffing for faster code generation.

The $3 Trillion AI Coding Opportunity

a16z Show·2 months ago

AI Isn't in a Bubble; We're Underutilizing Models Due to a 'Capability Overhang'

The perceived limits of today's AI are not inherent to the models themselves but to our failure to build the right "agentic scaffold" around them. There's a "model capability overhang" where much more potential can be unlocked with better prompting, context engineering, and tool integrations.

20VC: Scale, Surge, Turing, Mercor: Who Wins & Who Loses in Data Labelling | Is Revenue in Data Labelling Real or GMV? | Why 99% of Knowledge Work Will Go and What Happens Then? | Why SaaS is Dead in a World of AI with Jonathan Siddharth @ Turing

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch·3 months ago

The 'Model Selector' Is an Obsolete Abstraction; Tightly-Coupled Agents Are the Future

The developer abstraction layer is moving up from the model API to the agent. A generic interface for switching models is insufficient because it creates a 'lowest common denominator' product. Real power comes from tightly binding a specific model to an agentic loop with compute and file system access.

⚡ Inside GitHub’s AI Revolution: Jared Palmer Reveals Agent HQ & The Future of Coding Agents

Latent Space: The AI Engineer Podcast·3 months ago

Enterprise AI Agents Are Complex Systems, Not Just LLMs with a Wrapper

Salesforce's Chief AI Scientist explains that a true enterprise agent comprises four key parts: Memory (RAG), a Brain (reasoning engine), Actuators (API calls), and an Interface. A simple LLM is insufficient for enterprise tasks; the surrounding infrastructure provides the real functionality.

How Salesforce Is Using AI to Power the Enterprise

AI & I·4 months ago