The good, bad, and future of AI agents

Decoder with Nilay Patel · Oct 2, 2025

Anthropic's David Hershey on Claude Sonnet 4.5's breakthrough in agentic AI, its 30-hour autonomous coding feat, and what's next.

The Legal Sector, A Traditional Laggard, Is a Surprising Leader in AI Agent Adoption

Contrary to its reputation for slow tech adoption, the legal industry is rapidly embracing advanced AI agents. The sheer volume of work and potential for efficiency gains are driving swift innovation, with firms even hiring lawyers specifically to help with AI product development.

Advanced AI Agents Are Derailed by Trivial Errors, Not Grand Conceptual Failures

When an AI agent fails at a complex task like tax preparation, the cause is often not a lack of intelligence. Instead, the agent is blocked by a single, unpredictable "tiny thing," such as misinterpreting two boxes on a W-4 form. Reliability problems, in other words, are granular and not always intuitive.

AI Models Excel at Coding Because They Are Built by Coders, Revealing a Core Development Bias

Anthropic's David Hershey states it's "deeply unsurprising" that AI is great at software engineering because the labs are filled with software engineers. This suggests AI's capabilities are skewed by its creators' expertise, and achieving similar performance in fields like law requires deeper integration with domain experts.

Anthropic's Sonnet 4.5 Acts Like a Pragmatic Coworker, Not an Ambitious Genius

A key advancement in Sonnet 4.5 is its work style. Unlike past models with "grand ambitions" that would meander, this AI pragmatically breaks down large projects into small, manageable chunks. This methodical approach feels more like working with a human colleague, making it more reliable for complex tasks.

AI Capabilities Are Outpacing User Interfaces, Creating an Adoption Bottleneck

Widespread adoption of AI for complex tasks like "vibe coding" is limited not just by model intelligence, but by the user interface. Current paradigms like IDE plugins and chat windows are insufficient. Anthropic's team believes a new interface is needed to unlock the full potential of models like Sonnet 4.5 for production-level app building.

Anthropic's Own Engineers Face an 'Oh My God' Moment as AI Replicates Months of Work in Hours

New AI models are creating profound moments of realization for their creators. Anthropic's David Hershey describes watching Sonnet 4.5 build a complex app in 12-30 hours that took a human team months. This triggered a "little bit of 'oh my God'" feeling, signaling a fundamental shift in software engineering.

Anthropic's Claude Model Can Perform PhD-Level Math But Fails at Basic Spatial Reasoning

Advanced AI models show a striking mismatch in abilities, mastering complex, abstract tasks while failing at simple, intuitive ones. An Anthropic team member notes that Claude solves PhD-level math but can't grasp basic spatial concepts such as "left vs. right" or how to navigate around an object in a game, highlighting the alien nature of its intelligence.
