The latest models from Anthropic and OpenAI show a convergence in capabilities. The distinction between a "coding model" and a "general knowledge model" is blurring because the core skills for advanced software development—like planning and tool use—are the same skills needed to excel at any complex knowledge work.
AI agents built for coding are being used for general knowledge work like creating slide decks or analyzing health data. These agents autonomously write scripts to crawl websites, bypass bot protection, and analyze information, making them a superpower for any computer-based professional, not just developers.
Researchers from Anthropic, XAI, and Google are publicly stating that Claude's advanced coding abilities feel like a form of AGI, capable of replicating a year's worth of human engineering work in just one hour.
Anthropic dominated the crucial developer market by strategically focusing on coding, believing it to be the best predictor of a model's overall reasoning abilities. This targeted approach allowed their Claude models to consistently excel in this vertical, making agentic coding the breakout AI use case of the year and building an incredibly loyal developer following.
The latest models from Anthropic (Opus 4.6) and OpenAI (Codex 5.3) represent two distinct engineering methodologies. Opus is an autonomous agent you delegate to, while Codex is an interactive collaborator you pair-program with. Choosing a model is now a workflow decision, not just a performance one.
OpenAI has quietly launched "skills" for its models, following the same open standard as Anthropic's Claude. This suggests a future where AI agent capabilities are reusable and interoperable across different platforms, making them significantly more powerful and easier to develop for.
AI platforms using the same base model (e.g., Claude) can produce vastly different results. The key differentiator is the proprietary 'agent' layer built on top, which gives the model specific tools to interact with code (read, write, edit files). A superior agent leads to superior performance.
The ability to code is not just another domain for AI; it's a meta-skill. An AI that can program can build tools on demand to solve problems in nearly any digital domain, effectively simulating general competence. This makes mastery of code a form of instrumental, functional AGI for most economically valuable work.
The differing capabilities of new AI models align with distinct engineering roles. Anthropic's Opus 4.6 acts like a thoughtful "staff engineer," excelling at code comprehension and architectural refactors. In contrast, OpenAI's Codex 5.3 is the scrappy "founding engineer," optimized for rapid, end-to-end application generation.
Replit CEO Amjad Massad argues that the ability to write and execute code is a form of general intelligence. This insight suggests that building general-purpose coding agents will outperform handcrafting specialized, expert-knowledge agents for specific verticals, representing a more direct and scalable approach to achieving AGI.
To effectively interact with the world and use a computer, an AI is most powerful when it can write code. OpenAI's thesis is that even agents for non-technical users will be "coding agents" under the hood, as code is the most robust and versatile way for AI to perform tasks.