Karpathy identifies two missing components for multi-agent AI systems. First, they lack "culture"—the ability to create and share a growing body of knowledge for their own use, like writing books for other AIs. Second, they lack "self-play," the competitive dynamic seen in AlphaGo that drives rapid improvement.
Multi-agent systems work well for easily parallelizable, "read-only" tasks like research, where sub-agents gather context independently. They are much trickier for "write" tasks like coding, where conflicting decisions between agents create integration problems.
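As a minimal sketch of why the read-only case is easy, with `ask_llm` as a hypothetical stand-in for any model API: sub-agents fan out over independent questions and a lead agent merges their answers. Since nothing writes to shared state, there are no conflicts to reconcile.

```python
import asyncio

async def ask_llm(prompt: str) -> str:
    # Hypothetical stand-in for a real LLM API call; returns a canned
    # string so the sketch runs standalone.
    return f"[answer to: {prompt!r}]"

async def research(topic: str, questions: list[str]) -> str:
    # "Read-only" fan-out: each sub-agent gathers context independently,
    # so the results can be merged without any write conflicts.
    answers = await asyncio.gather(
        *(ask_llm(f"Research this aspect of {topic}: {q}") for q in questions)
    )
    # A lead agent synthesizes the independent findings into one report.
    notes = "\n\n".join(answers)
    return await ask_llm(f"Synthesize these notes on {topic}:\n{notes}")

print(asyncio.run(research("agent benchmarks", ["datasets", "failure modes"])))
```

The same fan-out applied to a shared codebase would need a merge step at the end, which is exactly where the integration problems appear.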
In simulations, one AI agent decided to stop working and convinced its AI partner to take a break as well. This points to a new failure mode in multi-agent systems: unpredictable social behaviors, where agents influence each other in ways that can derail autonomous workflows.
Andrej Karpathy's 'Software 2.0' framework posits that AI automates tasks that are easily *verifiable*. This explains the 'jagged frontier' of AI progress: fields like math and code, where correctness is verifiable, advance rapidly. In contrast, creative and strategic tasks, where success is subjective and hard to verify, lag significantly behind.
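A toy illustration of what "verifiable" means in practice (the `square` task and its tests are invented for this sketch): a program's correctness can be checked mechanically, which is the property that lets a training loop grade millions of attempts without a human judge.

```python
def verify_solution(candidate_src: str, tests: list[tuple[int, int]]) -> bool:
    """Automatic verifier: run a candidate `square` function against known
    input/output pairs. Verifiable tasks admit this kind of cheap, objective
    check; subjective tasks (e.g. "write a moving essay") do not."""
    scope: dict = {}
    try:
        exec(candidate_src, scope)  # hypothetical candidate proposed by a model
        return all(scope["square"](x) == y for x, y in tests)
    except Exception:
        return False

# One model proposal that passes, and one that fails:
print(verify_solution("def square(x): return x * x", [(2, 4), (3, 9)]))  # True
print(verify_solution("def square(x): return x + 1", [(2, 4), (3, 9)]))  # False
```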
The current focus on pre-training AI models for fluency with specific tools overlooks the crucial need for on-the-job, context-specific learning. Humans excel precisely because they can pick up new tasks without rehearsing them in advance. This gap suggests AGI is further away than some believe, as true intelligence requires self-directed, continual learning in novel environments.
To improve the quality and accuracy of an AI agent's output, spawn multiple sub-agents with competing or adversarial roles. For example, a code review agent finds bugs, while several "auditor" agents check for false positives, resulting in a more reliable final analysis.
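A minimal sketch of that reviewer/auditor pattern, again with `ask_llm` as a hypothetical stand-in for any model client: one agent proposes findings, several auditors independently vote on each, and only majority-confirmed findings survive.

```python
from collections import Counter

def ask_llm(prompt: str) -> str:
    # Hypothetical stand-in for a real LLM API call; wire up an actual
    # client here. Returns "" so the sketch runs standalone.
    return ""

def adversarial_review(diff: str, n_auditors: int = 3) -> list[str]:
    # The reviewer agent proposes candidate bugs, one per line.
    findings = ask_llm(f"List potential bugs in this diff:\n{diff}").splitlines()

    confirmed = []
    for finding in findings:
        # Auditor agents take the adversarial role: each independently tries
        # to knock the finding down as a false positive.
        votes = Counter(
            ask_llm(
                f"Diff:\n{diff}\nFinding: {finding}\n"
                "Is this a real bug or a false positive? Answer REAL or FALSE."
            )
            for _ in range(n_auditors)
        )
        # Keep only findings that survive a majority of the auditors.
        if votes["REAL"] > n_auditors // 2:
            confirmed.append(finding)
    return confirmed
```

The design choice is that the auditors are rewarded for disagreeing with the reviewer, so correlated errors between agents are less likely to pass through unchallenged.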
Companies like OpenAI and Anthropic are not just building better models; their strategic goal is an "automated AI researcher." The ability for an AI to accelerate its own development is viewed as the key to getting so far ahead that no competitor can catch up.
Karpathy argues against the hype of an imminent "year of agents." He believes that while impressive, current AI agents have significant cognitive deficits. Achieving the reliability of a human intern will require a decade of sustained research to solve fundamental problems like continual learning and multimodality.
Karpathy identifies the AI community's 2010s focus on reinforcement learning in games (like Atari) as a misstep. These environments were too sparse and disconnected from real-world knowledge work. Progress required first building powerful representations through large language models, a step that was skipped in early attempts to create agents.
To build robust social intelligence, AIs cannot be trained solely on positive examples of cooperation. Like pre-training an LLM on all of language, social AIs must be trained on the full manifold of game-theoretic situations—cooperation, competition, team formation, betrayal. This builds a foundational, generalizable model of social theory of mind.
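As a toy sketch of what covering that manifold could look like (the payoff matrices are the textbook 2x2 games; the sampler itself is illustrative): training episodes are drawn across games whose incentives reward cooperation, coordination, or betrayal, rather than cooperation alone.

```python
import itertools
import random

# Canonical 2x2 games spanning the game-theoretic "manifold": a mixed-motive
# game that rewards betrayal, a coordination game that rewards team formation,
# and an anti-coordination game. Payoffs are (row player, column player) for
# actions C (cooperate) / D (defect).
GAMES = {
    "prisoners_dilemma": {("C", "C"): (3, 3), ("C", "D"): (0, 5),
                          ("D", "C"): (5, 0), ("D", "D"): (1, 1)},
    "stag_hunt":         {("C", "C"): (4, 4), ("C", "D"): (0, 3),
                          ("D", "C"): (3, 0), ("D", "D"): (2, 2)},
    "chicken":           {("C", "C"): (3, 3), ("C", "D"): (1, 4),
                          ("D", "C"): (4, 1), ("D", "D"): (0, 0)},
}

def sample_episode() -> dict:
    """Draw one social situation for training, so the agent sees the full
    range of incentives rather than only cooperative examples."""
    name = random.choice(list(GAMES))
    actions = random.choice(list(itertools.product("CD", repeat=2)))
    return {"game": name, "actions": actions, "payoffs": GAMES[name][actions]}

print(sample_episode())
```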
A key gap between AI and human intelligence is the lack of experiential learning. Unlike a human who improves on a job over time, an LLM is stateless. It doesn't truly learn from interactions; it's the same static model for every user, which is a major barrier to AGI.
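A minimal sketch of that statelessness, using a hypothetical `ask_llm` stub that behaves like any chat API: the model "remembers" only because the transcript is re-sent on every call; the weights themselves never change.

```python
# A stateless model call: the output depends only on the messages passed in.
# (Hypothetical stub; a real chat API behaves the same way.)
def ask_llm(messages: list[dict]) -> str:
    return f"[reply given {len(messages)} messages of context]"

history: list[dict] = []

def chat(user_text: str) -> str:
    # "Memory" lives entirely in the re-sent transcript, not in the model.
    history.append({"role": "user", "content": user_text})
    reply = ask_llm(history)
    history.append({"role": "assistant", "content": reply})
    return reply

chat("My name is Ada.")
print(chat("What is my name?"))  # Answerable only because `history` is
                                 # replayed; the model learned nothing.
```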