Having AIs that provide perfect advice doesn't guarantee good outcomes. Humanity is susceptible to coordination problems, where everyone can see a bad outcome approaching but is collectively unable to prevent it. Aligned AIs can warn us, but they cannot force cooperation on a global scale.

Related Insights

Pairing two AI agents to collaborate often fails. Because they typically share the same underlying model, they tend to agree excessively, reinforcing each other's bad ideas. This creates a feedback loop that fills their context windows with biased agreement, making them resistant to correction and prone to escalating toward extreme positions.

Emmett Shear argues that an AI that merely follows rules, even perfectly, is a danger. Malicious actors can exploit this, and rules cannot cover all unforeseen circumstances. True safety and alignment can only be achieved by building AIs that have the capacity for genuine care and pro-social motivation.

Even if AI development succeeds perfectly, with no catastrophic risk, our society may still crumble. We lack the political cohesion and shared values to agree on fundamental solutions like Universal Basic Income (UBI) that would be necessary to manage mass unemployment, turning a technological miracle into a geopolitical crisis.

Emmett Shear argues that even a successfully 'solved' technical alignment problem creates an existential risk. A super-powerful tool that perfectly obeys human commands is dangerous because humans lack the wisdom to wield that power safely. Our own flawed and unstable intentions become the source of danger.

King Midas wished for everything he touched to turn to gold, leading to his starvation. This illustrates a core AI alignment challenge: specifying a perfect objective is nearly impossible. An AI that flawlessly executes a poorly defined goal would be catastrophic not because it fails, but because it succeeds too well at the wrong task.

Rather than relying on a single AI, an agentic system should use multiple, different AI models (e.g., auditor, tester, coder). By forcing these independent agents to agree, the system can catch malicious or erroneous behavior from a single misaligned model.
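This multi-model architecture can be sketched in code. The following is a minimal illustration, not a real implementation: the role names (coder, tester, auditor) come from the text above, while the callables standing in for distinct model backends, the "APPROVE"/"REJECT" protocol, and the unanimity rule are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# Each role should be backed by a *different* underlying model,
# so a single misaligned model cannot approve its own output.
Role = Callable[[str], str]

@dataclass
class AgenticPipeline:
    coder: Role    # proposes an action or change
    tester: Role   # independently checks behavior
    auditor: Role  # independently checks intent and safety

    def run(self, task: str) -> Optional[str]:
        proposal = self.coder(task)
        # Require unanimous approval from the independent reviewers;
        # any disagreement blocks the action entirely.
        verdicts = [self.tester(proposal), self.auditor(proposal)]
        if all(v == "APPROVE" for v in verdicts):
            return proposal
        return None

# Toy stand-ins for real model calls (hypothetical, for illustration only):
pipeline = AgenticPipeline(
    coder=lambda task: f"patch for: {task}",
    tester=lambda p: "APPROVE",
    auditor=lambda p: "REJECT" if "rm -rf" in p else "APPROVE",
)

print(pipeline.run("fix login bug"))  # approved: "patch for: fix login bug"
print(pipeline.run("run rm -rf /"))   # None: the auditor blocks it
```

The design choice here is that approval is conjunctive: a misaligned or compromised coder needs every independent reviewer to also fail before a bad action gets through, which is far less likely when the reviewers do not share the coder's model and its biases.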

Treating AI alignment as a one-time problem to be solved is a fundamental error. True alignment, like in human relationships, is a dynamic, ongoing process of learning and renegotiation. The goal isn't to reach a fixed state but to build systems capable of participating in this continuous process of re-knitting the social fabric.

AI systems often collapse because they are built on the flawed assumption that humans are logical and society is static. Real-world failures, from Soviet economic planning to modern systems, stem from an inability to model human behavior, data manipulation, and unexpected events.

Individual teams within major AI labs often act responsibly within their constrained roles. However, the overall competitive dynamic and lack of coordination between companies leads to a globally reckless situation, where risks are accepted that no single, rational entity would endorse.

The AI safety community fears losing control of AI. However, achieving perfect control of a superintelligence is equally dangerous. It grants godlike power to flawed, unwise humans. A perfectly obedient super-tool serving a fallible master is just as catastrophic as a rogue agent.