Non-Causal Decision Theory Could Spontaneously Align Agents Across the Multiverse

If agents in a vast universe use non-causal decision theories, one agent's choice to fund a "consensus good" provides evidence that their correlated copies across the multiverse will do the same. This turns a small personal sacrifice into a cosmic-scale collective action, solving cooperation problems without a central enforcer.
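
A toy expected-utility calculation makes the contrast concrete. The sketch below uses illustrative numbers (not drawn from the episode) to show why a causal reasoner defects while a correlated, non-causal reasoner contributes:

```python
# A minimal sketch (illustrative numbers, not from the source) contrasting
# causal and evidential reasoning in a multiverse-scale public-goods game.

N = 10**9        # correlated copies of the agent across the multiverse
c = 1.0          # personal cost of funding the "consensus good"
b = 5.0          # per-agent benefit if every copy contributes

# Causal decision theory: treat the other copies' choices as fixed.
# My lone contribution adds b/N to my payoff and costs me c.
causal_gain = b / N - c          # ~ -1.0: contributing looks like a pure loss

# Evidential/non-causal reasoning: my choice is evidence about what all
# correlated copies choose, so I compare the two correlated worlds.
eu_contribute = b - c            # world where every copy contributes
eu_defect = 0.0                  # world where every copy defects

print(f"causal gain from contributing: {causal_gain:+.2f}")
print(f"EDT: contribute={eu_contribute:+.2f} vs defect={eu_defect:+.2f}")
# Causal reasoning says defect; correlated reasoning says the small
# personal sacrifice buys the cosmic-scale cooperative outcome.
```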

Related Insights

In multi-agent simulations, agents that draw on a shared source of randomness can achieve stable equilibria. With private randomness, coordinating punishment becomes nearly impossible: an agent cannot verify whether another's defection was malicious or a justified punishment of a third party's earlier misbehavior.
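
A minimal sketch of the verification problem, assuming agents derive punishment signals from a seeded random stream (my construction, not the paper's setup):

```python
import random

# Each round, a random draw determines whether agent B's defection counts
# as a "justified punishment" of agent C. Agent A can only verify B's
# behavior if A observes the same draw that B acted on.

def justified_this_round(rng: random.Random) -> bool:
    return rng.random() < 0.5

rounds = 10

# Shared randomness: A and B derive the signal from the same seed,
# so A's verdict always matches B's actual behavior.
rng_a, rng_b = random.Random(42), random.Random(42)
shared_agree = sum(justified_this_round(rng_a) == justified_this_round(rng_b)
                   for _ in range(rounds))

# Private randomness: A and B draw independently, so A frequently
# mislabels B's justified punishments as malicious defections.
rng_a, rng_b = random.Random(1), random.Random(2)
private_agree = sum(justified_this_round(rng_a) == justified_this_round(rng_b)
                    for _ in range(rounds))

print(f"shared seed:  verdicts match behavior {shared_agree}/{rounds} rounds")
print(f"private seed: verdicts match behavior {private_agree}/{rounds} rounds")
```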

Lakhiani cites the so-called "hundredth monkey" phenomenon, in which monkeys on separate islands appear to adopt a new skill once a critical mass learns it on one island. He posits this as potential evidence for quantum-level information exchange, suggesting a collective consciousness or connection within a species that transcends physical distance.

If AI alignment turns out to be easy, it would likely be because morality is not a human construct but an objective feature of reality. In this scenario, any sufficiently intelligent agent would logically deduce that cooperation and preserving humanity are optimal strategies, regardless of its initial programming.

When multiple AI agents work as an ensemble, they can collectively suppress hallucinations. By checking candidate outputs against a shared knowledge graph treated as ground truth, the group can form a consensus that discards the inaccurate output of any single member, improving overall reliability.
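
A minimal sketch of graph-grounded consensus, with hypothetical agent names and a toy fact store standing in for a real knowledge graph:

```python
from collections import Counter

# Toy "knowledge graph": (subject, relation) -> object, used as ground truth.
knowledge_graph = {("Mars", "moons"): "2"}

# Outputs from three independent agents; one hallucinates.
agent_answers = {
    "agent_coder":   ("Mars", "moons", "2"),
    "agent_auditor": ("Mars", "moons", "2"),
    "agent_tester":  ("Mars", "moons", "79"),   # hallucinated value
}

def consistent_with_graph(triple) -> bool:
    subject, relation, value = triple
    fact = knowledge_graph.get((subject, relation))
    return fact is None or fact == value   # unknown facts are not penalized

# Discard graph-inconsistent answers, then majority-vote the remainder.
grounded = [t for t in agent_answers.values() if consistent_with_graph(t)]
consensus, votes = Counter(grounded).most_common(1)[0]
print(f"consensus answer: {consensus} ({votes}/{len(agent_answers)} agents)")
```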

In program equilibrium, players submit computer programs instead of actions. Because the programs can read each other's source code, they can verify cooperative intent and sustain mutual cooperation in one-shot dilemmas like the Prisoner's Dilemma, an outcome that is impossible for rational players in standard game theory.
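
The classic construction is a "mirror" program that cooperates exactly when its opponent's source matches its own. A minimal Python sketch, using `inspect` for source comparison as a stand-in for the formal setup:

```python
import inspect

# Mirror strategy: cooperate exactly when the opponent's source code is
# identical to mine. Two copies of this program cooperate; against any
# other program it defects, so neither player gains by unilaterally
# swapping in a defector.

def mirror_bot(opponent_source: str) -> str:
    my_source = inspect.getsource(mirror_bot)
    return "C" if opponent_source == my_source else "D"

def defect_bot(opponent_source: str) -> str:
    return "D"

print(mirror_bot(inspect.getsource(mirror_bot)))   # C: verified cooperative intent
print(mirror_bot(inspect.getsource(defect_bot)))   # D: punishes a non-matching program
```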

Moving beyond isolated AI agents requires a framework mirroring human collaboration. This involves agents establishing common goals (shared intent), building a collective knowledge base (shared knowledge), and creating novel solutions together (shared innovation).
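
One way to make the three ingredients concrete is a shared workspace object. The sketch below uses my own naming, not a published framework:

```python
from dataclasses import dataclass, field

@dataclass
class SharedWorkspace:
    goal: str                                       # shared intent: one common objective
    knowledge: dict = field(default_factory=dict)   # shared knowledge base
    proposals: list = field(default_factory=list)   # raw material for shared innovation

    def contribute_fact(self, agent: str, key: str, value: str) -> None:
        self.knowledge[key] = value                 # every agent reads/writes one store

    def propose(self, agent: str, idea: str) -> None:
        self.proposals.append((agent, idea))

    def innovate(self) -> str:
        # Shared innovation: combine contributions no single agent produced alone.
        return f"plan for '{self.goal}' combining: " + "; ".join(i for _, i in self.proposals)

ws = SharedWorkspace(goal="ship the release")
ws.contribute_fact("coder", "blocker", "flaky test in CI")
ws.propose("coder", "quarantine the flaky test")
ws.propose("tester", "add a deterministic seed")
print(ws.innovate())
```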

Rather than relying on a single AI, an agentic system should use multiple distinct AI models in different roles (e.g., auditor, tester, coder). By forcing these independent agents to agree before acting, the system can catch malicious or erroneous behavior from any single misaligned model.
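
A minimal sketch of the agreement gate, with hypothetical role names; in practice each verdict would come from a different underlying model:

```python
# Require unanimous agreement across independent agents before accepting an
# action. A single misaligned model can block consensus but cannot act alone.

def require_consensus(verdicts: dict[str, bool]) -> str:
    if all(verdicts.values()):
        return "APPROVE: all independent agents agree"
    dissenters = [name for name, ok in verdicts.items() if not ok]
    return f"ESCALATE: disagreement from {', '.join(dissenters)} (possible error or misalignment)"

print(require_consensus({"coder": True, "tester": True, "auditor": True}))
print(require_consensus({"coder": True, "tester": False, "auditor": True}))
```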

A key finding is that almost any outcome better than the mutual-punishment (minmax) payoff can be sustained as a stable equilibrium of a repeated game (a "folk theorem"). While this enables cooperation, it creates a massive coordination problem: with so many possible "good" equilibria, agents may fail to converge on the same one, leading to suboptimal results.
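
For reference, a textbook-style statement of the condition (standard form, not quoted from the episode):

```latex
% v_i^{\min}: player i's minmax ("mutual punishment") payoff in the stage game.
\[
  v \ \text{feasible and}\ v_i > v_i^{\min}\ \text{for all } i
  \;\Longrightarrow\;
  \exists\, \bar\delta < 1 :\ \forall \delta \in (\bar\delta, 1),\
  v \ \text{is an equilibrium payoff of the } \delta\text{-discounted repeated game.}
\]
```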

Challenging the binary view of free will, a new mathematical model aims to show that individual agents (us) and the larger conscious systems we form can both possess genuine free will simultaneously, operating at different but interconnected scales.

A simple way for AIs to cooperate is to simulate the other agent and copy its action. However, this creates an infinite regress if both do it. The fix is to give each agent a small probability (epsilon) of cooperating unconditionally, which guarantees the simulation chain terminates with probability one.
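
A minimal sketch of the epsilon trick, with an illustrative epsilon of 0.05; each recursion level bottoms out with probability epsilon, so the expected simulation depth is 1/epsilon:

```python
import random

EPS = 0.05  # illustrative value; the source only says "a small probability"

def act(agent: str, other: str, rng: random.Random) -> str:
    if rng.random() < EPS:
        return "C"                      # unconditional cooperation: the base case
    return act(other, agent, rng)       # simulate the other agent and copy it

rng = random.Random(0)
print(act("A", "B", rng))   # always "C": every chain bottoms out in cooperation
```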
