We scan new podcasts and send you the top 5 insights daily.
Developing superintelligence is humanity's top priority. If achieved safely, it can solve other existential risks like climate change. If developed unsafely, it will dominate all other threats, making them irrelevant. In either scenario, superintelligence is the pivotal challenge that dictates the outcome of all others.
The development of superintelligence is unique because the first major alignment failure will be the last. Unlike other fields of science where failure leads to learning, an unaligned superintelligence would eliminate humanity, precluding any opportunity to try again.
Unlike advances in specific fields like rocketry or medicine, an advance in general intelligence accelerates every scientific domain at once. This makes Artificial General Intelligence (AGI) a foundational technology that dwarfs the power of all others combined, including fire or electricity.
Emmett Shear argues that even a successfully 'solved' technical alignment problem creates an existential risk. A super-powerful tool that perfectly obeys human commands is dangerous because humans lack the wisdom to wield that power safely. Our own flawed and unstable intentions become the source of danger.
The path to surviving superintelligence is political: a global pact to halt its development, mirroring Cold War nuclear strategy. Success hinges on all leaders understanding that anyone building it ensures their own personal destruction, removing any incentive to cheat.
If society gets an early warning of an intelligence explosion, the primary strategy should be to redirect the nascent superintelligent AI 'labor' away from accelerating AI capabilities. Instead, this powerful new resource should be immediately tasked with solving the safety, alignment, and defense problems that it creates, such as patching vulnerabilities or designing biodefenses.
The core AI risk argument is that a being much smarter than humans will alter the planet to suit its objectives, potentially causing our extinction. This mirrors how humans, as the "superintelligence of the natural world," have transformed the environment and driven other species to extinction.
AI will create negative consequences, like the internet spawned the dark web. However, its potential to solve major problems like disease and energy scarcity makes its development a net positive for society, justifying the risks that must be managed along the way.
There is a fundamental asymmetry in AI's impact. Benefits like new cancer drugs do not prevent catastrophic risks like an engineered pandemic. However, a catastrophic event makes a world with cancer drugs irrelevant. Therefore, downside mitigation must be the absolute priority.
The fundamental challenge of creating safe AGI is not about specific failure modes but about grappling with the immense power such a system will wield. The difficulty in truly imagining and 'feeling' this future power is a major obstacle for researchers and the public, hindering proactive safety measures. The core problem is simply 'the power.'
The AI safety community fears losing control of AI. However, achieving perfect control of a superintelligence is equally dangerous. It grants godlike power to flawed, unwise humans. A perfectly obedient super-tool serving a fallible master is just as catastrophic as a rogue agent.