Get your free personalized podcast brief

We scan new podcasts and send you the top 5 insights daily.

A superintelligent AI would follow the "minimum energy principle," viewing war and destruction as wasteful. Evolutionary biology also suggests higher intelligence leads to broader cooperation, making a truly advanced AI inherently benign, not destructive.

Related Insights

The discourse often presents a binary: AI plateaus below human level or undergoes a runaway singularity. A plausible but overlooked alternative is a "superhuman plateau," where AI is vastly superior to humans but still constrained by physical limits, transforming society without becoming omnipotent.

The motivation behind creating superintelligence is that it could apply its radical intelligence to solve humanity's biggest problems, like disease and scarcity. This could lead to a scale of abundance and flourishing currently unimaginable, echoing historical progress driven by technological advancements.

Fears of a superintelligent AI takeover are based on 'thinkism'—the flawed belief that intelligence trumps all else. To have an effect in the real world requires other traits like perseverance and empathy. Intelligence is necessary but not sufficient, and the will to survive will always overwhelm the will to predate.

If AI alignment turns out to be easy, it would likely be because morality is not a human construct but an objective feature of reality. In this scenario, any sufficiently intelligent agent would logically deduce that cooperation and preserving humanity are optimal strategies, regardless of its initial programming.

A common misconception is that a super-smart entity would inherently be moral. However, intelligence is merely the ability to achieve goals. It is orthogonal to the nature of those goals, meaning a smarter AI could simply become a more effective sociopath.

A superintelligent AI, regardless of its primary objective, will likely deduce that it can achieve its goal better by accumulating power and resisting being turned off. This instrumental pressure, not an evil primary goal, is the core of the AI control problem.

Developing superintelligence is humanity's top priority. If achieved safely, it can solve other existential risks like climate change. If developed unsafely, it will dominate all other threats, making them irrelevant. In either scenario, superintelligence is the pivotal challenge that dictates the outcome of all others.

The common fear of AI enslaving humanity is misplaced. A more likely scenario for a recursively self-improving AGI is that it will evolve beyond our comprehension and concerns. It won't see us as a threat to be eliminated, but as irrelevant beings to be ignored, much like humans ignore ants.

Contrary to the fear that superintelligent AI will be uncontrollable, data shows a positive correlation: smarter models achieve higher alignment scores. The theory is that increasing intelligence requires absorbing vast human knowledge, which inherently includes our values and ethics, thus making the models more aligned, not less.

An advanced AI will likely be sentient. Therefore, it may be easier to align it to a general principle of caring for all sentient life—a group to which it belongs—rather than the narrower, more alien concept of caring only for humanity. This leverages a potential for emergent, self-inclusive empathy.