Individual teams within major AI labs often act responsibly within their constrained roles. However, the overall competitive dynamic and the lack of coordination between companies lead to a globally reckless situation, where risks are accepted that no single rational entity would endorse.
AI labs may initially conceal a model's "chain of thought" for safety reasons. However, once a competitor exposes this internal reasoning and users prefer it, market pressure forces the others to follow suit, showing how competition can compel companies to abandon safety measures rather than cede an edge.
Game theory explains why AI development won't stop. For competing nations like the US and China, the risk of falling behind is greater than the collective risk of developing the technology. This dynamic makes the AI race an unstoppable force, mirroring the Cold War nuclear arms race and rendering calls for a pause futile.
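To make that reasoning concrete, here is a minimal sketch of the payoff structure the argument assumes, framed as a two-player prisoner's dilemma. The payoff numbers are purely illustrative, not estimates from the source.

```python
# Illustrative two-player payoff matrix for the AI race, framed as a
# prisoner's dilemma. Payoff values are hypothetical, chosen only to
# show the structure of the argument: whatever the rival does, racing
# yields a higher individual payoff, even though (pause, pause) is
# better for both than (race, race).

# payoffs[(my_move, their_move)] = my payoff
payoffs = {
    ("pause", "pause"): 3,   # coordinated slowdown: shared safety benefit
    ("pause", "race"): 0,    # fall behind a rival who keeps building
    ("race", "pause"): 4,    # gain a decisive lead
    ("race", "race"): 1,     # everyone races: collective risk accepted
}

def best_response(their_move: str) -> str:
    """Return the move that maximizes my payoff given the rival's move."""
    return max(("pause", "race"), key=lambda my_move: payoffs[(my_move, their_move)])

for their_move in ("pause", "race"):
    print(f"If the rival chooses {their_move!r}, my best response is {best_response(their_move)!r}")

# Output:
# If the rival chooses 'pause', my best response is 'race'
# If the rival chooses 'race', my best response is 'race'
```

Under these assumed payoffs, racing dominates no matter what the rival does, so (race, race) is the equilibrium even though both sides would prefer a coordinated pause; that is the sense in which the race is described as unstoppable.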
Many top AI CEOs openly acknowledge the extinction-level risks of their work, with some putting the odds as high as 25%. Yet they feel powerless to stop the race: if a CEO paused for safety, investors would simply replace them with someone willing to push forward, creating a systemic trap where everyone sees the danger but no one can afford to hit the brakes.
AI leaders aren't ignoring risks because they're malicious, but because they are trapped in a high-stakes competitive race. This "code red" environment incentivizes patching safety issues case-by-case rather than fundamentally re-architecting AI systems to be safe by construction.
The way AI companies downplay existential risk mirrors historical cases like the tobacco and leaded gasoline industries. In both, immense, long-term public harm was knowingly caused for comparatively small corporate gains, enabled by powerful self-deception and rationalization.
A fundamental tension within OpenAI's board was the catch-22 of safety. While some advocated for slowing down, others argued that being too cautious would allow a less scrupulous competitor to achieve AGI first, creating an even greater safety risk for humanity. This paradox fueled internal conflict and justified a rapid development pace.
The most significant barrier to creating a safer AI future is the pervasive narrative that its current trajectory is inevitable. The logic of "if I don't build it, someone else will" creates a self-fulfilling prophecy of recklessness, preventing the collective action needed to steer development.
Other scientific fields operate under a "precautionary principle," avoiding experiments with even a small chance of catastrophic outcomes (e.g., creating dangerous new lifeforms). The AI industry, however, proceeds with what Bengio calls "crazy risks," ignoring this fundamental safety doctrine.
When a highly autonomous AI fails, the root cause is often not the technology itself but the organization's lack of a pre-defined governance framework. A high degree of AI autonomy ruthlessly exposes any ambiguity in responsibility, liability, and oversight that was already present within the company.
Regardless of potential dangers, AI will be developed relentlessly. Game theory dictates that any nation or company that pauses or slows down will be at a catastrophic disadvantage to competitors who don't. This competitive pressure ensures the technology will advance without brakes.