
Instead of only slowing down risky AI, a key strategy is to accelerate beneficial technologies like decision-making tools. This 'differential technology development' aims to equip humanity with better cognitive tools before the most dangerous AI capabilities emerge, improving our odds of a safe transition.

Related Insights

Top Chinese officials use the metaphor "if the braking system isn't under control, you can't really step on the accelerator with confidence." This reflects a core belief that robust safety measures enable, rather than hinder, the aggressive development and deployment of powerful AI systems, viewing the two as synergistic.

If society gets an early warning of an intelligence explosion, the primary strategy should be to redirect the nascent superintelligent AI 'labor' away from accelerating AI capabilities. Instead, this powerful new resource should be immediately tasked with solving the safety, alignment, and defense problems that advanced AI itself creates, such as patching software vulnerabilities or designing biodefenses.

Impactful AI for societal decision-making can be categorized into two main types. Epistemic tools help us understand what is true (e.g., AI fact-checkers, forecasters), while coordination tools help groups cooperate (e.g., AI negotiators, verification systems). This provides a clear framework for targeted development.

Framing an AI development pause as a binary on/off switch is unproductive. A better model is to see it as a redirection of AI labor along a spectrum. Instead of 100% of AI effort going to capability gains, a 'pause' means shifting that effort towards defensive activities like alignment, biodefense, and policy coordination, while potentially still making some capability progress.

AI accelerationists and safety advocates often appear to have opposing goals, but may actually desire a similar 10-20 year transition period. The conflict arises because accelerationists believe the default timeline is 50-100 years and want to speed it up, while safety advocates believe the default is an explosive 1-5 years and want to slow it down.

The risk of malicious actors using powerful AI decision tools is significant. The most effective countermeasure is not to restrict the technology, but to ensure it is widely and equitably distributed. This prevents any single group from gaining a dangerous strategic advantage over others.

Ryan Kidd argues that it's nearly impossible to separate AI safety and capabilities work. Safety improvements, like RLHF, make models more useful and steerable, which in turn accelerates demand for more powerful "engines." This suggests that pure "safety-only" research is a practical impossibility.

A key failure mode for using AI to solve AI safety is an 'unlucky' development path where models become superhuman at accelerating AI R&D before becoming proficient at safety research or other defensive tasks. This could create a period where we know an intelligence explosion is imminent but are powerless to use the precursor AIs to prepare for it.

Even if the market would eventually build decision-making tools, their impact is time-sensitive. Waiting for commercial rollout might mean they arrive after AGI, too late to help navigate the riskiest period. Therefore, philanthropic or impact-driven acceleration, even by a few months, is highly valuable.

The race for AI supremacy is governed by game theory. Any technology promising an advantage will be developed. If one nation slows down for safety, a rival will speed up to gain strategic dominance. Therefore, focusing on guardrails without sacrificing speed is the only viable path.
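The dynamic described above is essentially a prisoner's dilemma: each nation's dominant strategy is to race, even though coordinated caution would leave everyone better off. A minimal sketch, using hypothetical payoff values chosen purely for illustration:

```python
# Illustrative prisoner's-dilemma model of the AI race.
# Each nation chooses "slow" (prioritize safety) or "race" (prioritize speed).
# Payoff tuples are (row player, column player); higher is better.
# All numbers are assumed for illustration, not from the source.

payoffs = {
    ("slow", "slow"): (3, 3),   # coordinated caution: safest shared outcome
    ("slow", "race"): (0, 4),   # rival races ahead and gains dominance
    ("race", "slow"): (4, 0),   # we race ahead and gain dominance
    ("race", "race"): (1, 1),   # arms race: fast but risky for both
}

def best_response(opponent_move: str) -> str:
    """Return the row player's payoff-maximizing move given the rival's move."""
    return max(("slow", "race"),
               key=lambda m: payoffs[(m, opponent_move)][0])

# "race" strictly dominates "slow": whichever move the rival makes,
# racing pays more, so (race, race) is the equilibrium even though
# (slow, slow) would leave both sides better off (3 > 1).
print(best_response("slow"))  # race
print(best_response("race"))  # race
```

This is why the entry concludes that unilateral slowdowns are unstable, and that guardrails which do not sacrifice speed are the only strategies compatible with the equilibrium.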