Recognizing the limits of purely pragmatic safety measures, the UK's AI Safety Institute (AISI) is funding research in areas like complexity theory and game theory. The goal is not a definitive proof of safety, but theoretical models built on plausible assumptions that can offer stronger guarantees and new algorithmic insights for alignment.

Related Insights

The AI safety community acknowledges that it does not yet have all the ideas needed to ensure a safe transition to AGI. This creates an imperative to fund 'neglected approaches': unconventional, creative, and sometimes 'weird' research that falls outside current mainstream paradigms but may hold the key to novel solutions.

Emmett Shear reframes AI alignment not as a one-time problem to be solved, but as an ongoing, living process of recalibration and learning, much like how human families or societies maintain cohesion. This challenges the common 'lock in values' approach in AI safety.

Research with long timelines (e.g., a "2063 scenario") is still worth pursuing, as these technical plans can be compressed into a short period by future AI assistants. Seeding these directions now raises the "waterline of understanding" for future AI-accelerated alignment efforts, making them viable even on shorter timelines.

OpenAI's health division serves a dual purpose: delivering societal benefits and providing a real-world, high-stakes environment for AI safety research. Problems like scalable oversight (supervising superhuman AI) move from theoretical exercises to practical necessities when models outperform physicians on narrow tasks, creating concrete feedback loops that accelerate safety progress.

Unlike specialized non-profits, Far.AI covers the entire AI safety value chain from research to policy. This structure is designed to prevent promising safety ideas from being "dropped" between the research and deployment phases, a common failure point where specialized organizations struggle to hand off work.

Instead of building a single, monolithic AGI, the "Comprehensive AI Services" model suggests safety comes from creating a buffered ecosystem of specialized AIs. These agents can be superhuman within their domain (e.g., protein folding) but are fundamentally limited, preventing runaway, uncontrollable intelligence.

Ryan Kidd argues that it's nearly impossible to separate AI safety and capabilities work. Safety improvements, like RLHF, make models more useful and steerable, which in turn accelerates demand for more powerful "engines." This suggests that pure "safety-only" research is a practical impossibility.

An FDA-style regulatory model would force AI companies to make a quantitative safety case for their models before deployment. This shifts the burden of proof from regulators to creators, creating powerful financial incentives for labs to invest heavily in safety research, much like pharmaceutical companies invest in clinical trials.

The UK's AI Safety Institute (AISI) has two core functions: it channels research on frontier AI risks to the UK and allied governments, and it actively mitigates threats by red-teaming models for developers and helping to drive real-world defenses like pandemic preparedness.

Efforts to understand an AI's internal state (mechanistic interpretability) simultaneously advance AI safety, by revealing motivations, and AI welfare, by assessing potential suffering. The two goals are not at odds; they are aligned through the shared need to "pop the hood" on AI systems.