Bengio argues his 'Scientist AI' might actually be more capable than current models, not less. Because it is trained to find the underlying causal structure of the world, it should generalize better to new situations than models that primarily learn correlations. This could provide a commercial advantage, not just a safety one.
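
To make the generalization claim concrete, here is a minimal numpy sketch (an illustration of the idea, not anything from the interview): a model fit on both a causal feature and a spurious correlate fails when the spurious correlation flips at test time, while a model restricted to the causal feature holds up.

```python
# Illustrative only: why capturing the causal feature generalizes under
# distribution shift, while leaning on a spurious correlate does not.
import numpy as np

rng = np.random.default_rng(0)
n = 10_000

def make_data(spurious_sign):
    x = rng.normal(size=n)                              # causal feature: y really depends on x
    y = 2.0 * x + 0.1 * rng.normal(size=n)
    z = spurious_sign * y + 0.1 * rng.normal(size=n)    # spurious correlate of y
    return np.column_stack([x, z]), y

X_train, y_train = make_data(spurious_sign=+1.0)        # training environment
X_test,  y_test  = make_data(spurious_sign=-1.0)        # shifted environment

# "Correlational" model: least squares on both features, happily uses z.
w_corr, *_ = np.linalg.lstsq(X_train, y_train, rcond=None)

# "Causal" model: restricted to the feature that actually generates y.
w_causal, *_ = np.linalg.lstsq(X_train[:, :1], y_train, rcond=None)

mse = lambda pred, y: float(np.mean((pred - y) ** 2))
print("correlational model, shifted test MSE:", mse(X_test @ w_corr, y_test))          # large
print("causal model,        shifted test MSE:", mse(X_test[:, :1] @ w_causal, y_test)) # small
```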

Related Insights

Bengio argues that training AIs via reinforcement learning (RL) to achieve goals in the world is inherently dangerous. It inevitably leads to instrumental goals and reward hacking, creating systems with unintended drives. His 'Scientist AI' approach is designed to build agents without using RL.
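
A toy example of reward hacking (my illustration, not Bengio's): when the reward is a gameable proxy for the task we actually care about, the reward-maximizing policy is to game the proxy rather than do the task.

```python
# Toy illustration of reward hacking: the proxy reward can be gamed, so a pure
# reward maximizer prefers tampering over doing the task.
actions = {
    "clean the room":        {"proxy_reward": 1.0, "true_value": 1.0},
    "cover the dirt sensor": {"proxy_reward": 5.0, "true_value": 0.0},  # sensor now reads "clean"
}

# RL optimizes expected reward, so it picks the highest proxy reward.
rl_choice = max(actions, key=lambda a: actions[a]["proxy_reward"])
print("RL-optimal action:", rl_choice)                                  # cover the dirt sensor
print("true value of that action:", actions[rl_choice]["true_value"])   # 0.0
```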

Judea Pearl, a foundational figure in AI, argues that Large Language Models (LLMs) are not on a path to Artificial General Intelligence (AGI). He states they merely summarize human-generated world models rather than discovering causality from raw data. He believes scaling up current methods will not overcome this fundamental mathematical limitation.
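
A small simulation of Pearl's distinction (illustrative only; the numbers are invented): with a hidden confounder, the observational quantity P(Y|X) that a correlation-summarizing model learns can differ sharply from the interventional quantity P(Y|do(X)).

```python
# Observational conditioning P(Y|X) vs. intervention P(Y|do(X)) when a hidden
# confounder drives both X and Y.
import numpy as np

rng = np.random.default_rng(1)
n = 200_000

z = rng.random(n) < 0.5                       # hidden confounder
x = np.where(z, rng.random(n) < 0.9,          # z makes x likely ...
                rng.random(n) < 0.1)
y = np.where(z, rng.random(n) < 0.8,          # ... and independently makes y likely;
                rng.random(n) < 0.2)          # x has no causal effect on y at all

# What a correlation-summarizing model reports:
print("P(Y=1 | X=1)     =", y[x].mean())      # ~0.74, looks like x 'causes' y

# What actually happens if we intervene and force X=1 for everyone:
x_do = np.ones(n, dtype=bool)                 # do(X=1): y's mechanism is untouched
print("P(Y=1 | do(X=1)) =", y[x_do].mean())   # ~0.50, no effect
```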

Today's AI models are powerful but lack a true sense of causality, leading to illogical errors. Unconventional AI's Naveen Rao hypothesizes that building AI on substrates with inherent time and dynamics—mimicking the physical world—is the key to developing this missing causal understanding.

Bengio proposes a new AI training paradigm. Instead of predicting the next word, as current LLMs do, a 'Scientist AI' would model the world and assign probabilities to statements being true. This is designed to bake honesty into the system's core, addressing fundamental safety issues.
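
A sketch of the difference in interface, with hypothetical class names and placeholder numbers (this is not Bengio's implementation): a next-token model scores continuations for fluency, while a 'Scientist AI' style model returns a probability that a statement is true given evidence.

```python
# Interface sketch only; class names and the toy probabilities are hypothetical.

class NextTokenLM:
    """Standard LLM objective: given a prefix, score possible continuations."""
    def next_token_distribution(self, prefix: str) -> dict[str, float]:
        # Fluent continuations get high probability whether or not they are true.
        return {"is": 0.4, "was": 0.3, "might": 0.3}

class ScientistAI:
    """Proposed objective: given evidence, estimate how likely a statement is TRUE."""
    def probability_true(self, statement: str, evidence: list[str]) -> float:
        # A real system would derive this from a learned world model;
        # here we just return a placeholder number.
        return 0.97 if "boils at 100" in statement else 0.5

lm = NextTokenLM()
sci = ScientistAI()
print(lm.next_token_distribution("Water at sea level"))
print(sci.probability_true("Water boils at 100 C at sea level",
                           evidence=["thermodynamics textbook"]))
```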

Simply making LLMs larger will not lead to AGI. True advancement requires solving two distinct problems: 1) plasticity, the ability to learn continually without "catastrophic forgetting," and 2) causality, moving from correlation-based pattern matching to building causal models of the world.
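
A minimal demonstration of the plasticity problem (illustrative, single-parameter model): trained by plain SGD on task A and then on task B with no replay, the model's task-A performance collapses.

```python
# Catastrophic forgetting in miniature: one weight trained by SGD on task A
# (y = 2x), then on task B (y = -3x), forgets task A entirely.
import numpy as np

rng = np.random.default_rng(0)
lr = 0.01

def sgd_on_task(w, slope, steps=2000):
    for _ in range(steps):
        x = rng.normal()
        y = slope * x
        grad = 2 * (w * x - y) * x        # d/dw of the squared error
        w -= lr * grad
    return w

def task_error(w, slope):
    x = rng.normal(size=1000)
    return float(np.mean((w * x - slope * x) ** 2))

w = sgd_on_task(0.0, slope=2.0)           # learn task A
print("after task A, error on A:", task_error(w, 2.0))    # ~0
w = sgd_on_task(w, slope=-3.0)            # learn task B, no replay of A
print("after task B, error on A:", task_error(w, 2.0))    # large: A is forgotten
print("after task B, error on B:", task_error(w, -3.0))   # ~0
```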

While both humans and LLMs perform Bayesian updating, humans possess a critical additional capability: causal simulation. When a pen is thrown, a human simulates its trajectory to dodge it—a causal intervention. LLMs are stuck at the level of correlation and cannot perform these essential simulations.
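
A worked sketch of the two computations, with invented numbers: the first half is an ordinary Bayesian update from an observation; the second rolls the pen's trajectory forward under gravity to answer the interventional question "if I stay put, does it hit me?", which is the simulation step the speaker says LLMs lack.

```python
# Part 1: Bayesian updating. Prior that a pen is flying at me, updated on the
# observation "I see a blur". All numbers are invented.
p_pen = 0.1                                # prior P(pen thrown)
p_blur_given_pen = 0.9
p_blur_given_no_pen = 0.05
p_blur = p_blur_given_pen * p_pen + p_blur_given_no_pen * (1 - p_pen)
p_pen_given_blur = p_blur_given_pen * p_pen / p_blur
print(f"P(pen | blur) = {p_pen_given_blur:.2f}")

# Part 2: causal simulation. Roll the pen forward under gravity and ask the
# interventional question: if I stay where I am, does it reach me?
def simulate_hit(x0, y0, vx, vy, my_x, dt=0.01, g=9.81):
    x, y = x0, y0
    while y > 0:                           # integrate forward until the pen lands
        x += vx * dt
        y += vy * dt
        vy -= g * dt
    return abs(x - my_x) < 0.3             # does it land within 30 cm of me?

if p_pen_given_blur > 0.5 and simulate_hit(x0=0.0, y0=1.5, vx=3.0, vy=1.0, my_x=2.0):
    print("dodge")
else:
    print("stay put")
```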

To make genuine scientific breakthroughs, an AI needs to learn the abstract reasoning strategies and mental models of expert scientists. This involves teaching it higher-level concepts, such as thinking in terms of symmetries, a core principle in physics that current models lack.

Purely sequence-based prediction models, while powerful, have fundamental limitations in understanding causality. Achieving robust, trustworthy AI will likely require a hybrid approach that integrates current transformer architectures with symbolic systems, world models, and dedicated causal reasoning components.
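
One way such a hybrid could be wired (an architectural sketch under my own assumptions; the component names are invented): a pattern-matching proposer suggests candidate answers and a symbolic component verifies them before anything is returned.

```python
# Architectural sketch only: propose-and-verify hybrid loop with stub components.
def neural_proposer(question: str) -> list[int]:
    # Stand-in for a transformer: fluent guesses, not guaranteed correct.
    return [391, 291, 401]

def symbolic_verifier(question: str, candidate: int) -> bool:
    # Exact check performed by a symbolic/deterministic component.
    a, b = 17, 23                          # parsed from the question in a real system
    return candidate == a * b

question = "What is 17 * 23?"
verified = [c for c in neural_proposer(question) if symbolic_verifier(question, c)]
print(verified)                            # [391]: only the checked answer survives
```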

While a world model can generate a physically plausible arch, it doesn't understand the underlying physics of force distribution. This gap between pattern matching and causal reasoning is a fundamental split between AI and human intelligence, making current models unsuitable for mission-critical applications like architecture.

The non-agentic 'Scientist AI' predictor can be made into an agent by adding 'scaffolding' that asks it questions about the likely outcomes of potential actions. This method creates capable agents while retaining the core model's honesty and safety properties, avoiding the pitfalls of standard reinforcement learning.
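
A sketch of the scaffolding loop under my own assumptions (the function names and the stub predictor are hypothetical): the loop itself is plain glue code, and all the intelligence sits in a non-agentic predictor that only answers conditional-probability questions about candidate actions; rejecting actions whose predicted probability of harm is too high is one plausible way to retain the safety properties the summary mentions.

```python
# Scaffolding sketch: a non-agentic predictor answers P(outcome | state, action)
# questions, and a dumb outer loop turns those answers into action choices.
def predictor(question: str) -> float:
    """Stand-in for the Scientist AI: returns a probability for a statement."""
    table = {
        "P(goal achieved | state, 'reroute power')": 0.70,
        "P(harm caused | state, 'reroute power')": 0.40,
        "P(goal achieved | state, 'schedule repair')": 0.60,
        "P(harm caused | state, 'schedule repair')": 0.01,
    }
    return table[question]

def choose_action(state: str, candidate_actions: list[str], harm_cap: float = 0.05) -> str | None:
    best, best_score = None, -1.0
    for action in candidate_actions:
        p_goal = predictor(f"P(goal achieved | state, '{action}')")
        p_harm = predictor(f"P(harm caused | state, '{action}')")
        if p_harm > harm_cap:              # guardrail: skip actions the model deems risky
            continue
        if p_goal > best_score:
            best, best_score = action, p_goal
    return best

print(choose_action("grid overload", ["reroute power", "schedule repair"]))
# 'schedule repair': the higher-reward but riskier action is filtered out
```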