
LLMs excel at learning correlations from vast data (Shannon entropy), modeling the digits of pi as if they were a statistically random stream. However, they can't induce the simple, elegant program that actually generates pi (Kolmogorov complexity). This represents the critical leap from correlation to true causal understanding.
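The Kolmogorov-complexity side of this contrast can be made concrete: the digits of pi pass statistical randomness tests, yet a program of only a few lines produces them exactly. A minimal sketch using Gibbons' unbounded spigot algorithm (the function name is our own):

```python
def pi_digits(n):
    """Return the first n decimal digits of pi using Gibbons' unbounded
    spigot algorithm: a handful of exact integer operations generate a
    stream that looks statistically random."""
    q, r, t, j = 1, 180, 60, 2
    out = []
    while len(out) < n:
        u = 3 * (3 * j + 1) * (3 * j + 2)
        y = (q * (27 * j - 12) + 5 * r) // (5 * t)  # next certain digit
        out.append(y)
        # Advance the state; all arithmetic is exact (Python big ints).
        q, r, t, j = (10 * q * j * (2 * j - 1),
                      10 * u * (q * (5 * j - 2) + r - y * t),
                      t * u,
                      j + 1)
    return out

pi_digits(6)  # → [3, 1, 4, 1, 5, 9]
```

The whole generator fits in a dozen lines, which is exactly the point: the description length of the program is tiny even though the output is incompressible-looking.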

Related Insights

A useful mental model for an LLM is a giant matrix where each row is a possible prompt and columns represent next-token probabilities. This matrix is impossibly large but also extremely sparse, as most token combinations are gibberish. The LLM's job is to efficiently compress and approximate this matrix.
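The sparse-matrix picture can be sketched with a toy bigram model (a drastic simplification standing in for a real LLM, and the function name is our own): each context maps to a probability distribution over next tokens, and only pairs that actually occur are stored.

```python
from collections import defaultdict, Counter

def build_sparse_next_token_table(corpus):
    """Toy stand-in for the 'giant matrix': rows are contexts (here just the
    previous token), entries are next-token probabilities. Stored sparsely,
    since almost all (context, token) pairs never occur in real text."""
    counts = defaultdict(Counter)
    tokens = corpus.split()
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    # Normalize counts into probabilities, keeping only observed entries.
    return {
        ctx: {tok: c / sum(ctr.values()) for tok, c in ctr.items()}
        for ctx, ctr in counts.items()
    }

table = build_sparse_next_token_table("the cat sat on the mat the cat ran")
# table["the"] holds probabilities only for tokens actually seen after "the"
```

A real LLM replaces this lookup table with a neural network that generalizes across contexts, i.e., a lossy compression of the impossibly large matrix.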

A core debate in AI is whether LLMs, which are text prediction engines, can achieve true intelligence. Critics argue they cannot because they lack a model of the real world. This prevents them from making meaningful, context-aware predictions about future events—a limitation that more data alone may not solve.

LLMs shine when acting as a 'knowledge extruder'—shaping well-documented, 'in-distribution' concepts into specific code. They fail when the core task is novel problem-solving where deep thinking, not code generation, is the bottleneck. In these cases, the code is the easy part.

MIT research reveals that large language models develop "spurious correlations" by associating sentence patterns with topics. This cognitive shortcut causes them to give domain-appropriate answers to nonsensical queries if the grammatical structure is familiar, bypassing logical analysis of the actual words.

Judea Pearl, a foundational figure in AI, argues that Large Language Models (LLMs) are not on a path to Artificial General Intelligence (AGI). He states they merely summarize human-generated world models rather than discovering causality from raw data. He believes scaling up current methods will not overcome this fundamental mathematical limitation.

Current AI can learn to predict complex patterns, like planetary orbits, from data. However, it struggles to abstract the underlying causal laws, such as Newtonian physics (F = ma). This leap to a higher level of abstraction remains a fundamental challenge beyond simple pattern recognition.
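The gap between fitting orbital data and abstracting the law can be illustrated with Kepler's third law, a close cousin of the example above: a plain least-squares fit on log-transformed planet data recovers the exponent 3/2 numerically, yet nothing in the procedure represents the law T² = a³ itself. A sketch using approximate published values for five planets:

```python
import math

# (semi-major axis in AU, orbital period in years), approximate values.
data = [(0.387, 0.241),   # Mercury
        (0.723, 0.615),   # Venus
        (1.000, 1.000),   # Earth
        (1.524, 1.881),   # Mars
        (5.203, 11.862)]  # Jupiter

# A pattern-learner can fit log T = k*log a + c and predict periods
# accurately without ever abstracting Kepler's third law (T^2 = a^3).
xs = [math.log(a) for a, _ in data]
ys = [math.log(T) for _, T in data]
n = len(data)
mx, my = sum(xs) / n, sum(ys) / n
slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
         / sum((x - mx) ** 2 for x in xs))
# slope comes out close to 1.5, the exponent in Kepler's third law
```

The fit is pure curve-matching: it would work just as well on any power law, which is precisely why recovering the exponent is not the same as discovering the law.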

Simply making LLMs larger will not lead to AGI. True advancement requires solving two distinct problems: 1) plasticity, the ability to continually learn without "catastrophic forgetting," and 2) causality, moving from correlation-based pattern matching to building causal models of the world.

While both humans and LLMs perform Bayesian updating, humans possess a critical additional capability: causal simulation. When a pen is thrown, a human simulates its trajectory to dodge it—a causal intervention. LLMs are stuck at the level of correlation and cannot perform these essential simulations.
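The pen-dodging example amounts to forward-simulating a physical model. A minimal sketch of such a causal simulation (a simple projectile under gravity with semi-implicit Euler steps; the function name and parameter values are hypothetical):

```python
def simulate_trajectory(x0, y0, vx, vy, g=9.81, dt=0.01):
    """Forward-simulate a thrown object (the 'pen') under gravity and
    return the horizontal position where it falls back to launch height.
    This is a causal model: we can intervene on vx, vy, or g and re-run."""
    x, y = x0, y0
    while True:
        vy -= g * dt      # gravity changes velocity (semi-implicit Euler)
        x += vx * dt
        y += vy * dt
        if y <= y0:       # back at launch height: report landing point
            return x

landing = simulate_trajectory(0.0, 0.0, vx=2.0, vy=4.905)
# analytic answer: x0 + vx * (2 * vy / g) = 2.0 m; the simulation lands nearby
```

The key property is that the model supports intervention: changing `g` or the throw velocity and re-running answers a "what if" question, which no amount of correlation over past pen observations can do.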

AGI won't be achieved by pattern-matching existing knowledge. A real benchmark is whether a model can synthesize anomalous data (like Mercury's orbit) and create a fundamentally new representation of the universe, as Einstein did, moving beyond correlation to a new causal model.

A Harvard study showed LLMs can predict planetary orbits (pattern fitting) but generate nonsensical force vectors when probed. This reveals a critical gap: current models mimic data patterns but don't develop a true, generalizable understanding of underlying physical laws, separating them from human intelligence.

LLMs Master Correlation (Shannon Entropy) but Fail at Causal Leaps (Kolmogorov Complexity) | RiffOn