Scaling Laws Can't Cure Cancer Because the Required Knowledge Isn't on the Internet

Related Insights

Build General AI by First Mastering and Incrementally Expanding from Narrow Domains

The path to a general-purpose AI model is not to tackle the entire problem at once. A more effective strategy is to start with a highly constrained domain, like generating only Minecraft videos. Once the model works reliably in that narrow distribution, incrementally expand the training data and complexity, using each step as a foundation for the next.

This AI Makes a Video Game World in 40 Milliseconds

AI & I·6 months ago

Physical Experiments, Not Just Software, Bottleneck AI

A "software-only singularity," where AI recursively improves itself, is unlikely. Progress is fundamentally tied to large-scale, costly physical experiments (i.e., compute). The massive spending on experimental compute over pure researcher salaries indicates that physical experimentation, not just algorithms, remains the primary driver of breakthroughs.

The 2045 Superintelligence Timeline: Epoch AI’s Data-Driven Forecast

a16z Podcast·3 months ago

AI for Science Fails on Public Data Due to Noise and Missing Negative Results

Foundation models can't be trained for physics using existing literature because the data is too noisy and lacks published negative results. A physical lab is needed to generate clean data and capture the learning signal from failed experiments, which is a core thesis for Periodic Labs.

Training an AI Scientist with Feedback from Reality, w- Liam Fedus & Ekin Dogus Cubuk (from a16z)

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·5 months ago

Foundational Scientific Datasets Follow a 'Slow Then Fast' Innovation Cycle

Building the first large-scale biological datasets, like the Human Cell Atlas, is a decade-long, expensive slog. However, this foundational work creates tools and knowledge that enable subsequent, larger-scale projects to be completed exponentially faster and cheaper, proving a non-linear path to discovery.

Priscilla Chan and Mark Zuckerberg: Frontier AI + Virtual Biology To Solve All Diseases

Latent Space: The AI Engineer Podcast·3 months ago

AI's 'Age of Scaling' Is Over; We're Back to the 'Age of Research'

The era of guaranteed progress by simply scaling up compute and data for pre-training is ending. With massive compute now available, the bottleneck is no longer resources but fundamental ideas. The AI field is re-entering a period where novel research, not just scaling existing recipes, will drive the next breakthroughs.

Dwarkesh and Ilya Sutskever on What Comes After Scaling

The a16z Show·2 months ago

Scarce, Actively Generated Data Is the New Moat for Robotics and Biology AI

The future of valuable AI lies not in models trained on the abundant public internet, but in those built on scarce, proprietary data. For fields like robotics and biology, this data doesn't exist to be scraped; it must be actively created, making the data generation process itself the key competitive moat.

Josh Wolfe & Brett McGurk – Venture, Geopolitics, and the Next Frontier (EP.476)

Capital Allocators – Inside the Institutional Investment Industry·2 months ago

AI Must Learn Physicists' Reasoning Strategies, Not Just Pattern-Match on Data

To make genuine scientific breakthroughs, an AI needs to learn the abstract reasoning strategies and mental models of expert scientists. This involves teaching it higher-level concepts, such as thinking in terms of symmetries, a core principle in physics that current models lack.

Training an AI Scientist with Feedback from Reality, w- Liam Fedus & Ekin Dogus Cubuk (from a16z)

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·5 months ago

AI's Core Bottleneck Is Poor Generalization, Not Scale

The most fundamental challenge in AI today is not scale or architecture, but the fact that models generalize dramatically worse than humans. Solving this sample efficiency and robustness problem is the true key to unlocking the next level of AI capabilities and real-world impact.

Ilya Sutskever – The age of scaling is over

Dwarkesh Podcast·3 months ago

The Missing Link for AI in Science Is an Iterative Loop of Hypothesis and Experiment

Current LLMs fail at science because they lack the ability to iterate. True scientific inquiry is a loop: form a hypothesis, conduct an experiment, analyze the result (even if incorrect), and refine. AI needs this same iterative capability with the real world to make genuine discoveries.

Training an AI Scientist with Feedback from Reality, w- Liam Fedus & Ekin Dogus Cubuk (from a16z)

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·5 months ago

The 'Bitter Lesson' of AI Fails for Robotics Due to Data Misalignment

The "bitter lesson" (scale and simple models win) works for language because training data (text) aligns with the output (text). Robotics faces a critical misalignment: it's trained on passive web videos but needs to output physical actions in a 3D world. This data gap is a fundamental hurdle that pure scaling cannot solve.

The Godmother of AI on jobs, robots & why world models are next | Dr. Fei-Fei Li

Lenny's Podcast: Product | Career | Growth·3 months ago