Periodic Labs Uses Physical Experiments as the Ground Truth Reward Function for AI

Related Insights

AI Models Excel on Benchmarks But Fail in Reality Due to 'Teaching to the Test'

AI models show impressive performance on evaluation benchmarks but underwhelm in real-world applications. This gap exists because researchers, focused on evals, create reinforcement learning (RL) environments that mirror test tasks. This leads to narrow intelligence that doesn't generalize, a form of human-driven reward hacking.

Dwarkesh and Ilya Sutskever on What Comes After Scaling

The a16z Show·7 months ago

Periodic Labs' Superconductor Goal Forces Development of Foundational AI Capabilities

The ambitious goal of discovering a high-temperature superconductor isn't just a scientific target; it's a strategic choice. Achieving it requires building numerous sub-systems like autonomous synthesis and characterization, effectively forcing the creation of a general-purpose AI for science platform.

Training an AI Scientist with Feedback from Reality, w- Liam Fedus & Ekin Dogus Cubuk (from a16z)

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·9 months ago

Top Biotech Labs Now Design Experiments to Train AI, Not Just Answer Questions

The next leap in biotech moves beyond applying AI to existing data. CZI pioneers a model where 'frontier biology' and 'frontier AI' are developed in tandem. Experiments are now designed specifically to generate novel data that will ground and improve future AI models, creating a virtuous feedback loop.

Priscilla Chan and Mark Zuckerberg: Frontier AI + Virtual Biology To Solve All Diseases

Latent Space: The AI Engineer Podcast·8 months ago

AI for Science Fails on Public Data Due to Noise and Missing Negative Results

Foundation models can't be trained for physics using existing literature because the data is too noisy and lacks published negative results. A physical lab is needed to generate clean data and capture the learning signal from failed experiments, which is a core thesis for Periodic Labs.

Training an AI Scientist with Feedback from Reality, w- Liam Fedus & Ekin Dogus Cubuk (from a16z)

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·9 months ago

Simulated RL Environments Are the Next Frontier for Training Capable AI Agents

Beyond supervised fine-tuning (SFT) and human feedback (RLHF), reinforcement learning (RL) in simulated environments is the next evolution. These "playgrounds" teach models to handle messy, multi-step, real-world tasks where current models often fail catastrophically.

The 100-person AI lab that became Anthropic and Google's secret weapon | Edwin Chen (Surge AI)

Lenny's Podcast: Product | Career | Growth·7 months ago

Reinforcement Learning Uses Multiple Signals, Not Just Human Feedback (RLHF)

Reinforcement Learning with Human Feedback (RLHF) is a popular term, but it's just one method. The core concept is reinforcing desired model behavior using various signals. These can include AI feedback (RLAIF), where another AI judges the output, or verifiable rewards, like checking if a model's answer to a math problem is correct.

Al Engineering 101 with Chip Huyen (Nvidia, Stanford, Netflix)

Lenny's Podcast: Product | Career | Growth·8 months ago

Harvey AI Tackles Legal RL's Verification Problem By Using Partner Feedback as the Reward Function

Unlike coding with its verifiable unit tests, complex legal work lacks a binary success metric. Harvey addresses this reinforcement learning challenge by treating senior partner feedback and edits as the "reward function," mirroring how quality is judged in the real world. The ultimate verification is long-term success, like a merger avoiding future litigation.

Scaling Legal AI and Building Next-Generation Law Firms with Harvey Co-Founder and President Gabe Pereyra

No Priors: Artificial Intelligence | Technology | Startups·7 months ago

Reinforcement Learning Represents AI's Shift From Imitating Data to Achieving Goals

The transition from supervised learning (copying internet text) to reinforcement learning (rewarding a model for achieving a goal) marks a fundamental breakthrough. This method, used in Anthropic's Opus 3 model, allows AI to develop novel problem-solving capabilities beyond simple data emulation.

Jack Morris on Finding the Next Big AI Breakthrough

Odd Lots·9 months ago

The Frontier of AI Training Is Now Defining Better Benchmarks, Not Better Algorithms

As reinforcement learning (RL) techniques mature, the core challenge shifts from the algorithm to the problem definition. The competitive moat for AI companies will be their ability to create high-fidelity environments and benchmarks that accurately represent complex, real-world tasks, effectively teaching the AI what matters.

How Cognition Built the World's First AI Coding Agent—Before Claude Code

AI & I·9 months ago

The Missing Link for AI in Science Is an Iterative Loop of Hypothesis and Experiment

Current LLMs fail at science because they lack the ability to iterate. True scientific inquiry is a loop: form a hypothesis, conduct an experiment, analyze the result (even if incorrect), and refine. AI needs this same iterative capability with the real world to make genuine discoveries.

Training an AI Scientist with Feedback from Reality, w- Liam Fedus & Ekin Dogus Cubuk (from a16z)

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·9 months ago

Get your free personalized podcast brief

Related Insights