Waymo's AI Uses a Foundation Model to Train Specialized 'Teacher' Models

Related Insights

Waymo’s Rapid Scaling Was Unlocked by a Unified AI Backbone

The move from Waymo's 4th to 5th generation driver was a discontinuous jump. Waymo abandoned smaller, specialized ML models for a single AI backbone trained on a massive, nationwide dataset. This generalizable stack, rather than city-specific tuning, enabled its recent rapid scaling across the US.

From Models to Mobility: Building Waymo with Dmitri Dolgov

The a16z Show·2 months ago

Off-the-Shelf Vision Language Models Can Learn to Drive Nominally

Waymo demonstrated that a standard Vision Language Model (VLM) can be fine-tuned to output driving trajectories instead of text. While unsafe for public roads, it drives 'pretty darn well' in normal conditions, showing the surprising generalizability of foundational vision-language understanding.

From Models to Mobility: Building Waymo with Dmitri Dolgov

The a16z Show·2 months ago

Waymo Uses Cloud Models for Non-Real-Time Tasks Like Finding Lost Items

While safety-critical driving inference happens locally, Waymo leverages the cloud for operational tasks. After a ride, an off-board model analyzes the interior to check if a passenger left an item or if the car needs cleaning, which helps optimize fleet management without burdening the in-car compute.

From Models to Mobility: Building Waymo with Dmitri Dolgov

The a16z Show·2 months ago

World Models That Grasp Physics Are the Successor to LLMs

Large Language Models are limited because they lack an understanding of the physical world. The next evolution is 'World Models'—AI trained on real-world sensory data to understand physics, space, and context. This is the foundational technology required to unlock physical AI like advanced robotics.

Humanize AI before it dehumanizes us, with Dr. Rana el Kaliouby at SXSW

Masters of Scale·2 months ago

Waymo Augments Its End-to-End AI with Intermediate Representations for Simulation

A pure 'pixels in, actions out' model is insufficient for full autonomy. Waymo augments its end-to-end learning with structured, intermediate representations (like objects and road concepts). This provides crucial knobs for scalable simulation, safety validation, and defining reward functions.

From Models to Mobility: Building Waymo with Dmitri Dolgov

The a16z Show·2 months ago

Autonomous Driving Has Shifted From Brittle "Rules-Based" Systems to Trainable AI Models

Rivian's CEO explains that early autonomous systems, which were based on rigid rules-based "planners," have been superseded by end-to-end AI. This new approach uses a large "foundation model for driving" that can improve continuously with more data, breaking through the performance plateau of the older method.

Rivian CEO: 'We're really convicted' about skipping Carplay

Decoder with Nilay Patel·8 months ago

Waymo's Rapid Scaling Was Unlocked by a Unified AI Backbone in Gen 5

The transition from Gen 4 to Gen 5 was a discontinuous jump that enabled rapid expansion. Waymo made a "big bet on AI," replacing a system of many smaller, specialized ML models with a single, generalizable AI backbone. This new architecture, trained on diverse national data, was the key to scaling beyond specific pre-mapped areas.

The 20-year journey to fully autonomous cars with Dmitri Dolgov of Waymo

Cheeky Pint·2 months ago

Pure End-to-End AV Models Fail Due to Simulation and Validation Challenges

A pure "pixels-in, actions-out" model is insufficient for full autonomy. While easy to start, this approach is extremely inefficient to simulate and validate for safety-critical edge cases. Waymo augments its end-to-end system with intermediate representations (like objects and road signs) to make simulation and validation tractable.

The 20-year journey to fully autonomous cars with Dmitri Dolgov of Waymo

Cheeky Pint·2 months ago

Waive Teaches its AI to Reason Using "World Models" that Simulate Future Scenarios

The AI's ability to handle novel situations isn't just an emergent property of scale. Waive actively trains "world models," which are internal generative simulators. This enables the AI to reason about what might happen next, leading to sophisticated behaviors like nudging into intersections or slowing in fog.

How End-to-End Learning Created Autonomous Driving 2.0: Wayve CEO Alex Kendall

Training Data·6 months ago

Waymo's AI Architecture Uses Off-Board 'Teachers' to Train On-Device 'Student' Models

Waymo uses a foundation model to create specialized, high-capacity "teacher" models (Driver, Simulator, Critic) offline. These teachers then distill their knowledge into smaller, efficient "student" models that can run in real-time on the vehicle, balancing massive computational power with on-device constraints.

The 20-year journey to fully autonomous cars with Dmitri Dolgov of Waymo

Cheeky Pint·2 months ago

Get your free personalized podcast brief

Related Insights