Humanoid Robots Remain Far Off Due to a Severe "Real World" Data Gap

Related Insights

Data Acquisition Is Now the Primary Blocker to Mass Humanoid Robot Deployment

According to Figure's CEO, the company's biggest challenge is no longer hardware reliability but acquiring enormous amounts of diverse, high-quality data. This data is essential for pre-training their Helix AI model to generalize and handle countless real-world scenarios in homes and commercial settings.

Figure’s Humanoid Factory Tour – CEO Brett Adcock

Sourcery·2 months ago

Humanoid Robots Will Exceed B2B Expectations While Failing in B2C Consumer Markets

The future of humanoid robotics is not in our homes. While they will revolutionize structured B2B environments like 'dark' factories and data centers, consumer adoption will lag significantly due to a fundamental lack of desire for robots in personal, nuanced spaces.

Humanoid Robots, Building a Service Business, and Why CEOs Won’t Save Democracy

The Prof G Pod with Scott Galloway·8 months ago

Robotics Lacks an 'Internet-Scale' Public Dataset, Forcing Firms to Bootstrap Data Collection

The rapid progress of many LLMs was possible because they could leverage the same massive public dataset: the internet. In robotics, no such public corpus of robot interaction data exists. This “data void” means progress is tied to a company's ability to generate its own proprietary data.

Uncapped #32 | Kyle Vogt from The Bot Company

Uncapped with Jack Altman·8 months ago

Impressive Robot Demos Often Mask a Lack of Real-World Generalization

A flashy robot demo typically uses a highly controlled, pristine environment tailored to one task. True progress lies in a robot performing a mundane task reliably in any novel situation—a feat of generalization that is much harder to showcase visually and less exciting to a layperson.

Sergey Levine - Building LLMs for the Physical World - [Invest Like the Best, EP.465]

Invest Like the Best with Patrick O'Shaughnessy·3 months ago

Humanoid Robot Development is Bottlenecked by In-Home Data Collection, Not Hardware

Progress in robotics for household tasks is limited by a scarcity of real-world training data, not mechanical engineering. Companies are now deploying capital-intensive "in-field" teams to collect multi-modal data from inside homes, capturing the complexity of mundane human activities to train more capable robots.

Centific’s Role in AI Boom, Databricks $134B Valuation, Alien Hunter Funding | Dec 16, 2025

The Information's TITV·6 months ago

Robotics Lags Language AI Due to a '100,000-Year' Physical Data Gap

Ken Goldberg quantifies the challenge: the text data used to train LLMs would take a human 100,000 years to read. Equivalent data for robot manipulation (vision-to-control signals) doesn't exist online and must be generated from scratch, explaining the slower progress in physical AI.

TECH010: The Real Robotics Timeline w/ Ken Goldberg (Tech Podcast)

We Study Billionaires - The Investor’s Podcast Network·6 months ago

Robotics Lags Image Generation Because of "Data Poverty" in Physical Tasks

AI can generate art because it was trained on the internet's vast trove of images. It struggles with physical tasks like washing dishes because there is virtually no first-person video data for such actions. Solving this data-gathering problem is key to advancing robotics.

Jean-Marc Daecius - The Last Human Chief of Staff (Ep. 300)

Infinite Loops·5 months ago

The Home Market for Humanoids May Arrive Sooner Than the Factory Market

Initially, factories seemed like the easier first market for humanoids due to structured environments. However, Figure's founder now believes the home is a more near-term opportunity. The challenge of environmental variability is now seen as a data-bound problem that can be solved with large-scale data collection programs.

Humanoids Cost as Much as an SUV Now | Nikhil Kamath x Brett Adcock | WTF Online Ep 2

WTF Online·8 months ago

Humanoid Robots Face a "Chicken-and-Egg" Paradox Hindering Industrial Use

The humanoid robot industry is stalled by a data paradox: robots need vast amounts of real-world data from factory tasks to become useful, but they cannot be deployed in factories until they are already useful. This catch-22 forces companies to rely on simulated data, slowing the transition from entertainment props to industrial tools.

Let me get this strait: the Iran-war escalation risk

Economist Podcasts·3 months ago

The 'Bitter Lesson' of AI Fails for Robotics Due to Data Misalignment

The "bitter lesson" (scale and simple models win) works for language because training data (text) aligns with the output (text). Robotics faces a critical misalignment: it's trained on passive web videos but needs to output physical actions in a 3D world. This data gap is a fundamental hurdle that pure scaling cannot solve.

The Godmother of AI on jobs, robots & why world models are next | Dr. Fei-Fei Li

Lenny's Podcast: Product | Career | Growth·7 months ago

Get your free personalized podcast brief

Related Insights