Sunday Robotics Is Scaling Data Collection With Gloves, Not Just Robots

Related Insights

Robotics Data Startups Must Combine Software with Human Operations for a Complete Solution

Unlike LLMs that train on the existing internet, robotics lacks a pre-training dataset for the physical world. This forces companies like Encore to build a full-stack solution combining a software platform for data management with human-led operations for data collection, annotation, and even real-time remote robot piloting for exception handling.

Inside Amazon’s Potential $50B OpenAI Investment, Nvidia’s Impressive Earnings & Stock Fall

The Information's TITV·5 months ago

Robotics Startup Sunday Decouples Data Collection from Hardware Deployment with 'Skill Capture Gloves'

To overcome the data bottleneck in robotics, Sunday developed gloves that capture human hand movements. This lets the company train its robot's manipulation skills without needing a physical robot for teleoperation. By separating data gathering (gloves) from execution (robot), Sunday can scale its training dataset far more efficiently than competitors that rely on robot-in-the-loop data collection.

NVIDIA Beats Earnings, Google Launches Nano Banana Pro, 𝕏 Timeline Reactions | David Chang, Loredana Crisan, Tarek Alaruri, Tony Zhao, Nikita Rudin

TBPN·8 months ago

Robotics Lacks an 'Internet-Scale' Public Dataset, Forcing Firms to Bootstrap Data Collection

The rapid progress of many LLMs was possible because they could leverage the same massive public dataset: the internet. In robotics, no such public corpus of robot interaction data exists. This “data void” means progress is tied to a company's ability to generate its own proprietary data.

Uncapped #32 | Kyle Vogt from The Bot Company

Uncapped with Jack Altman·9 months ago

Prioritize Extreme Affordability in Early Home Robots to Fuel the Data Flywheel

For consumer robotics, the biggest bottleneck is real-world data. By aggressively cutting costs to make robots affordable, companies can deploy more units faster. This generates a massive data advantage, creating a feedback loop that improves the product and widens the competitive moat.

Uncapped #32 | Kyle Vogt from The Bot Company

Uncapped with Jack Altman·9 months ago

Robotics AI Models Can Now Learn from Human Video, Unlocking a Scalable Training Path

Physical Intelligence demonstrated an emergent capability: after reaching a certain performance threshold, its robotics model improved significantly when trained on egocentric human video. This addresses a major bottleneck by making vast, existing video datasets usable in place of expensive, limited teleoperated data.

Amazon x OpenAI, Ford's EV Reality Check, Kushner Drops WB Bid | Sarah Guo, David Senra, Doug O'Laughlin, Doug Bernauer, Jacob Effron, Logan Kilpatrick

TBPN·7 months ago

Humanoid Robot Development is Bottlenecked by In-Home Data Collection, Not Hardware

Progress in robotics for household tasks is limited by a scarcity of real-world training data, not mechanical engineering. Companies are now deploying capital-intensive "in-field" teams to collect multi-modal data from inside homes, capturing the complexity of mundane human activities to train more capable robots.

Centific’s Role in AI Boom, Databricks $134B Valuation, Alien Hunter Funding | Dec 16, 2025

The Information's TITV·8 months ago

Diffusion Models Unlocked Non-Expert, Scalable Data Collection

Previously, imitation learning required a single expert to collect perfectly consistent data, a major bottleneck. Diffusion models made it possible to train on multi-modal data from many non-expert collectors, who may each perform the same task differently. This shifted the challenge from finding niche experts to building scalable data acquisition and processing systems.

Sunday Robotics: Scaling the Home Robot Revolution with Co-Founders Tony Zhao and Cheng Chi

No Priors: Artificial Intelligence | Technology | Startups·8 months ago

Scarce, Actively Generated Data Is the New Moat for Robotics and Biology AI

The future of valuable AI lies not in models trained on the abundant public internet, but in those built on scarce, proprietary data. For fields like robotics and biology, this data doesn't exist to be scraped; it must be actively created, making the data generation process itself the key competitive moat.

Josh Wolfe & Brett McGurk – Venture, Geopolitics, and the Next Frontier (EP.476)

Capital Allocators – Inside the Institutional Investment Industry·8 months ago

Better Data Unlocked Transformers for Robotics, Not Vice-Versa

The adoption of powerful AI architectures like transformers in robotics was bottlenecked by data quality, not algorithmic invention. Only after data collection methods improved to capture more dexterous, high-fidelity human actions did these advanced models become effective, reversing the typical 'algorithm-first' narrative of AI progress.

Sunday Robotics: Scaling the Home Robot Revolution with Co-Founders Tony Zhao and Cheng Chi

No Priors: Artificial Intelligence | Technology | Startups·8 months ago

Humanoid Robot Companies Sell Hardware at a Loss to Gather Valuable Training Data

Firms are deploying consumer robots not for immediate profit but as a data acquisition strategy. By selling hardware below cost, they collect vast amounts of real-world video and interaction data, which is the true asset used to train more advanced and capable AI models for future applications.

What in the world: predictions for 2026

Economist Podcasts·7 months ago
