General Intuition Creates Cleaner Training Data by Logging Abstract Actions, Not Keystrokes

Related Insights

General Intuition's Robotics Strategy Focuses on Robots Controllable by Game Inputs

GI is not trying to solve robotics in general. Their strategy is to focus on robots whose actions can be mapped to a game controller. This constraint dramatically simplifies the problem, allowing their foundation models trained on gaming data to be directly applicable, shifting the burden for robotics companies from expensive pre-training to more manageable fine-tuning.

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI

Latent Space: The AI Engineer Podcast·2 months ago

Agentic AI Training Requires Simulated 'RL Environments,' Not Just Traditional RLHF

Training AI agents to execute multi-step business workflows demands a new data paradigm. Companies create reinforcement learning (RL) environments—mini world models of business processes—where agents learn by attempting tasks, a more advanced method than simple prompt-completion training (SFT/RLHF).

20VC: Scale, Surge, Turing, Mercor: Who Wins & Who Loses in Data Labelling | Is Revenue in Data Labelling Real or GMV? | Why 99% of Knowledge Work Will Go and What Happens Then? | Why SaaS is Dead in a World of AI with Jonathan Siddharth @ Turing

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch·3 months ago

Design "Dual-Use" Tools for Both Humans and AI Agents to Maximize Utility

When building for AI-powered environments, design tools to be equally usable by humans and the AI model. An elegant, simple design for humans often translates directly into an effective tool for AI agents, simplifying development and promoting shared logic.

Inside Claude Code From the Engineers Who Built It

AI & I·4 months ago

Use AI to Generate Synthetic Data for Prototyping Workflows Without Risking Internal Information

To test complex AI prompts for tasks like customer persona generation without exposing sensitive company data, first ask the AI to create realistic, synthetic data (e.g., fake sales call notes). This allows you to safely develop and refine prompts before applying them to real, proprietary information, overcoming data privacy hurdles in experimentation.

The AI That Builds Apps for You (Claude Opus 4.5 Explained)

Marketing Against The Grain·3 months ago

Simulated RL Environments Are the Next Frontier for Training Capable AI Agents

Beyond supervised fine-tuning (SFT) and human feedback (RLHF), reinforcement learning (RL) in simulated environments is the next evolution. These "playgrounds" teach models to handle messy, multi-step, real-world tasks where current models often fail catastrophically.

The 100-person AI lab that became Anthropic and Google's secret weapon | Edwin Chen (Surge AI)

Lenny's Podcast: Product | Career | Growth·2 months ago

Game Data Surpasses YouTube for Training Spatial Reasoning by Simulating Embodied Action

GI's founder argues game footage is a superior data source for spatial reasoning compared to real-world videos. Gaming directly links visual perception to hand-eye motor control ("simulating optical dynamics with your hand"), avoiding the information loss inherent in interpreting passive video, which requires solving for pose estimation and inverse dynamics.

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI

Latent Space: The AI Engineer Podcast·2 months ago

AI Product Teams Must Analyze Raw, Messy User Inputs, Not Just Clean Test Prompts

Developers often test AI systems with well-formed, correctly spelled questions. However, real users submit vague, typo-ridden, and ambiguous prompts. Directly analyzing these raw logs is the most crucial first step to understanding how your product fails in the real world and where to focus quality improvements.

Evals, error analysis, and better prompts: A systematic approach to improving your AI products | Hamel Husain (ML engineer)

How I AI·4 months ago

AI's Core Bottleneck Is Poor Generalization, Not Scale

The most fundamental challenge in AI today is not scale or architecture, but the fact that models generalize dramatically worse than humans. Solving this sample efficiency and robustness problem is the true key to unlocking the next level of AI capabilities and real-world impact.

Ilya Sutskever – The age of scaling is over

Dwarkesh Podcast·3 months ago

Metal's 'Tesla-Style' Retroactive Clipping Creates a Dataset of Peak Human Performance

Instead of continuous recording, Metal's software lets gamers save the last 30 seconds *after* an interesting event. This behavior, similar to Tesla's bug reporting, automatically filters the data, creating a massive dataset composed almost entirely of noteworthy, high-skill, or out-of-distribution moments, which is ideal for AI training.

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI

Latent Space: The AI Engineer Podcast·2 months ago

The Frontier of AI Training Is Now Defining Better Benchmarks, Not Better Algorithms

As reinforcement learning (RL) techniques mature, the core challenge shifts from the algorithm to the problem definition. The competitive moat for AI companies will be their ability to create high-fidelity environments and benchmarks that accurately represent complex, real-world tasks, effectively teaching the AI what matters.

How Cognition Built the World's First AI Coding Agent—Before Claude Code

AI & I·5 months ago