/
© 2026 RiffOn. All rights reserved.

Get your free personalized podcast brief

We scan new podcasts and send you the top 5 insights daily.

  1. Latent Space: The AI Engineer Podcast
  2. World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI
World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI

Latent Space: The AI Engineer Podcast · Dec 6, 2025

General Intuition CEO Pim discusses building world models from Metal's 3.8B gaming clips, creating a unique data moat for spatial intelligence.

World Models Can Generate Physical Effects Not Present in Their Training Data

GI discovered their world model, trained on game footage, could generate a realistic camera shake during an in-game explosion—a physical effect not part of the game's engine. This suggests the models are learning an implicit understanding of real-world physics and can generate plausible phenomena that go beyond their source material.

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI thumbnail

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI

Latent Space: The AI Engineer Podcast·4 months ago

General Intuition's Robotics Strategy Focuses on Robots Controllable by Game Inputs

GI is not trying to solve robotics in general. Their strategy is to focus on robots whose actions can be mapped to a game controller. This constraint dramatically simplifies the problem, allowing their foundation models trained on gaming data to be directly applicable, shifting the burden for robotics companies from expensive pre-training to more manageable fine-tuning.

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI thumbnail

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI

Latent Space: The AI Engineer Podcast·4 months ago

Data-Rich Companies Must Build Their Own Models to Discover Their Asset's True Value

When approached by large labs for licensing deals, GI's founder advises against simply selling the data. He argues the only way to accurately value a unique dataset is to model it yourself to understand its true capabilities. Without this, founders risk massively undervaluing their core asset, as its potential is unknown.

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI thumbnail

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI

Latent Space: The AI Engineer Podcast·4 months ago

Metal's 'Tesla-Style' Retroactive Clipping Creates a Dataset of Peak Human Performance

Instead of continuous recording, Metal's software lets gamers save the last 30 seconds *after* an interesting event. This behavior, similar to Tesla's bug reporting, automatically filters the data, creating a massive dataset composed almost entirely of noteworthy, high-skill, or out-of-distribution moments, which is ideal for AI training.

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI thumbnail

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI

Latent Space: The AI Engineer Podcast·4 months ago

Metal's Gaming Platform Won by First Dominating a Utility Tool Before Building a Social Network

While competitors tried to build a social network and a recording tool simultaneously, Metal focused exclusively on creating the best video capture tool. By solving a critical user pain point first, they achieved massive scale (tens of millions of users), which they then leveraged to bootstrap a thriving social network on top of existing user behavior.

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI thumbnail

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI

Latent Space: The AI Engineer Podcast·4 months ago

Game Data Surpasses YouTube for Training Spatial Reasoning by Simulating Embodied Action

GI's founder argues game footage is a superior data source for spatial reasoning compared to real-world videos. Gaming directly links visual perception to hand-eye motor control ("simulating optical dynamics with your hand"), avoiding the information loss inherent in interpreting passive video, which requires solving for pose estimation and inverse dynamics.

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI thumbnail

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI

Latent Space: The AI Engineer Podcast·4 months ago

General Intuition Creates Cleaner Training Data by Logging Abstract Actions, Not Keystrokes

To protect user privacy, GI's system translates raw keyboard inputs (e.g., 'W' key) into their corresponding in-game actions (e.g., 'move forward'). This privacy-by-design approach has a key ML benefit: it removes noisy, user-specific key bindings and provides a standardized, canonical action space for training more generalizable agents.

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI thumbnail

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI

Latent Space: The AI Engineer Podcast·4 months ago

Advanced AI Game Bots Are a Key Player Retention Tool for Developers

General Intuition's first commercial use case for its human-like AI agents isn't a consumer product, but a B2B tool for game developers. High-quality bots are crucial for retaining players by ensuring full lobbies during off-peak hours when human player numbers are low, providing a clear, revenue-generating entry point for their sophisticated AI.

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI thumbnail

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI

Latent Space: The AI Engineer Podcast·4 months ago