The most valuable data for training enterprise AI is not a company's internal documents, but a recording of the actual work processes people use to create them. The ideal training scenario is for an AI to act like an intern, learning directly from human colleagues, which is far more informative than static knowledge bases.
Effective enterprise AI deployment involves running human and AI workflows in parallel. When the AI fails, it generates a data point for fine-tuning. When the human fails, it becomes a training moment for the employee. This "tandem system" creates a continuous feedback loop for both the model and the workforce.
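A minimal sketch of such a tandem harness, assuming a hypothetical `run_in_tandem` helper and a simple equality check against ground truth: AI failures become fine-tuning pairs, human failures become coaching moments.

```python
from dataclasses import dataclass, field

@dataclass
class TandemLog:
    """Collects the two failure streams from a human/AI parallel run."""
    finetune_examples: list = field(default_factory=list)  # AI was wrong
    coaching_moments: list = field(default_factory=list)   # human was wrong

def run_in_tandem(task, human_answer, ai_answer, ground_truth, log):
    """Route each side's failure into the right feedback loop."""
    if ai_answer != ground_truth:
        # AI failure -> (input, correct output) pair for fine-tuning
        log.finetune_examples.append({"input": task, "target": ground_truth})
    if human_answer != ground_truth:
        # Human failure -> flagged as a training moment for the employee
        log.coaching_moments.append(
            {"task": task, "given": human_answer, "expected": ground_truth}
        )

log = TandemLog()
run_in_tandem("classify invoice #17", human_answer="approve",
              ai_answer="reject", ground_truth="approve", log=log)
run_in_tandem("classify invoice #18", human_answer="reject",
              ai_answer="approve", ground_truth="approve", log=log)
```

In practice the "ground truth" is usually the reviewed, shipped outcome rather than a label known up front, but the two-bucket routing is the same.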
Rather than programming AI agents with a company's formal policies, a more powerful approach is to let them observe thousands of actual "decision traces." This allows the AI to discover the organization's emergent, de facto rules—how work *actually* gets done—creating a more accurate and effective world model for automation.
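One simple way to surface those de facto rules is frequency mining over (situation, decision) pairs. The sketch below is illustrative, not a production method: `mine_de_facto_rules` and the expense scenarios are hypothetical, and a rule is kept only when one decision dominates a situation above a support threshold.

```python
from collections import Counter, defaultdict

def mine_de_facto_rules(traces, min_support=0.8):
    """From (situation, decision) traces, keep the decisions taken
    consistently enough to count as an emergent, de facto rule."""
    by_situation = defaultdict(Counter)
    for situation, decision in traces:
        by_situation[situation][decision] += 1
    rules = {}
    for situation, counts in by_situation.items():
        decision, n = counts.most_common(1)[0]
        if n / sum(counts.values()) >= min_support:
            rules[situation] = decision  # dominant behavior, not policy
    return rules

traces = [
    ("expense<$50", "auto-approve"), ("expense<$50", "auto-approve"),
    ("expense<$50", "auto-approve"), ("expense<$50", "auto-approve"),
    ("expense<$50", "escalate"),
    ("expense>$5k", "escalate"), ("expense>$5k", "auto-approve"),
]
rules = mine_de_facto_rules(traces)
```

Here the small-expense rule clears the 80% support bar even though the written policy may say something else; the large-expense case stays ambiguous and yields no rule.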
Anthropic strategically focuses on "vision in" (AI understanding visual information) over "vision out" (image generation). This mimics a real developer who needs to interpret a user interface to fix it, but can delegate image creation to other tools or people. The core bet is that the primary bottleneck is reasoning, not media generation.
Training AI agents to execute multi-step business workflows demands a new data paradigm. Companies create reinforcement learning (RL) environments—mini world models of business processes—where agents learn by attempting tasks, a step beyond prompt-completion fine-tuning (SFT) and preference-based tuning (RLHF).
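The shape of such an environment can be sketched in a few lines. This toy example (the invoice workflow and class name are invented for illustration) follows the usual reset/step RL interface: the agent is rewarded only for completing the multi-step process in order.

```python
class InvoiceWorkflowEnv:
    """Toy RL environment: the agent must execute workflow steps in
    order (validate -> match_po -> approve) to earn the reward."""
    STEPS = ["validate", "match_po", "approve"]

    def reset(self):
        self.cursor = 0
        return {"next_expected": self.STEPS[0]}

    def step(self, action):
        if action == self.STEPS[self.cursor]:
            self.cursor += 1
            done = self.cursor == len(self.STEPS)
            reward = 1.0 if done else 0.0
            obs = {"next_expected": None if done else self.STEPS[self.cursor]}
            return obs, reward, done
        # Wrong step: episode ends with no reward
        return {"next_expected": None}, 0.0, True

env = InvoiceWorkflowEnv()
obs = env.reset()
total = 0.0
for action in ["validate", "match_po", "approve"]:
    obs, reward, done = env.step(action)
    total += reward
```

A real business-process environment adds stochastic documents, tool calls, and partial observability, but the contract—attempt, observe, get rewarded on outcomes—is the same.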
Shift your view of AI from a passive chatbot to an active knowledge-capture system. The greatest value comes from AI designed to prompt team members for their unique insights, then storing and attributing that information. This transforms fleeting tribal knowledge into a permanent, searchable organizational asset.
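The storage side of such a knowledge-capture system is straightforward; the sketch below (a hypothetical `KnowledgeBase` with keyword search standing in for semantic retrieval) shows the key requirement: every insight is stored with attribution and a timestamp.

```python
import datetime

class KnowledgeBase:
    """Captures tribal knowledge as attributed, searchable entries."""
    def __init__(self):
        self.entries = []

    def capture(self, author, question, insight):
        self.entries.append({
            "author": author,          # attribution is the point
            "question": question,      # the prompt that elicited it
            "insight": insight,
            "recorded_at": datetime.datetime.now(
                datetime.timezone.utc).isoformat(),
        })

    def search(self, keyword):
        kw = keyword.lower()
        return [e for e in self.entries
                if kw in e["insight"].lower() or kw in e["question"].lower()]

kb = KnowledgeBase()
kb.capture("priya", "Why do we ship on Tuesdays?",
           "Carrier rates drop midweek; the Tuesday cutoff locks the discount.")
hits = kb.search("carrier")
```

The active part—having the AI ask team members the right questions at the right moment—sits on top of a store like this.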
Instead of repeatedly performing tasks, knowledge workers will train AI agents by creating "evals"—data sets that teach the AI how to handle specific workflows. This fundamental shift means the economy will transition from paying for human execution to paying for human training data.
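An eval in this sense is just expert-authored cases plus a scoring loop. The refund-routing cases and `run_eval`/`naive_agent` names below are hypothetical; the point is that the worker's labor shifts from answering each ticket to writing the expected answers once.

```python
def run_eval(agent, cases):
    """Score an agent against a workflow eval: a list of
    {input, expected} cases written by a domain expert."""
    passed = sum(1 for c in cases if agent(c["input"]) == c["expected"])
    return passed / len(cases)

# Hypothetical eval set for a refund-routing workflow
refund_eval = [
    {"input": "damaged item, order < 30 days", "expected": "full_refund"},
    {"input": "buyer remorse, order > 30 days", "expected": "deny"},
    {"input": "damaged item, order > 30 days",  "expected": "escalate"},
]

def naive_agent(text):
    # A deliberately crude baseline to score against the eval
    return "full_refund" if "damaged" in text else "deny"

score = run_eval(naive_agent, refund_eval)  # passes 2 of 3 cases
```

The failing third case is exactly the kind of edge the expert's judgment encodes, and exactly what the next fine-tuning round targets.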
Off-the-shelf AI models can only go so far. The true bottleneck for enterprise adoption is "digitizing judgment"—capturing the unique, context-specific expertise of employees within a given company. A document's meaning can change entirely from one company to another, so the labeling must be done internally.
To build coordinated AI agent systems, firms must first extract siloed operational knowledge. This involves not just digitizing documents but systematically observing employee actions like browser clicks and phone calls to capture unwritten processes, turning this tacit knowledge into usable context for AI.
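Concretely, the raw observations are just timestamped events; grouping them by work session recovers the ordered, unwritten process. The event schema and `events_to_process_traces` helper below are illustrative assumptions.

```python
from collections import defaultdict

def events_to_process_traces(events):
    """Group raw observed actions (clicks, calls) by work session,
    in time order, so the unwritten process becomes explicit context."""
    sessions = defaultdict(list)
    for e in sorted(events, key=lambda e: e["ts"]):
        sessions[e["session"]].append(e["action"])
    return dict(sessions)

events = [
    {"session": "s1", "ts": 2, "action": "open_crm_record"},
    {"session": "s1", "ts": 1, "action": "answer_phone"},
    {"session": "s1", "ts": 3, "action": "log_ticket"},
]
traces = events_to_process_traces(events)
```

A trace like `answer_phone -> open_crm_record -> log_ticket` is exactly the tacit "how we actually handle a call" knowledge that never appears in the process docs.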
The trend of buying expensive, simulated Reinforcement Learning (RL) environments is misguided. The most effective and valuable training ground is the live application itself. Companies can achieve better results by using logs and traces from actual users, which provide the most accurate data for agent improvement.
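Harvesting that live data can be as simple as filtering production logs on a real-world success signal. The log fields and `user_accepted` flag below are assumptions; any acceptance signal (edit-free sends, thumbs-up, closed tickets) plays the same role.

```python
def logs_to_training_examples(logs):
    """Turn production request/response logs into training pairs,
    keeping only interactions the user actually accepted."""
    return [
        {"input": rec["request"], "target": rec["response"]}
        for rec in logs
        if rec.get("user_accepted")  # live success signal, no simulator
    ]

logs = [
    {"request": "draft reply to ticket 42", "response": "Hi, thanks...",
     "user_accepted": True},
    {"request": "summarize account 7", "response": "(rejected draft)",
     "user_accepted": False},
]
examples = logs_to_training_examples(logs)
```

The rejected traces are not wasted either: they are candidates for the failure-analysis side of the loop.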
The ultimate value of AI will be its ability to act as a long-term corporate memory. By feeding it historical data—ICPs, past experiments, key decisions, and customer feedback—companies can create a queryable "brain" that dramatically accelerates onboarding and institutional knowledge transfer.
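The query layer over such a memory is a retrieval problem. As a stand-in for the embedding search a real system would use, this sketch ranks historical records by keyword overlap; the record schema and `query_memory` name are hypothetical.

```python
def query_memory(records, question):
    """Rank historical records (decisions, experiments, feedback)
    by keyword overlap with the question; drop non-matches."""
    q_terms = set(question.lower().split())
    scored = [
        (len(q_terms & set(r["text"].lower().split())), r)
        for r in records
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [r for score, r in scored if score > 0]

records = [
    {"id": "exp-12", "text": "pricing experiment raised churn in SMB segment"},
    {"id": "icp-3",  "text": "ideal customer profile is mid-market fintech"},
]
hits = query_memory(records, "what happened in the pricing experiment")
```

A new hire asking "have we tried raising prices?" gets `exp-12` back immediately—institutional memory that would otherwise live only in a veteran's head.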