The Post-Scraping Era Creates a New Freelance Economy for "Human Data Experts"

Related Insights

Mercore’s $10B Valuation Proves Human Expertise Is AI's Most Valuable Fuel

AI startup Mercore's valuation quintupled to $10B by connecting AI labs with domain experts to train models. This reveals that the most critical bottleneck for advanced AI is not just data or compute, but reinforcement learning from highly skilled human feedback, creating a new "RL economy."

#178: OpenAI’s Automated AI Researcher, OpenAI Restructuring, The Fed Warns About AI’s Impact on Hiring, Nvidia Hits $5 Trillion & Wharton Data on AI ROI

The Artificial Intelligence Show·7 months ago

LLMs Have Exhausted the Public Web; The Next Performance Leap is Human Expert Data

LLMs have hit a wall by scraping nearly all available public data. The next phase of AI development and competitive differentiation will come from training models on high-quality, proprietary data generated by human experts. This creates a booming "data as a service" industry for companies like Micro One that recruit and manage these experts.

Netflix buys WB + why Jason should run Disney | E2219

This Week in Startups·5 months ago

The Emergence of the "AI Trainer" Role for Niche Expertise

To move beyond general knowledge, AI firms are creating a new role: the "AI Trainer." These are not contractors but full-time employees, typically PhDs with deep domain expertise and a computer science interest, tasked with systematically improving model competence in specific fields like physics or mathematics.

Why data is the biggest AI bottleneck (feat. Arthur Mensch of Mistral AI) | E2212

This Week in Startups·6 months ago

AI's Data Needs Have Shifted from "Labeling Sweatshops" to "Strategic Research Accelerators"

The era of simple data labeling is over. Frontier AI models now require complex, expert-generated data to break current capabilities and advance research. Data providers like Turing now act as strategic research partners to AI labs, not just data factories.

Inside The $2.2B AI Research Accelerator | Turing

Sourcery·7 months ago

Mercor CEO: Knowledge Work is Evolving From 'Doing' to 'Training' AI Agents

Instead of repeatedly performing tasks, knowledge workers will train AI agents by creating "evals"—data sets that teach the AI how to handle specific workflows. This fundamental shift means the economy will transition from paying for human execution to paying for human training data.

Suno Sparks Music Rights Firestorm, Travis Kelce’s Six Flags Play | Philip Johnston, Justin Murphy, Darren Rovell, Guillermo Rauch, Brendan Foody

TBPN·7 months ago

The AI Bottleneck Has Shifted from Compute to Data

For years, access to compute was the primary bottleneck in AI development. Now, as public web data is largely exhausted, the limiting factor is access to high-quality, proprietary data from enterprises and human experts. This shifts the focus from building massive infrastructure to forming data partnerships and expertise.

Why data is the biggest AI bottleneck (feat. Arthur Mensch of Mistral AI) | E2212

This Week in Startups·6 months ago

AI Labs Are Paying Experts Millions Daily to Train Their Replacements in Simulated "RL Gyms"

Companies like OpenAI and Anthropic are spending billions creating simulated enterprise apps (RL gyms) where human experts train AI models on complex tasks. This has created a new, rapidly growing "AI trainer" job category, but its ultimate purpose is to automate those same expert roles.

#168: The AI Economy, How People Use ChatGPT, AI-Native Companies, Meta Ray-Ban Display AI Glasses & How Americans View AI

The Artificial Intelligence Show·8 months ago

Mercore's Rise Signals a New "Reinforcement Learning Economy" for Elite Human Experts

Mercore's $500M revenue in 17 months highlights a shift in AI training. The focus is moving from low-paid data labelers to a marketplace of elite experts like doctors and lawyers providing high-quality, nuanced data. This creates a new, lucrative gig economy for top-tier professionals.

#170: How ChatGPT Is Used at Work, New GDPval Benchmark, AI “Workslop,” ChatGPT Pulse, Meta Vibes & More AI Economy Warnings

The Artificial Intelligence Show·8 months ago

Your AI Data Costs Are Rising for Two Reasons

Data is becoming more expensive not from scarcity, but because the work has evolved. Simple labeling is over. Costs are now driven by the need for pricey domain experts for specialized data preparation and creative teams to build complex, synthetic environments for training agents.

20VC: Cohere's Chief Scientist on Why Scaling Laws Will Continue | Whether You Can Buy Success in AI with Talent Acquisitions | The Future of Synthetic Data & What It Means for Models | Why AI Coding is Akin to Image Generation in 2015 with Joelle Pineau

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch·7 months ago

OpenAI's 'Project Mercury' Reveals Its Playbook for Automating High-Skilled Professions

By paying over 100 former Wall Street bankers to train its models on complex financial tasks, OpenAI is creating a template for vertical AI dominance. This 'expert-as-a-contractor' model will be replicated across law, accounting, and consulting to systematically automate lucrative knowledge work sectors.

#176: ChatGPT Atlas, ChatGPT Atlas Security Issues, Letter to Pause Superintelligence, Amazon’s Plan to Automate 600,000 Jobs & New Data on AI Relationships

The Artificial Intelligence Show·7 months ago

Get your free personalized podcast brief

Related Insights