AI Solves Math but Not Unemployment Because Societal Problems Lack Verifiable Answers

Related Insights

Evaluating AI Success in Subjective Fields Is the Technology's Hardest Unsolved Problem

AI excels where success is quantifiable (e.g., code generation). Its greatest challenge lies in subjective domains like mental health or education. Progress requires a messy, societal conversation to define 'success,' not just a developer-built technical leaderboard.

AI: The new frontier for mental health support?

Masters of Scale·6 months ago

Andrej Karpathy: AI Excels at Verifiable Tasks, Explaining its 'Jagged Frontier'

Andrej Karpathy's 'Software 2.0' framework posits that AI automates tasks that are easily *verifiable*. This explains the 'jagged frontier' of AI progress: fields like math and code, where correctness is verifiable, advance rapidly. In contrast, creative and strategic tasks, where success is subjective and hard to verify, lag significantly behind.

Bezos Launches AI Startup, GPT-4o Debate, LeCun’s LLM Revolt | Eric Glyman, Stacy Rasgon, Luca Ferrari, Healey Cypher, John Tenet, Reed Duchscher

TBPN·6 months ago

AI Models Are Over-Specialized 'Competitive Programmers'

Current AI models resemble a student who grinds 10,000 hours on a narrow task. They achieve superhuman performance on benchmarks but lack the broad, adaptable intelligence of someone with less specific training but better general reasoning. This explains the gap between eval scores and real-world utility.

Ilya Sutskever – The age of scaling is over

Dwarkesh Podcast·5 months ago

AI Models Struggle Most with Uncodified 'Taste-Based' Expert Knowledge

AI performs poorly in areas where expertise is based on unwritten 'taste' or intuition rather than documented knowledge. If the correct approach doesn't exist in training data or isn't explicitly provided by human trainers, models will inevitably struggle with that particular problem.

Brendan Foody on Teaching AI and the Future of Knowledge Work

Conversations with Tyler·4 months ago

Predictive AI Often Fails by Highlighting Systemic Problems It Cannot Solve

The promise of "techno-solutionism" falls flat when AI is applied to complex social issues. An AI project in Argentina meant to predict teen pregnancy simply confirmed that poverty was the root cause—a conclusion that didn't require invasive data collection and that technology alone could not fix, exposing the limits of algorithmic intervention.

Living in the Shadow of AI

The Next Big Idea Daily·6 months ago

The Inability to Verify 'Correctness' in the Real World Limits AI Self-Improvement

Demis Hassabis identifies a key obstacle for AGI. Unlike in math or games where answers can be verified, the messy real world lacks clear success metrics. This makes it difficult for AI systems to use self-improvement loops, limiting their ability to learn and adapt outside of highly structured domains.

Best of Big Technology: Demis Hassabis On AGI, Deceptive AIs, Building a Virtual Cell

Big Technology Podcast·4 months ago

AI Reasoning Fails to Generalize from Puzzles to Messy, Real-World Tasks

Hopes that AI's new reasoning skills in checkable domains like math and code would generalize to ambiguous, real-world tasks like booking a flight did not materialize. This failure of 'reasoning generalization' was a major technical roadblock that forced experts to lengthen AGI timelines.

What the hell happened with AGI timelines in 2025?

80,000 Hours Podcast·3 months ago

AI's Utility Is Bottlenecked by Human Verification, Especially for Non-Visual Outputs

AI can generate vast amounts of content, but its value is limited by our ability to verify its accuracy. This is fast for visual outputs (images, UI) where our eyes instantly spot flaws, but slow and difficult for abstract domains like back-end code, math, or financial data, which require deep expertise to validate.

Balaji & Benedict Evans: When Tech Breaks Industries

The a16z Show·3 months ago

AI Systems Fail from Flawed Societal Models, Not Inadequate Algorithms

AI systems often collapse because they are built on the flawed assumption that humans are logical and society is static. Real-world failures, from Soviet economic planning to modern systems, stem from an inability to model human behavior, data manipulation, and unexpected events.

949: Why AI Keeps Failing Society, with Stanford professor Alex “Sandy” Pentland

Super Data Science: ML & AI Podcast with Jon Krohn·5 months ago

AI Models Are Over-trained 'Competitive Programmers' Who Lack Real-World Judgment

AI models excel at specific tasks (like evals) because they are trained exhaustively on narrow datasets, akin to a student practicing 10,000 hours for a coding competition. While they become experts in that domain, they fail to develop the broader judgment and generalization skills needed for real-world success.

Dwarkesh and Ilya Sutskever on What Comes After Scaling

The a16z Show·5 months ago

Get your free personalized podcast brief

Related Insights