The popular theory that the market for raw data would explode has not been borne out. The number of companies buying data has not grown significantly, and in some sectors, such as hedge funds, it has even shrunk. The boom in data-oriented roles has not translated into a boom in data purchasing.

Related Insights

The industry has already exhausted the public web data used to train foundational AI models, a point underscored by the phrase "we've already run out of data." The next leap in AI capability and business value will come from harnessing the vast, proprietary data currently locked behind corporate firewalls.

Hedge funds have a constant, daily need to make informed buy, sell, or hold decisions, creating a clear business problem that data solves. Corporations often lack this frequent, high-stakes decision-making cycle, making the value proposition of external data less immediate and harder to justify.

Public internet data has been largely exhausted for training AI models. The real competitive advantage, and the data source for next-generation, specialized AI, will be the vast, untapped reservoirs of proprietary data locked inside corporations, such as R&D data from pharmaceutical or semiconductor companies.

Unlike traditional B2B markets where only ~5% of customers are buying at any time, the AI boom has pushed nearly 100% of companies to seek solutions at once. This temporary gold rush warps perception of market size, creating a risk of over-investment similar to the COVID-era software bubble.

Despite a long-standing data-science-driven investment thesis, Foresite Capital founder Jim Tananbaum states that AI tools have not yet measurably improved investment returns. The technology is still maturing, highlighting a reality gap between the hype around AI in venture capital and its current practical impact.

Many leaders focus on data for backward-looking reporting, treating it like infrastructure. The real value comes from using data strategically for prediction and prescription. This requires foundational investment in technology, architecture, and machine learning capabilities to forecast what will happen and what actions to take.

For years, access to compute was the primary bottleneck in AI development. Now, as public web data is largely exhausted, the limiting factor is access to high-quality, proprietary data from enterprises and human experts. This shifts the focus from building massive infrastructure to forming data partnerships and expertise.

While data labeling companies show massive revenue growth, their customer base is often limited to a few frontier AI labs. This creates a lopsided market where providers have little leverage, compete on price, and are heavily dependent on a handful of clients, making the ecosystem potentially unstable.

While AI investment has exploded, US productivity has barely risen. Valuations are priced as if a societal transformation is complete, yet 95% of GenAI pilots fail to positively impact company P&Ls. This gap between market expectation and real-world economic benefit creates systemic risk.

The boom in tools for data teams faded because the Total Addressable Market (TAM) was overestimated. Investors and founders pattern-matched the data space to larger markets like cloud and dev tools, but the actual number of teams with the budget and need for sophisticated data tooling proved to be much smaller.