Get your free personalized podcast brief

We scan new podcasts and send you the top 5 insights daily.

Cuban advises companies to treat their internal data and research as a competitive advantage. Applying for patents or publishing research makes that IP public, allowing any AI model to train on it and instantly commoditize the knowledge, destroying the company's unique edge.

Related Insights

Public internet data has been largely exhausted for training AI models. The real competitive advantage and source for next-generation, specialized AI will be the vast, untapped reservoirs of proprietary data locked inside corporations, like R&D data from pharmaceutical or semiconductor companies.

Mark Cuban warns that patenting work makes it public, allowing any AI model to train on it instantly. To maintain a competitive data advantage, he suggests companies should increasingly rely on trade secrets, keeping their valuable IP out of the public domain and away from competitors' models.

A key competitive advantage for AI companies lies in capturing proprietary outcomes data by owning a customer's end-to-end workflow. This data, such as which legal cases are won or lost, is not publicly available. It creates a powerful feedback loop where the AI gets smarter at predicting valuable outcomes, a moat that general models cannot replicate.

The pace of AI development means a startup's competitive advantage can be erased overnight by the next model release from a major lab like Google or Anthropic. Dr. el Kaliouby stresses that true defensibility now requires more than just a proprietary algorithm; it demands unique data, distribution, or IP that cannot be easily replicated.

For deep tech hardware firms like Cerebras, intellectual property protection goes beyond patents. Because patents require public disclosure, a more effective strategy involves a combination of trade secrets and segmenting the manufacturing process across different partners, preventing any single entity from understanding the complete design.

Since LLMs are commodities, sustainable competitive advantage in AI comes from leveraging proprietary data and unique business processes that competitors cannot replicate. Companies must focus on building AI that understands their specific "secret sauce."

Cuban predicts a "SaaS apocalypse" where generic software is easily replaced by AI. The survivors will be companies whose value lies not just in software but in a unique, proprietary database of information that cannot be easily replicated by training a public LLM.

As AI models become commoditized, the ultimate defensibility comes from exclusive access to a unique dataset. A startup with a slightly inferior model but a comprehensive, proprietary dataset (e.g., all legal records) will beat a superior, general-purpose model for specialized tasks, creating a powerful long-term advantage.

Companies create defensibility by generating unique, non-public data through their operations (e.g., legal case outcomes). This proprietary data improves their own models, creating a feedback loop and a compounding advantage that large, generalist labs like OpenAI cannot replicate.

The initial AI boom was fueled by scraping the public internet. Cuban predicts the next phase will be dominated by exclusive data deals. Content owners, like medical journals, will protect their IP and auction it to the highest-bidding AI companies, creating valuable data silos.