Synthesia initially targeted Hollywood with AI dubbing—a "vitamin" for experts. They found a much larger, "house-on-fire" problem by building a platform for the billions of people who couldn't create video at all, democratizing the medium instead of just improving it for existing professionals.
Today's dominant AI tools like ChatGPT are perceived as productivity aids, akin to "homework helpers." The next multi-billion dollar opportunity is in creating the go-to AI for fun, creativity, and entertainment—the app people use when they're not working. This untapped market focuses on user expression and play.
Don't view generative AI video as just a way to make traditional films more efficiently. Ben Horowitz sees it as a fundamentally new creative medium, much like movies were to theater. It enables entirely new forms of storytelling by making visuals that once required massive budgets accessible to anyone.
Instead of fearing competitors who copy their product, Synthesia's founder sees them as a net positive. The increased competition generates more market iterations and signals, helping them discover the most valuable use cases for the new technology faster than they could alone, while also sharpening their focus.
The democratization of technology via AI shifts the entrepreneurial goalpost. Instead of focusing on creating a handful of billion-dollar "unicorns," the more impactful ambition is to empower millions of people to each build a million-dollar "donkey corn" business, truly broadening economic opportunity.
While today's focus is on text-based LLMs, the true, defensible AI battleground will be in complex modalities like video. Generating video requires multiple interacting models and unique architectures, creating far greater potential for differentiation and a wider competitive moat than text-based interfaces, which will become commoditized.
The company's founding insight stemmed from the poor quality of Polish movie dubbing, where one monotone voice narrates all characters. This specific, local pain point highlighted a universal desire for emotionally authentic, context-aware voice technology, proving that niche frustrations can unlock billion-dollar opportunities.
For companies with jaw-dropping technology, it's easy to chase 'wow moments' and PR instead of solving real problems. Synthesia instills a core value of 'utility over novelty,' obsessing over delivering value for enterprise customers rather than getting lost in the novelty of their own tech.
The real economic value of generative video lies in advertising, not filmmaking. Unlike movies with finite consumption, there is unlimited demand for personalized, diverse ad content. This makes advertising a perfect fit for the technology's scalable content creation capabilities.
Business owners and experts uncomfortable with content creation can now scale their presence. By cloning their voice (e.g., with 11labs) and pairing it with an AI video avatar (e.g., with HeyGen), they can produce high volumes of expert content without stepping in front of a camera, removing a major adoption barrier.
The founders, not being PhD AI researchers, knew they couldn't rely on being acqui-hired by a tech giant. This perceived weakness became a strength, forcing them to relentlessly focus on finding customers and building a sustainable business from day one, unlike many research-led AI startups of that era.