ByteDance's Seedance 2.0 model integrates audio generation directly with video, a novel approach that suggests China may be starting to leapfrog the US in specific AI capabilities. This challenges the common narrative that China is only a fast follower in the AI race.
While today's focus is on text-based LLMs, the true, defensible AI battleground will be in complex modalities like video. Generating video requires multiple interacting models and unique architectures, creating far greater potential for differentiation and a wider competitive moat than text-based interfaces, which will become commoditized.
China is gaining an efficiency edge in AI by using "distillation": training smaller, cheaper models on the outputs of larger ones. This "train the trainer" approach is much faster than training from scratch and challenges the capital-intensive US strategy, highlighting how inefficient and "bloated" current Western foundation models are.
Joe Tsai reframes the US-China 'AI race' as a marathon won by adoption speed, not model size. He notes China’s focus on open source and smaller, specialized models (e.g., for mobile devices) is designed for faster proliferation and practical application. The goal is to diffuse technology throughout the economy quickly, rather than simply building the single most powerful model.
With the release of OpenAI's new video generation model, Sora 2, a surprising inversion has occurred. The generated video is so realistic that the accompanying AI-generated audio is now the more noticeable and identifiable artificial component, signaling a new frontier in multimedia synthesis.
China's lead in open-source AI models is, counterintuitively, a deliberate strategy. Open releases attract global developer talent, which accelerates China's own progress. They also commoditize software, which complements China's national strength in hardware manufacturing, a classic competitive tactic.
Challenging the narrative of pure technological competition, Jensen Huang points out that American AI labs and startups significantly benefited from Chinese open-source contributions like the DeepSeek model. This highlights the global, interconnected nature of AI research, where progress in one nation directly aids others.
By natively embedding a full suite of AI tools for video generation, editing, and ideation, TikTok is evolving beyond a content distribution platform. It is becoming a self-contained creation engine, reducing creator reliance on third-party apps and positioning itself to challenge YouTube's dominance.
While the US focuses on creating the most advanced AI models, China's real strength may be its proven ability to orchestrate society-wide technology adoption. Deep integration and widespread public enthusiasm for AI could ultimately provide a more durable competitive advantage.
According to DeepMind CEO Demis Hassabis, while Chinese AI models are rapidly closing the capability gap with US counterparts, they have yet to demonstrate the ability to create truly novel breakthroughs, such as a new architecture on the scale of the transformer. Their strength lies in catching up to the frontier, not pushing beyond it.
While the US leads in closed, proprietary AI models like OpenAI's, Chinese companies now dominate the leaderboards for open-source models. Because they are cheaper and easier to deploy, these Chinese models are seeing rapid global uptake, challenging the perceived US lead in AI through wider diffusion and application.