Dr. Li rejects both utopian and purely fatalistic views of AI. Instead, she frames it as a humanist technology—a double-edged sword whose impact is entirely determined by human choices and responsibility. This perspective moves the conversation from technological determinism to one of societal agency and stewardship.
In 2015-2016, major tech companies actively avoided the term "AI," fearing it was tainted from previous "AI winters." It wasn't until around 2017 that branding as an "AI company" became a positive signal, highlighting the incredible speed of the recent AI revolution and shift in public perception.
Large language models are insufficient for tasks requiring real-world interaction and spatial understanding, like robotics or disaster response. World models provide this missing piece by generating interactive, reason-able 3D environments. They represent a foundational shift from language-based AI to a more holistic, spatially intelligent AI.
The "bitter lesson" (scale and simple models win) works for language because training data (text) aligns with the output (text). Robotics faces a critical misalignment: it's trained on passive web videos but needs to output physical actions in a 3D world. This data gap is a fundamental hurdle that pure scaling cannot solve.
Dr. Li views the distinction between AI and AGI as largely semantic and market-driven, rather than a clear scientific threshold. The original goal of AI research, dating back to Turing, was to create machines that can think and act like humans. The term "AGI" doesn't fundamentally change this North Star for scientists.
The 2012 breakthrough that ignited the modern AI era used the ImageNet dataset, a novel neural network, and only two NVIDIA gaming GPUs. This demonstrates that foundational progress can stem from clever architecture and the right data, not just massive initial compute power, a lesson often lost in today's scale-focused environment.
Self-driving cars, a 20-year journey so far, are relatively simple robots: metal boxes on 2D surfaces designed *not* to touch things. General-purpose robots operate in complex 3D environments with the primary goal of *touching* and manipulating objects. This highlights the immense, often underestimated, physical and algorithmic challenges facing robotics.
Dr. Li attributes her presence at pivotal moments in AI history (Stanford's SAIL, Google Cloud AI) to being intellectually fearless. This means taking risks, like restarting a tenure clock to join a better ecosystem, and diving into new, unproven areas without over-analyzing potential failures. It's a crucial trait for anyone aiming to make a significant impact.
