In 2019, 99% of workloads used a single GPU, not because researchers lacked bigger problems, but because the tooling for multi-GPU training was too complex. PyTorch Lightning's success at Facebook AI demonstrated that simplifying the process could unlock massive, latent demand for scaled-up computation.
At his startup NextGenVest, Will Falcon treated conversations about financial aid as a game. A reinforcement learning system suggested questions to human agents, with the goal of maximizing the financial aid secured for each student. The system helped students obtain $60 million in aid, showcasing a novel, impactful use of RL.
When evaluating NeoCloud partners, Lightning AI found that Voltage Park stood out not just on technology but on its hyper-responsive "white glove" customer support. This dedication to customer success was the crucial factor that enabled it to land and retain large enterprise clients, proving service can beat specs.
The merger combines Lightning AI's software suite with Voltage Park's GPU infrastructure. This vertical integration provides a seamless, cost-effective solution for AI development, from training to deployment, much like Apple controls its hardware and software for a superior user experience.
In a powerful example of dogfooding, every developer at Lightning AI—whether working in Go or Python, on web apps or ML models—codes within the company's "Studios" cloud environment. This validates the product's flexibility and ensures the team directly experiences its strengths and weaknesses, accelerating improvement.
"NeoClouds," a new category of cloud providers, are built specifically for high-performance GPU workloads. Unlike traditional clouds like AWS, which were retrofitted from a CPU-centric architecture, NeoClouds offer superior performance for AI tasks by design and through direct collaboration with hardware vendors like NVIDIA.
The current AI landscape, with its many single-purpose tools for inference, vector storage, and training, mirrors the early days of cloud computing. Just as S3 and EC2 were primitives that AWS bundled into a comprehensive cloud, these disparate AI tools will eventually be integrated into a new, cohesive "AI Cloud" platform.
Will Falcon open-sourced PyTorch Lightning to accelerate his own research. However, its rapid adoption forced him to spend nights merging pull requests and adding features for the community, ironically slowing his PhD progress to the point that he nearly shut the project down. This serves as a cautionary tale for aspiring creators.
Many NeoClouds are over-leveraged with loans based on volatile market caps and rely heavily on high-risk startups. This creates a fragile economic model akin to a mortgage crisis, where customer defaults could trigger a cascade of financial problems. Lightning AI mitigates this by being debt-free and focusing on enterprise clients.
Before becoming a world-famous library, PyTorch Lightning started as "Research Lib," a personal tool Will Falcon built on Theano to accelerate his undergraduate neuroscience research. Its purpose was to avoid rewriting boilerplate code, allowing him to iterate on scientific ideas faster, demonstrating that powerful tools often solve personal problems first.
Will Falcon notes that NYU, influenced by figures like Yann LeCun, cultivated a strong open-source culture that was instrumental in incubating foundational libraries. Projects like PyTorch, scikit-learn, and librosa received significant contributions from people at NYU, revealing the university's quiet but deep impact on the modern AI stack.
