The foundational concept for modern LLMs, the attention mechanism, originated from an intern, Dzmitry Bahdanau, in Yoshua Bengio's lab. The idea was so brilliant that its potential for success was immediately apparent upon explanation, before it was even coded.
To pioneer neural machine translation, Prof. Kyunghyun Cho and his team deliberately limited their review of past research. They believed reading too much would impose false constraints from outdated contexts, preventing them from developing a system from scratch with fresh thinking.
While demoing an early attention-based translation system, Prof. Cho's team discovered it could fill in an "unknown" country token. Given "unknown Korea is an enemy of United States," it output "North Korea," and with "friend," it output "South Korea," revealing emergent world knowledge.
Prof. Kyunghyun Cho contrasts the "isolated" research styles in Korea and Finland with North America's, which he describes as an "extremely collective affair." He believes the constant influx of global talent automatically fosters a collaborative environment that accelerates innovation, a model he aims to replicate.
Prof. Kyunghyun Cho recounts that Yoshua Bengio pushed his lab toward machine translation not just for the task itself, but because it exhibited core AI challenges like handling variable-length sequences and vanishing gradients. Solving translation meant solving these deeper, more general problems.
When William Falcon, founder of Lightning AI, wanted to build his company while completing his PhD, his advisor Kyunghyun Cho told him to stop. Cho framed both as "200% jobs," arguing that attempting both would compromise the success of each, and advised taking a leave of absence instead.
Prof. Cho argues that modern models already extract most correlations from passive datasets. The next leap in sample efficiency will come from AI agents that can actively choose what data to collect, intentionally making rare, insightful events ("aha moments") more frequent.
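One established form of "choosing what data to collect" is active learning via uncertainty sampling. The sketch below is a hypothetical, minimal illustration of that general idea, not Prof. Cho's system; all names and the toy model are invented for the example.

```python
import math

def entropy(p):
    """Binary entropy of a predicted probability p; peaks at p = 0.5."""
    if p <= 0 or p >= 1:
        return 0.0
    return -(p * math.log(p) + (1 - p) * math.log(1 - p))

def select_queries(unlabeled, predict, budget):
    """Instead of sampling the pool uniformly, pick the `budget`
    examples the model is least certain about -- the ones most likely
    to yield a rare, informative "aha" label."""
    scored = sorted(unlabeled, key=lambda x: entropy(predict(x)), reverse=True)
    return scored[:budget]

# Toy setup: pretend each item *is* the model's predicted probability,
# so items near 0.5 are the most uncertain.
pool = [0.01, 0.5, 0.95, 0.45, 0.99, 0.6]
print(select_queries(pool, predict=lambda x: x, budget=2))  # → [0.5, 0.45]
```

The design point is the selection loop, not the scoring function: swapping entropy for disagreement between ensemble members, or expected information gain, keeps the same shape while changing what counts as "worth collecting."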
The research on re-ranking that influenced Retrieval Augmented Generation (RAG) started with PhD student Rodrigo Nogueira's goal to create an AI researcher. He realized that before an AI could reason, it first needed a scalable way to navigate and retrieve relevant information from vast document sets.
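The pipeline shape behind retrieve-then-rerank can be sketched in a few lines: a cheap first stage narrows the corpus, then a costlier scorer re-orders the survivors. Both scoring functions below are toy stand-ins (term overlap), not the BERT-based re-rankers from Nogueira's actual work.

```python
def retrieve(query, docs, k):
    """First stage: cheap ranking by raw term overlap; keep top-k."""
    q = set(query.lower().split())
    scored = sorted(docs,
                    key=lambda d: len(q & set(d.lower().split())),
                    reverse=True)
    return scored[:k]

def rerank(query, docs):
    """Second stage: a finer score (here, overlap as a fraction of the
    document) applied only to the short list from the first stage."""
    q = set(query.lower().split())

    def score(d):
        words = set(d.lower().split())
        return len(q & words) / len(words)

    return sorted(docs, key=score, reverse=True)

docs = [
    "neural machine translation with attention",
    "attention is all you need",
    "cooking pasta at home",
]
shortlist = retrieve("attention translation", docs, k=2)
print(rerank("attention translation", shortlist))
```

The split matters for scale: the first stage must touch every document, so it has to be cheap; the expensive model only ever sees the shortlist.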
Instead of traditional problem sets, Professor Kyunghyun Cho teaches ML algorithms by building, from scratch, a complete web application that uses each concept. He demonstrates his entire workflow, including his prompts and interactions with coding agents, to show students how to build real-world systems.
When designing his machine learning course around AI coding agents, NYU Professor Kyunghyun Cho found that the vast majority (80%) of his 200 advanced computer science students had never installed one. This highlights a major adoption gap even among the most tech-savvy students.
Prof. Cho outlines two competing visions for world models. One camp believes in high-fidelity, step-by-step prediction (e.g., video generation). The other, which he and Yann LeCun favor, argues for abstract, high-level latent models that can plan without simulating every detail, akin to human thinking.
