/

© 2026 RiffOn. All rights reserved.

Get your free personalized podcast brief

We scan new podcasts and send you the top 5 insights daily.

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis
Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis · Jun 3, 2026

Ali Behrouz introduces Nested Learning: new AI architectures for continual learning by updating system parts at different frequencies.

AI Models Need an Offline 'Sleep' Phase to Consolidate Memories and Generate 'Dreams'

Inspired by human sleep, AI models can enter an offline mode. During this 'sleep,' they consolidate new knowledge from fast-updating layers into slow-updating ones via distillation. They also 'dream' by generating synthetic data from recent experiences to form new abstractions and connections.

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures thumbnail

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·2 months ago

The Attention Mechanism Is an 'Infinite Frequency' Module That Acts as a Perfect but Temporally-Unaware Memory

Attention can be understood as an update module with an infinite frequency. It acts as a perfect cache, accessing the entire context at once. However, this is also its weakness: it lacks an inherent understanding of temporal dependency and sequential reasoning, requiring positional encodings as a crutch.

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures thumbnail

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·2 months ago

Self-Modifying AI Architectures Outperform Static Ones by Learning Their Own Update Rules

A self-referential or self-modifying model, which generates its own update values based on its current state and inputs, is more powerful than a static one. This process is akin to 'learning how to learn,' allowing for greater adaptability and performance on sequential reasoning tasks.

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures thumbnail

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·2 months ago

AI Architectures and Optimizers Are Both Learning Rules Operating on Different Contexts

The distinction between a model's architecture and its optimizer is an illusion. Both are learning processes compressing a flow of context—the architecture compresses tokens, while the optimizer compresses gradients. This unified view allows for designing them as one interconnected system.

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures thumbnail

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·2 months ago

True Continual Learning AI Eliminates the Train/Test Distinction, Operating in Active vs. Offline Phases

A genuinely continual learner doesn't have separate training and testing phases. Instead, its life is a continuous process divided into two modes: an 'active' phase of interacting with new data and an 'offline' sleep phase for memory consolidation and self-improvement.

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures thumbnail

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·2 months ago

AI 'Dreaming' Connects Seemingly Irrelevant Concepts to Generate Novel Insights from Recent Experiences

The 'dreaming' phase in continual learning isn't just for memory consolidation. It serves to actively find connections between concepts that seem unrelated based on recent experiences. This process allows the model to form new, higher-level abstractions and insights, mirroring a key function of human dreaming.

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures thumbnail

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·2 months ago

AI Scaling Will Shift From Stacking Layers to Nesting Different Update Frequencies

Future AI expressivity won't come from adding more identical layers, but from 'nesting' levels with different update frequencies. This allows some parts of the system to adapt rapidly (like working memory) while others preserve core knowledge (long-term memory), mimicking human cognition.

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures thumbnail

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·2 months ago

Backpropagation Is a Form of In-Context Learning, Reframing Pre-Training as Associative Memory

The entire deep learning paradigm, including backpropagation, can be viewed as a form of in-context learning. This reframes the pre-training phase not as a separate process, but as the model forming a long-term associative memory, unifying it with inference-time adaptation.

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures thumbnail

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·2 months ago

AI Development Should Focus on Understanding Human Needs, Not Replicating Human Intelligence

The goal of AI development shouldn't be to perfectly replicate human cognition, a complex and perhaps unfalsifiable target. Instead, a more pragmatic approach is to draw high-level inspiration from nature to build novel forms of intelligence designed specifically to understand and serve human needs.

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures thumbnail

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·2 months ago

Continual Learning Will Create a Diverse 'Ecology of AIs' as Models Specialize by Forgetting

Rather than one model ruling all, continual learning could lead to a diverse ecosystem of specialized AIs. Over time, models personalized to specific users or tasks will naturally forget irrelevant information. This differentiation is a feature, not a bug, potentially creating a more stable and less monolithic AI landscape.

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures thumbnail

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·2 months ago

Continual Learning Is a Double-Edged Sword for AI, Offering Personal Alignment and Unprecedented Privacy Risks

Models that learn continually present a fundamental tradeoff. They offer the opportunity to deeply align with an individual user's values and needs over time. However, this same capability creates a huge risk, as the model could continuously learn and retain sensitive personal information.

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures thumbnail

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·2 months ago

Nested Learning Architectures Prove Superiority by Simultaneously Learning Multiple Unseen Languages In-Context

While transformers fail, nested learning models (Hope) can learn to translate two previously unseen languages at the same time within a single context. This demonstrates superior memory management, as different frequency layers handle different levels of abstraction, preventing the catastrophic forgetting seen in standard architectures.

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures thumbnail

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·2 months ago