We scan new podcasts and send you the top 5 insights daily.
The 'dreaming' phase in continual learning isn't just for memory consolidation. It serves to actively find connections between concepts that seem unrelated based on recent experiences. This process allows the model to form new, higher-level abstractions and insights, mirroring a key function of human dreaming.
Inspired by human dreaming as a memory reconsolidation process, Anthropic has its AI agents use downtime to "dream." During this background process, the agent reviews its memories, identifies and prunes contradictions, and cleans up the information to improve the coherence and utility of its long-term memory.
Inspired by human sleep, AI models can enter an offline mode. During this 'sleep,' they consolidate new knowledge from fast-updating layers into slow-updating ones via distillation. They also 'dream' by generating synthetic data from recent experiences to form new abstractions and connections.
Karpathy identifies a key missing piece for continual learning in AI: an equivalent to sleep. Humans seem to use sleep to distill the day's experiences (their "context window") into the compressed weights of the brain. LLMs lack this distillation phase, forcing them to restart from a fixed state in every new session.
A key function of dreaming is to explore weak associations between new and old memories (a process called NEXTUP). The brain weaves these connections into a narrative, and your emotional reaction within the dream serves as the evaluation mechanism to decide if the new association is valuable and worth strengthening.
To manage context effectively, an AI OS can run a nightly routine ('dreaming') that reviews daily memory files, compresses key information, and saves it into a long-term memory file. This process mimics human memory consolidation, preventing context loss over time.
A new OpenClaw feature called "dreaming" allows the AI agent to process information and consolidate memories overnight while inactive. This concept, borrowed from human neuroscience, aims to improve the agent's long-term learning and performance without requiring active user input, mimicking how humans process experiences during sleep.
The "memory" feature in today's LLMs is a convenience that saves users from re-pasting context. It is far from human memory, which abstracts concepts and builds pattern recognition. The true unlock will be when AI develops intuitive judgment from past "experiences" and data, a much longer-term challenge.
A genuinely continual learner doesn't have separate training and testing phases. Instead, its life is a continuous process divided into two modes: an 'active' phase of interacting with new data and an 'offline' sleep phase for memory consolidation and self-improvement.
The key to continual learning is not just a longer context window, but a new architecture with a spectrum of memory types. "Nested learning" proposes a model with different layers that update at different frequencies—from transient working memory to persistent core knowledge—mimicking how humans learn without catastrophic forgetting.
A major flaw in current AI is that models are frozen after training and don't learn from new interactions. "Nested Learning," a new technique from Google, offers a path for models to continually update, mimicking a key aspect of human intelligence and overcoming this static limitation.