When an LLM is shown few-shot examples of a new task, it is performing Bayesian updating. With each example provided in the prompt, its belief (posterior probability) about the correct next token shifts, allowing it to "learn" a new pattern on the fly without changing its weights.
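A toy sketch of that updating process, with hypothetical candidate "rules" standing in for the patterns a model might entertain. This is an illustration of Bayesian updating itself, not a claim about transformer internals:

```python
# Toy sketch (not an LLM): Bayesian updating over candidate rules as
# few-shot examples arrive, mirroring how each in-context example shifts
# the posterior over which pattern to continue.

def normalize(d):
    total = sum(d.values())
    return {k: v / total for k, v in d.items()}

# Hypothetical hypotheses about what maps input -> output in the prompt.
hypotheses = {
    "uppercase": lambda s: s.upper(),
    "reverse":   lambda s: s[::-1],
    "identity":  lambda s: s,
}

# Uniform prior over hypotheses.
posterior = {name: 1 / len(hypotheses) for name in hypotheses}

# Few-shot examples shown "in context".
examples = [("abc", "cba"), ("hi", "ih")]

for x, y in examples:
    # Likelihood: ~1 if the rule explains the example, else near zero.
    likelihood = {name: (1.0 if fn(x) == y else 1e-6)
                  for name, fn in hypotheses.items()}
    posterior = normalize({name: posterior[name] * likelihood[name]
                           for name in hypotheses})

print(max(posterior, key=posterior.get))  # → reverse
```

After two consistent examples the posterior concentrates almost entirely on the "reverse" rule, with no weights changed anywhere, only beliefs.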
A useful mental model for an LLM is a giant matrix where each row is a possible prompt and columns represent next-token probabilities. This matrix is impossibly large but also extremely sparse, as most token combinations are gibberish. The LLM's job is to efficiently compress and approximate this matrix.
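A back-of-envelope calculation makes the "impossibly large" part concrete. The vocabulary size and context length below are illustrative round numbers, not figures from the source:

```python
# Back-of-envelope: the "prompt -> next-token distribution" lookup table
# is astronomically large, so it can only exist implicitly, compressed
# into model weights. (Illustrative numbers.)

vocab_size = 50_000    # a typical BPE vocabulary
context_len = 1_000    # a modest context window

# Rows in the hypothetical lookup table: one per possible prompt.
num_rows = vocab_size ** context_len
print(f"rows ≈ 10^{len(str(num_rows)) - 1}")

# A 7-billion-parameter model approximates this table with ~7e9 numbers.
# The compression is only feasible because almost every row is gibberish
# and the meaningful rows share heavy structure.
```

The gap between ~10^4698 rows and ~10^10 parameters is exactly why the matrix must be sparse and structured for any model to approximate it.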
In a 2018 interview, OpenAI's Greg Brockman described their foundational training method: ingesting thousands of books with the sole task of predicting the next word. This simple predictive objective was the key that unlocked complex, generalizable language understanding in their models.
People struggle with AI prompts because the model lacks background on their goals and progress. The solution is 'Context Engineering': creating an environment where the AI continuously accumulates user-specific information, materials, and intent, reducing the need for constant prompt tweaking.
The current limitation of LLMs is their stateless nature; they reset with each new chat. The next major advancement will be models that can learn from interactions and accumulate skills over time, evolving from a static tool into a continuously improving digital colleague.
Contrary to the view that in-context learning is a distinct process from training, Karpathy speculates it might be an emergent form of gradient descent happening within the model's layers. He cites papers showing that transformers can learn to perform linear regression in-context, with internal mechanics that mimic an optimization loop.
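The experiments in those papers test whether a transformer, given (x, y) pairs in its context, predicts a held-out query as if it had run an optimizer like the one below. This is the reference algorithm the transformer is hypothesized to emulate internally, not the transformer itself:

```python
# Plain gradient descent on squared error -- the optimization loop that
# in-context linear regression is compared against.

def gd_linear_regression(pairs, steps=500, lr=0.05):
    """Fit y = w*x + b by gradient descent on mean squared error."""
    w, b = 0.0, 0.0
    n = len(pairs)
    for _ in range(steps):
        grad_w = sum(2 * (w * x + b - y) * x for x, y in pairs) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in pairs) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

# "In-context" examples drawn from the ground truth y = 3x + 1.
context = [(x, 3 * x + 1) for x in [-1.0, 0.0, 1.0, 2.0]]
w, b = gd_linear_regression(context)
print(round(w, 2), round(b, 2))  # → 3.0 1.0
```

The striking finding is that a trained transformer's in-context predictions can match this loop's output, suggesting its forward pass implements something functionally similar.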
Research suggests a formal equivalence between modifying a model's internal activations (steering) and providing prompt examples (in-context learning). This framework could potentially create a formula to convert between the two techniques, even for complex behaviors like jailbreaks.
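For reference, activation steering, the "modifying internal activations" side of that equivalence, is often implemented as a difference-of-means direction added to a hidden state. The sketch below uses toy vectors and hypothetical activations, no real model:

```python
# Minimal sketch of difference-of-means activation steering: build a
# direction from contrastive activations, then add it to a hidden state.
# All vectors here are toy, illustrative values.

def mean(vectors):
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]

# Hypothetical hidden states from prompts with / without a target behavior.
h_with    = [[1.0, 0.2], [0.8, 0.4]]
h_without = [[0.1, 0.3], [0.3, 0.1]]

# Steering vector: mean activation difference between the two sets.
steer = [a - b for a, b in zip(mean(h_with), mean(h_without))]

def apply_steering(h, alpha=1.0):
    """Shift an activation toward the behavior direction."""
    return [hi + alpha * si for hi, si in zip(h, steer)]

print(apply_steering([0.5, 0.5]))
```

The claimed equivalence would let one compute, for a given steering vector, the set of prompt examples that produces the same activation shift, and vice versa.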
Contrary to the belief that memorization requires multiple training epochs, large language models demonstrate the capacity to perfectly recall specific information after seeing it only once. This surprising phenomenon highlights how understudied the information theory behind LLMs still is.
While prompt optimization is theoretically appealing, OpenPipe's team evaluated prompt-optimization methods like GEPA and found they provided only minor boosts. Their RL fine-tuning methods delivered vastly superior results (96% vs 56% on a benchmark), suggesting weight updates still trump prompt engineering for complex tasks.
The distinction between imitation learning and reinforcement learning (RL) is not a rigid dichotomy. Next-token prediction in LLMs can be framed as a form of RL where the "episode" is just one token long and the reward is based on prediction accuracy. This conceptual model places both learning paradigms on a continuous spectrum rather than in separate categories.
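This framing can be checked on a toy softmax. Below, the expected REINFORCE gradient for a one-token episode (reward 1 on the observed token, 0 otherwise) comes out parallel to the supervised cross-entropy gradient, scaled by the probability of the correct token. The derivation is an illustration of the spectrum idea, not something stated in the source:

```python
import math

logits = [2.0, 0.5, -1.0]   # scores over a toy 3-token vocabulary
target = 0                  # index of the observed next token

exps = [math.exp(z) for z in logits]
Z = sum(exps)
probs = [e / Z for e in exps]

# Supervised view: gradient of cross-entropy -log pi(target) w.r.t. logits.
ce_grad = [p - (1.0 if i == target else 0.0) for i, p in enumerate(probs)]

# RL view: one-token episode, reward R(a) = 1 iff a == target.
# Expected REINFORCE gradient: sum_a pi(a) * R(a) * d(-log pi(a))/d logits.
def dneglogpi(a, i):
    return probs[i] - (1.0 if i == a else 0.0)

rl_grad = [sum(probs[a] * (1.0 if a == target else 0.0) * dneglogpi(a, i)
               for a in range(len(logits)))
           for i in range(len(logits))]

# The RL gradient equals the supervised one scaled by pi(target):
# same direction, so both updates push the policy the same way.
print(all(abs(r - probs[target] * c) < 1e-12
          for r, c in zip(rl_grad, ce_grad)))  # → True
```

The scaling factor is the only difference, which is one way to see imitation learning and RL as points on a continuum rather than separate categories.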
Overloading LLMs with excessive context degrades performance, a phenomenon known as 'context rot'. Claude Skills address this by loading context only when relevant to a specific task. This laser-focused approach improves accuracy and avoids the performance degradation seen in broader project-level contexts.
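The pattern behind this can be sketched as a two-tier context: an always-on index of one-line skill descriptions, with full instructions loaded only when a task matches. The names, matching rule, and structure below are hypothetical illustrations, not Anthropic's implementation:

```python
# Hypothetical sketch of on-demand context loading: cheap one-line skill
# descriptions stay in context; a skill's full instructions are injected
# only when the task appears to need them.

SKILLS = {
    "pdf-forms": {
        "description": "Fill out PDF forms",
        "body": "FULL PDF-FORMS INSTRUCTIONS (many tokens, loaded on demand)",
    },
    "brand-style": {
        "description": "Apply company brand guidelines",
        "body": "FULL BRAND-STYLE GUIDE (many tokens, loaded on demand)",
    },
}

def build_context(task: str) -> str:
    # Always-on tier: one line per skill, cheap in tokens.
    index = "\n".join(f"- {name}: {s['description']}"
                      for name, s in SKILLS.items())
    # On-demand tier: naive keyword match on the skill name's first word.
    loaded = [s["body"] for name, s in SKILLS.items()
              if name.split("-")[0] in task.lower()]
    return index + ("\n" + "\n".join(loaded) if loaded else "")

print("FULL PDF-FORMS" in build_context("fill this pdf"))   # → True
print("FULL PDF-FORMS" in build_context("write a poem"))    # → False
```

Irrelevant tasks never pay the token cost of the full instructions, which is the mechanism that sidesteps context rot.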