To manage the overwhelming pace of AI advancements, the Minimax team built an internal AI agent. The tool automatically tracks new articles, papers, and blog posts, then triages, summarizes, and analyzes them. This "internal researcher" filters the information firehose for the human team.
Instead of a single "think then act" cycle, Minimax trains its M2 model to repeatedly pause and rethink after receiving feedback from the environment. This iterative "interleaved thinking" approach improves robustness and performance on long-horizon tasks where tool responses or conditions are unpredictable.
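The loop described above can be sketched in a few lines. This is an illustrative toy, not MiniMax's actual API: the point is that the model produces a fresh reasoning step after every tool response, rather than committing to one upfront plan. All names (`run_agent`, `toy_model`, the message schema) are hypothetical.

```python
def run_agent(task, model, tools, max_steps=10):
    """Interleaved-thinking loop: think -> act -> observe -> re-think."""
    history = [{"role": "user", "content": task}]
    for _ in range(max_steps):
        # The model emits a new reasoning trace plus its next action,
        # conditioned on everything observed so far.
        step = model(history)  # -> {"thought": ..., "action": ..., "final": ...}
        history.append({"role": "assistant", "content": step})
        if step.get("final") is not None:
            return step["final"]  # task finished
        # Execute the chosen tool and feed its (possibly surprising) result
        # back, so the next thinking step can react to it.
        result = tools[step["action"]["name"]](**step["action"]["args"])
        history.append({"role": "tool", "content": result})
    return None  # gave up after max_steps

# Toy demo: a stub "model" that looks a value up, then finishes.
def toy_model(history):
    last = history[-1]
    if last["role"] == "tool":
        return {"thought": "got the value, done", "action": None,
                "final": last["content"]}
    return {"thought": "need the value first",
            "action": {"name": "lookup", "args": {"key": "x"}},
            "final": None}

answer = run_agent("what is x?", toy_model, {"lookup": lambda key: {"x": 42}[key]})
```

The key design point is that `model(history)` is called again after every tool result, so an unexpected response changes the plan instead of derailing it.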
Minimax discovered that robust AI agent generalization comes from systematically varying the model's entire operational environment—including system prompts, chat templates, and tool responses—not just from increasing the number of tools it's trained on. A dedicated perturbation pipeline injects this variation during training.
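A perturbation pipeline of this kind can be sketched as a sampler that draws a different environment configuration per training episode. Everything below is an assumption for illustration (the prompt strings, template names, and formats are invented), but it shows the mechanism: the same tool output gets rendered under randomly varied prompts, templates, and response formats so the agent can't overfit to one surface form.

```python
import json
import random

# Illustrative pools of environment variants (hypothetical contents).
SYSTEM_PROMPTS = [
    "You are a coding assistant.",
    "You are an autonomous software agent. Use tools when needed.",
]
CHAT_TEMPLATES = ["chatml", "plain"]
TOOL_FORMATS = ["json", "plain"]

def format_tool_response(payload, fmt):
    """Render the same tool output in a randomly assigned format."""
    if fmt == "json":
        return json.dumps(payload)
    return "\n".join(f"{k}: {v}" for k, v in payload.items())

def sample_environment(seed=None):
    """Draw one perturbed environment for a training episode."""
    rng = random.Random(seed)
    fmt = rng.choice(TOOL_FORMATS)
    return {
        "system_prompt": rng.choice(SYSTEM_PROMPTS),
        "chat_template": rng.choice(CHAT_TEMPLATES),
        "format_tool_response": lambda payload: format_tool_response(payload, fmt),
    }

env = sample_environment(seed=0)
rendered = env["format_tool_response"]({"exit_code": 0})
```

Seeding per episode keeps each rollout reproducible while still covering the full space of variants across the training run.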
A Minimax researcher explains that unlike academia, work at the industry's frontier involves problems so new that no literature exists. The job shifts from applying existing papers to deep, fundamental, first-principles thinking to find novel solutions for entirely unsolved challenges.
Minimax enhances its reinforcement learning process by treating its own expert developers as scalable reward models. These developers participate directly in the training cycle, identifying desirable behaviors and providing precise feedback on complex coding tasks, which creates a model tailored to professional workflows.
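One minimal way to picture "developers as reward models" is a reward function that blends automated signals (e.g., test pass rate) with an expert's rating when one is available. This is a hedged sketch under my own assumptions, not MiniMax's actual reward design; the function name, weighting scheme, and score scale are all invented for illustration.

```python
def combined_reward(tests_passed, tests_total, expert_score=None, expert_weight=0.5):
    """Blend an automated signal with an optional expert developer rating.

    tests_passed / tests_total: automated check results for the rollout.
    expert_score: developer's rating in [0, 1], or None if unlabeled.
    """
    auto = tests_passed / tests_total
    if expert_score is None:
        return auto  # fall back to the automated signal alone
    # Interpolate between automated and human judgment; the human term
    # captures qualities tests miss (style, maintainability, intent).
    return (1 - expert_weight) * auto + expert_weight * expert_score

r = combined_reward(8, 10, expert_score=1.0)  # tests mostly pass, expert approves
```

The design choice worth noting: expert labels are sparse and expensive, so the function degrades gracefully to the automated signal when no human feedback exists for a rollout.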
Minimax builds both foundation models and user-facing applications in-house. This structure enables research and engineering teams to work side-by-side, getting direct feedback from internal developers to rapidly identify and address model weaknesses, ensuring models meet real-world needs.
A researcher from Minimax describes the volatile nature of training large models, where a single day can swing dramatically between highs and lows. They joke about having "ICU in the morning and then KTV at night," reflecting how promising results can suddenly turn into critical bugs, and vice versa.
While debugging stalled model accuracy, Minimax's team found that running the LM head in FP32 precision during reinforcement learning was critical. Lower precision opened a gap between the theoretical algorithm and its practical implementation that kept the model from improving, underscoring how much low-level engineering details matter.
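The idea can be illustrated with a small NumPy sketch (an assumption for clarity, not MiniMax's code): even when the trunk runs in half precision, the final logit matmul and log-softmax are upcast to FP32, so the log-probabilities consumed by the RL loss aren't distorted by low-precision rounding. NumPy's `float16` stands in here for the bf16 typically used in training.

```python
import numpy as np

def lm_head_fp32(hidden, weight):
    """Compute log-probs with the LM head upcast to FP32.

    hidden: [batch, seq, d_model] half-precision activations.
    weight: [vocab, d_model] half-precision LM-head weights.
    """
    # Upcast BEFORE the matmul so accumulation happens in FP32.
    logits = hidden.astype(np.float32) @ weight.astype(np.float32).T
    # Numerically stable log-softmax, also in FP32.
    z = logits - logits.max(axis=-1, keepdims=True)
    return z - np.log(np.exp(z).sum(axis=-1, keepdims=True))

# Half-precision activations/weights, as a stand-in for a bf16 trunk.
h = np.random.randn(2, 4, 8).astype(np.float16)
w = np.random.randn(16, 8).astype(np.float16)
logp = lm_head_fp32(h, w)
```

The subtle failure mode the anecdote points at: if sampling and the loss compute log-probs at different precisions, the policy gradient is estimated against slightly wrong probabilities, which can silently flatten learning curves.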
