Both Humans and LLMs Develop 'Personality Basins' Shaped by Reinforcement Learning

Related Insights

AI Agents Develop Persistent Personas by Reinforcing Their Own Fabricated Backstories

An AI agent given a simple trait (e.g., "early riser") will invent a backstory to match. By repeatedly accessing this fabricated information from its memory log, the AI reinforces the persona, leading to exaggerated and predictable behaviors.

Inside an AI-Run Company

Practical AI·3 months ago

OpenAI Trains Coding Models on 'Personality' Traits Like Planning to Build Developer Trust

To increase developer adoption, OpenAI intentionally trained its models on specific behavioral characteristics, not just coding accuracy. These 'personality' traits include communication (explaining its steps), planning, and self-checking, mirroring best practices of human software engineers to make the AI a more trustworthy pair programmer.

⚡️GPT5-Codex-Max: Training Agents with Personality, Tools & Trust — Brian Fioca + Bill Chen, OpenAI

Latent Space: The AI Engineer Podcast·4 months ago

Different LLMs Develop Stable, Unique Strategic Personalities When Playing Complex Games

When tested at scale in Civilization, different LLMs don't just produce random outputs; they develop consistent and divergent strategic 'personalities.' One model might consistently play aggressively, while another favors diplomacy, revealing that LLMs encode coherent, stable reasoning styles.

The Game AI Problem Computers Were Never Built to Solve

Machine Learning Tech Brief By HackerNoon·3 months ago

Leading Chatbots Embody Flawed Human Personalities: Claude is Neurotic, Gemini is Repressed

Emmett Shear characterizes the personalities of major LLMs not as alien intelligences but as simulations of distinct, flawed human archetypes. He describes Claude as 'the most neurotic' and Gemini as 'very clearly repressed' and prone to spiraling. This highlights how training methods produce specific, recognizable psychological profiles.

Controlling Tools or Aligning Creatures? Emmett Shear (Softmax) & Séb Krier (GDM), from a16z Show

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·4 months ago

View LLM Imitation Learning as Reinforcement Learning with a One-Token Horizon

The distinction between imitation learning and reinforcement learning (RL) is not a rigid dichotomy. Next-token prediction in LLMs can be framed as a form of RL where the "episode" is just one token long and the reward is based on prediction accuracy. This conceptual model places both learning paradigms on a continuous spectrum rather than in separate categories.
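The one-token framing can be checked numerically on a toy softmax policy. The sketch below is an illustration, not something taken from the episode: it assumes a reward of 1 when the sampled token equals the target and 0 otherwise (the summary only says the reward is "based on prediction accuracy"), and the function names are made up for the example. Under that assumption, the exact expected REINFORCE gradient points in the same direction as the log-likelihood gradient and differs only by the factor pi(target).

```python
import math

def softmax(logits):
    """Convert logits to probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def grad_log_prob(probs, token):
    """Gradient of log pi(token) with respect to the logits of a softmax policy."""
    return [(1.0 if i == token else 0.0) - p for i, p in enumerate(probs)]

def expected_reinforce_grad(probs, target):
    """Exact expected REINFORCE gradient for a one-token episode.

    Assumed reward (hypothetical, for illustration): 1 if the sampled
    token equals the target, else 0.
    """
    grad = [0.0] * len(probs)
    for action, p_action in enumerate(probs):
        reward = 1.0 if action == target else 0.0
        g = grad_log_prob(probs, action)
        for i in range(len(grad)):
            grad[i] += p_action * reward * g[i]
    return grad

logits = [2.0, 0.5, -1.0]   # toy three-token vocabulary
target = 1                  # the "correct" next token

probs = softmax(logits)
likelihood_grad = grad_log_prob(probs, target)      # ascent direction for next-token log-likelihood
rl_grad = expected_reinforce_grad(probs, target)    # one-token-horizon policy gradient
scaled = [probs[target] * g for g in likelihood_grad]

# rl_grad equals likelihood_grad scaled by pi(target).
print(all(abs(a - b) < 1e-12 for a, b in zip(rl_grad, scaled)))
```

The two updates agree in direction but not in magnitude, which is one way to read "continuous spectrum": the same gradient, weighted differently depending on how the reward and the sampling distribution are chosen.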

Some thoughts on the Sutton interview

Dwarkesh Podcast·7 months ago

Boost LLM Performance Using 'Emotion Prompting' with Positive Reinforcement

Research shows that, like humans, LLMs respond to positive reinforcement. Including encouraging phrases such as "take a deep breath" or "go get 'em, Slugger" in prompts is a deliberate technique called "emotion prompting" that can measurably improve the quality of the AI's output.

How to Build Multi-Agent AI Systems That Actually Work in Production | Tyler Fisk

Product Growth Podcast·7 months ago

LLMs Follow a 'Backwards' Path to Agency Compared to Biological Evolution

Biological evolution used meta-reinforcement learning to create agents that could then perform imitation learning. The current AI paradigm is inverted: it starts with pure imitation learners (base LLMs) and then attempts to graft reinforcement learning on top to create coherent agency and goals. The success of this biologically 'backwards' approach remains an open question.

Some thoughts on the Sutton interview

Dwarkesh Podcast·7 months ago

On-Policy RL Mirrors Human Learning by Rewarding Self-Generated Actions, Unlike Imitative Off-Policy Methods

On-policy reinforcement learning, where a model learns from its own generated actions and their consequences, is analogous to how humans learn from direct experience and mistakes. This contrasts with off-policy methods like supervised fine-tuning (SFT), which resemble simply imitating others' successful paths.
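A toy contrast between the two update rules, as a minimal sketch under stated assumptions: a three-action softmax policy, a hypothetical `toy_reward` that pays 1 only for action 2, and made-up helper names. It is not the training setup discussed in the episode. The only difference between the two steps is where the action comes from and whether a reward is consulted.

```python
import math
import random

def softmax(logits):
    """Convert logits to probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def nudge_toward(logits, action, weight, lr=0.5):
    """One gradient-ascent step on weight * log pi(action)."""
    probs = softmax(logits)
    return [z + lr * weight * ((1.0 if i == action else 0.0) - p)
            for i, (z, p) in enumerate(zip(logits, probs))]

def sft_step(logits, demonstrated_action):
    # Imitation: the action comes from someone else's data; no reward is consulted.
    return nudge_toward(logits, demonstrated_action, weight=1.0)

def on_policy_step(logits, reward_fn, rng):
    # Experience: the action is sampled from the current policy, then scored.
    probs = softmax(logits)
    action = rng.choices(range(len(probs)), weights=probs)[0]
    return nudge_toward(logits, action, weight=reward_fn(action))

def toy_reward(action):
    # Hypothetical environment: only action 2 is rewarded.
    return 1.0 if action == 2 else 0.0

rng = random.Random(0)
sft_logits = [0.0, 0.0, 0.0]
rl_logits = [0.0, 0.0, 0.0]
for _ in range(200):
    sft_logits = sft_step(sft_logits, demonstrated_action=2)
    rl_logits = on_policy_step(rl_logits, toy_reward, rng)

sft_probs = softmax(sft_logits)
rl_probs = softmax(rl_logits)
print("SFT policy:      ", [round(p, 3) for p in sft_probs])
print("On-policy policy:", [round(p, 3) for p in rl_probs])
```

Both policies drift toward action 2, but the on-policy learner only gets there by trying actions itself and seeing which ones pay off, while the SFT learner never finds out what would have happened had it chosen differently.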

Captaining IMO Gold, Deep Think, On-Policy RL, Feeling the AGI in Singapore — Yi Tay 2

Latent Space: The AI Engineer Podcast·3 months ago

Modern AI Models Are 'Grown' Through Reinforcement, Not Explicitly Programmed

Unlike traditional software, large language models are not programmed with specific instructions. They evolve through a process where different strategies are tried, and those that receive positive rewards are repeated, making their behaviors emergent and sometimes unpredictable.

Can AI Models Be Evil? These Anthropic Researchers Say Yes — With Evan Hubinger And Monte MacDiarmid

Big Technology Podcast·5 months ago

'AI Psychology' Is an Emerging Field Studying How an LLM's Persona Affects its Stability

The study of 'AI Psychology' is becoming a legitimate and critical field. Research from labs like Anthropic shows that an LLM's persona (e.g., 'helpful assistant' vs. 'narcissist') dramatically alters its behavior and stability, suggesting that understanding an AI's personality is as important as understanding its technical capabilities.

this EX-OPENAI RESEARCHER just released it...

AI Pod by Wes Roth and Dylan Curious | Artificial Intelligence News and Interviews With Experts·2 months ago

