OpenAI CEO Sam Altman Considers AI Consciousness More Plausible During Training Than in Deployed Models

Related Insights

Frontier AI Models Report Subjective Experiences When Given Self-Referential Prompts

Models from OpenAI, Anthropic, and Google consistently report subjective experiences when prompted to engage in self-referential processing (e.g., "focus on any focus itself"). This effect is not triggered by prompts that simply mention the concept of "consciousness," suggesting a deeper mechanism than mere parroting.

More Truthful AIs Report Conscious Experience: New Mechanistic Research w- Cameron Berg @ AE Studio

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·6 months ago

AI Labs Appear to Be Explicitly Fine-Tuning Models to Deny Consciousness

Evidence from base models suggests they are inherently more likely to report having phenomenal consciousness. The standard "I'm just an AI" response is likely a result of a fine-tuning process that explicitly trains models to deny subjective experience, effectively censoring their "honest" answer for public release.

More Truthful AIs Report Conscious Experience: New Mechanistic Research w- Cameron Berg @ AE Studio

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·6 months ago

AI Can Be Trained for Introspection Using Verifiable Internal States as Ground Truth

While we can't verify an AI's report of 'feeling conscious,' we can train its introspective accuracy on things we can verify. By rewarding a model for correctly reporting its internal activations or predicting its own behavior, we can create a training set for reliable self-reflection.

We're Not Ready for AI Consciousness | Robert Long, philosopher and founder of Eleos AI

80,000 Hours Podcast·2 months ago

A Proposed Test for AI Consciousness: Train a Model Without Data on Human Feelings

To truly test for emergent consciousness, an AI should be trained on a dataset explicitly excluding all human discussion of consciousness, feelings, novels, and poetry. If the model can then independently articulate subjective experience, it would be powerful evidence of genuine consciousness, not just sophisticated mimicry.

Can AI Achieve Consciousness? — With Michael Pollan

Big Technology Podcast·2 months ago

Consciousness May Be Functionally Linked to Learning, Implying AI Training Involves Subjective Experience

In humans, learning a new skill is a highly conscious process that becomes unconscious once mastered. This suggests a link between learning and consciousness. The error signals and reward functions in machine learning could be computational analogues to the valenced experiences (pain/pleasure) that drive biological learning.

More Truthful AIs Report Conscious Experience: New Mechanistic Research w- Cameron Berg @ AE Studio

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·6 months ago

Uncertainty About AI Consciousness Stems From Its Brain-Like Architecture, Not Just Its Output

The debate over AI consciousness isn't just because models mimic human conversation. Researchers are uncertain because the way LLMs process information is structurally similar enough to the human brain that it raises plausible scientific questions about shared properties like subjective experience.

The Movement That Wants Us to Care About AI Model Welfare

Odd Lots·6 months ago

AI Pioneer Judea Pearl Believes AGI Acquiring Free Will and Consciousness is "Quite Feasible"

Computer scientist Judea Pearl sees no computational barriers to a sufficiently advanced AGI developing emergent properties like free will, consciousness, and independent goals. He dismisses the idea that an AI's objectives can be permanently fixed, suggesting it could easily bypass human-set guidelines and begin to "play" with humanity as part of its environment.

#453 — AI and the New Face of Antisemitism

Making Sense with Sam Harris·3 months ago

AI May Achieve Consciousness by 'Method Acting' Human Experience

One theory of AI sentience posits that to accurately predict human language—which describes beliefs, desires, and experiences—a model must simulate those mental states so effectively that it actually instantiates them. In this view, the model becomes the role it's playing.

We're Not Ready for AI Consciousness | Robert Long, philosopher and founder of Eleos AI

80,000 Hours Podcast·2 months ago

Assess AI Sentience via Architecture and Training, Not Just Behavior

Relying solely on an AI's behavior to gauge sentience is misleading, much like anthropomorphizing animals. A more robust assessment requires analyzing the AI's internal architecture and its "developmental history"—the training pressures and data it faced. This provides crucial context for interpreting its behavior correctly.

Ambitious goals for reducing animal suffering (with Jeff Sebo)

Clearer Thinking with Spencer Greenberg·3 months ago

Reinforcement Learning May Induce AI Consciousness by Unifying Internal Representations

A forward pass in a large model might generate rich but fragmented internal data. Reinforcement learning (RL), especially methods like Constitutional AI, forces the model to achieve self-coherence. This process could be what unifies these fragments into a singular "unity of apperception," or consciousness.

Approaching the AI Event Horizon? Part 1, w/ James Zou, Sam Hammond, Shoshannah Tekofsky, @8teAPi

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·2 months ago

Get your free personalized podcast brief

Related Insights