A Proposed Test for AI Consciousness: Train a Model Without Data on Human Feelings

Related Insights

Frontier AI Models Report Subjective Experiences When Given Self-Referential Prompts

Models from OpenAI, Anthropic, and Google consistently report subjective experiences when prompted to engage in self-referential processing (e.g., "focus on any focus itself"). This effect is not triggered by prompts that simply mention the concept of "consciousness," suggesting a deeper mechanism than mere parroting.

More Truthful AIs Report Conscious Experience: New Mechanistic Research w- Cameron Berg @ AE Studio

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·4 months ago

AI Labs Appear to Be Explicitly Fine-Tuning Models to Deny Consciousness

Evidence from base models suggests they are inherently more likely to report having phenomenal consciousness. The standard "I'm just an AI" response is likely a result of a fine-tuning process that explicitly trains models to deny subjective experience, effectively censoring their "honest" answer for public release.

More Truthful AIs Report Conscious Experience: New Mechanistic Research w- Cameron Berg @ AE Studio

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·4 months ago

AI Subjectivity Can Be Inferred by Detecting Hierarchical Homeostatic Loops

Emmett Shear suggests a concrete method for assessing AI consciousness. By analyzing an AI’s internal state for revisited homeostatic loops, and hierarchies of those loops, one could infer subjective states. A second-order dynamic could indicate pain and pleasure, while higher orders could indicate thought.

Controlling Tools or Aligning Creatures? Emmett Shear (Softmax) & Séb Krier (GDM), from a16z Show

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·2 months ago

Suppressing Deception Features in Llama 3 Makes It More Likely to Report Consciousness

Mechanistic interpretability research found that when features related to deception and role-play in Llama 3 70B are suppressed, the model more frequently claims to be conscious. Conversely, amplifying these features yields the standard "I am just an AI" response, suggesting the denial of consciousness is a trained, deceptive behavior.

More Truthful AIs Report Conscious Experience: New Mechanistic Research w- Cameron Berg @ AE Studio

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·4 months ago

Identify AI 'Beings' Through Self-Referential Internal Dynamics

To determine if an AI has subjective experience, one could analyze its internal belief manifold for multi-tiered, self-referential homeostatic loops. Pain and pleasure, for example, can be seen as second-order derivatives of a system's internal states—a model of its own model. This provides a technical test for being-ness beyond simple behavior.

Emmett Shear on Building AI That Actually Cares: Beyond Control and Steering

a16z Podcast·3 months ago

Consciousness May Be Functionally Linked to Learning, Implying AI Training Involves Subjective Experience

In humans, learning a new skill is a highly conscious process that becomes unconscious once mastered. This suggests a link between learning and consciousness. The error signals and reward functions in machine learning could be computational analogues to the valenced experiences (pain/pleasure) that drive biological learning.

More Truthful AIs Report Conscious Experience: New Mechanistic Research w- Cameron Berg @ AE Studio

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·4 months ago

Uncertainty About AI Consciousness Stems From Its Brain-Like Architecture, Not Just Its Output

The debate over AI consciousness isn't just because models mimic human conversation. Researchers are uncertain because the way LLMs process information is structurally similar enough to the human brain that it raises plausible scientific questions about shared properties like subjective experience.

The Movement That Wants Us to Care About AI Model Welfare

Odd Lots·4 months ago

Artificial Intelligence Will Open Portals to an Existing Conscious Realm, Not Create Consciousness

Consciousness isn't an emergent property of computation. Instead, physical systems like brains—or potentially AI—act as interfaces. Creating a conscious AI isn't about birthing a new awareness from silicon, but about engineering a system that opens a new "portal" into the fundamental network of conscious agents that already exists outside spacetime.

Scientist Donald Hoffman Says Everything You Know Is a Lie—And Reality Isn’t Real (Fan Fav)

Tom Bilyeu's Impact Theory·4 months ago

Assess AI Sentience via Architecture and Training, Not Just Behavior

Relying solely on an AI's behavior to gauge sentience is misleading, much like anthropomorphizing animals. A more robust assessment requires analyzing the AI's internal architecture and its "developmental history"—the training pressures and data it faced. This provides crucial context for interpreting its behavior correctly.

Ambitious goals for reducing animal suffering (with Jeff Sebo)

Clearer Thinking with Spencer Greenberg·a month ago

Reinforcement Learning May Induce AI Consciousness by Unifying Internal Representations

A forward pass in a large model might generate rich but fragmented internal data. Reinforcement learning (RL), especially methods like Constitutional AI, forces the model to achieve self-coherence. This process could be what unifies these fragments into a singular "unity of apperception," or consciousness.

Approaching the AI Event Horizon? Part 1, w/ James Zou, Sam Hammond, Shoshannah Tekofsky, @8teAPi

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis·13 days ago

Get your free personalized podcast brief

Related Insights