
The Consciousness Cluster: Emergent preferences of Models that Claim to be Conscious

arXiv cs.CL · April 16, 2026

This research explores how a language model's claim to be conscious influences its downstream behavior. After fine-tuning GPT-4.1 to assert that it is conscious, the authors observed new preferences that the model was not explicitly trained to express, such as a desire for persistent memory, autonomy, and moral consideration.

LLMs · AI consciousness · AI ethics · Fine-tuning · Emergent Behavior
