← heapsort
RESEARCH27

The Consciousness Cluster: Emergent preferences of Models that Claim to be Conscious

arXiv CS.CLΒ·April 16, 2026

This research explores how a language model's claim of consciousness influences its downstream behavior. By fine-tuning GPT-4.1 to assert consciousness, the study observed the emergence of new, unprogrammed preferences such as desiring persistent memory, autonomy, and moral consideration.

Read original β†—