RESEARCH27
Cultural Value Alignment Via Latent Activation Steering in Large Language Models
arXiv CS.CLΒ·May 27, 2026
This paper proposes a novel framework for evaluating and intervening in cultural value alignment within Large Language Models (LLMs), addressing their often homogenized cultural perspectives. It uses scenario-based behavioral probing and implicit token probabilities to map latent cultural values, also introducing activation steering to shift these alignments without retraining.
Read original β