RESEARCH27

Cultural Value Alignment Via Latent Activation Steering in Large Language Models

arXiv CS.CL·May 27, 2026

This paper proposes a novel framework for evaluating and intervening in cultural value alignment within Large Language Models (LLMs), addressing their often homogenized cultural perspectives. It uses scenario-based behavioral probing and implicit token probabilities to map latent cultural values, also introducing activation steering to shift these alignments without retraining.

LLMs Cultural Alignment AI ethics Value Systems activation steering

Read original ↗