RESEARCH27
Faithful or Fabricated? A Causal Framework for Rationalization Bias in LLM Judges
arXiv CS.CLΒ·May 26, 2026
This paper introduces a causal framework to study rationalization bias in LLMs used as automatic judges for summarization and dialogue evaluation. It investigates whether LLM judges' rankings and explanations remain stable when non-evidential cues are perturbed, proposing cue interventions and anchoring metrics.
Read original β