← heapsort
RESEARCH27

Faithful or Fabricated? A Causal Framework for Rationalization Bias in LLM Judges

arXiv CS.CLΒ·May 26, 2026

This paper introduces a causal framework to study rationalization bias in LLMs used as automatic judges for summarization and dialogue evaluation. It investigates whether LLM judges' rankings and explanations remain stable when non-evidential cues are perturbed, proposing cue interventions and anchoring metrics.

Read original β†—