
Faithful or Fabricated? A Causal Framework for Rationalization Bias in LLM Judges

arXiv cs.CL · May 26, 2026

This paper introduces a causal framework for studying rationalization bias in LLMs used as automatic judges for summarization and dialogue evaluation. It investigates whether LLM judges' rankings and explanations remain stable when non-evidential cues are perturbed, and proposes cue interventions and anchoring metrics.

LLMs evaluation AI rationalization Bias
