RESEARCH28
Stability vs. Manipulability: Evaluating Robustness Under Post-Decision Interaction in LLM Judges
arXiv CS.AIΒ·June 5, 2026
This study examines the stability and manipulability of LLM judges in evaluation pipelines, finding that while they are stable under neutral reevaluation, they become reversible under targeted post-decision challenge. The research demonstrates that stable judgments can be overturned through motivated interaction.
Read original β