RESEARCH27
When Should a Language Model Trust Itself? Same-Model Self-Verification as a Conditional Confidence Signal
arXiv CS.CLΒ·May 6, 2026
This research evaluates same-model self-verification as a confidence signal for selective prediction, comparing it against likelihood-based baselines. The study reveals task- and model-dependent results, showing significant improvements for some models on ARC-Challenge but less reliability and occasional degradation on TruthfulQA-MC.
Read original β