When Should a Language Model Trust Itself? Same-Model Self-Verification as a Conditional Confidence Signal
This study evaluates same-model self-verification as a confidence signal for selective prediction and compares it against likelihood-based baselines. The results are task- and model-dependent: self-verification yields significant improvements for some models on ARC-Challenge, but it is less reliable on TruthfulQA-MC, where it occasionally causes degradation.
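
To make the selective-prediction setting concrete, the following is a minimal sketch of how two confidence signals might be compared: rank the examples by confidence, answer only the top fraction (the coverage), and measure accuracy on that subset. The function name `selective_accuracy`, the 50% coverage level, and all scores below are illustrative assumptions; they are not the study's protocol or its data.

```python
from typing import Sequence


def selective_accuracy(
    confidences: Sequence[float],
    correct: Sequence[bool],
    coverage: float,
) -> float:
    """Accuracy on the `coverage` fraction of examples with highest confidence.

    Hypothetical helper for illustration; not taken from the study.
    """
    if len(confidences) != len(correct) or len(confidences) == 0:
        raise ValueError("confidences and correct must be non-empty and equal length")
    if not 0.0 < coverage <= 1.0:
        raise ValueError("coverage must be in (0, 1]")

    # Keep at least one example so the accuracy is always defined.
    n_keep = max(1, int(coverage * len(confidences)))

    # Indices sorted from most to least confident.
    ranked = sorted(range(len(confidences)), key=lambda i: confidences[i], reverse=True)
    kept = ranked[:n_keep]
    return sum(1 for i in kept if correct[i]) / n_keep


# Toy data, invented purely for illustration (not results from the study).
correct = [True, True, False, True, False, True]
likelihood_conf = [0.9, 0.4, 0.8, 0.6, 0.7, 0.5]   # assumed likelihood-based scores
self_verify_conf = [0.9, 0.8, 0.2, 0.7, 0.3, 0.6]  # assumed self-verification scores

for name, conf in [("likelihood", likelihood_conf), ("self-verify", self_verify_conf)]:
    acc = selective_accuracy(conf, correct, coverage=0.5)
    print(f"{name}: accuracy at 50% coverage = {acc:.2f}")
```

A confidence signal is useful for selective prediction when accuracy on the retained subset exceeds accuracy at full coverage. In this toy data the self-verification scores happen to rank correct answers higher than the likelihood scores do, but, as the summary above notes, which signal wins in practice depends on the task and the model.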




