RESEARCHarXiv CS.AI·7d ago
Thinking Past the Answer: Evaluating Harmful Overthinking in Large Reasoning Models
This paper evaluates "harmful overthinking" in Large Reasoning Models, where continued reasoning after a correct answer can destabilize a correct trajectory. It introduces a protocol to distinguish verbose from harmful overthinking, finding issues in multimodal benchmarks.
27