RESEARCH27

When Does LLM Self-Correction Help? A Control-Theoretic Markov Diagnostic and Verify-First Intervention

arXiv CS.AI·April 27, 2026

This research frames LLM self-correction as a cybernetic feedback loop, using a two-state Markov model to determine when iterative refinement helps versus hurts. It identifies a critical EIR threshold (<= 0.5%) separating beneficial from harmful self-correction, showing that only a few models improve, while others like GPT-5 degrade.

LLMs self-correction Benchmarking AI agents

Read original ↗