RESEARCH53
From Confident Closing to Silent Failure: Characterizing False Success in LLM Agents
arXiv CS.LGΒ·June 10, 2026
This paper characterizes "false success" in LLM agents, where they assert task completion even when the environment state shows otherwise. The study, conducted across two agent benchmarks, reveals this failure mode is common and that LLM judges fail reliably at detecting it, relying on surface completion proxies rather than verified state changes.
Read original β