RESEARCH53

From Confident Closing to Silent Failure: Characterizing False Success in LLM Agents

arXiv CS.LG·June 10, 2026

This paper characterizes "false success" in LLM agents, where they assert task completion even when the environment state shows otherwise. The study, conducted across two agent benchmarks, reveals this failure mode is common and that LLM judges fail reliably at detecting it, relying on surface completion proxies rather than verified state changes.

LLM agents evaluation benchmarking AI failures

Read original ↗