RESEARCH27
Domain-level metacognitive monitoring in frontier LLMs: A 33-model atlas
arXiv CS.CLΒ·May 11, 2026
This research paper presents an atlas of domain-level metacognitive monitoring across 33 frontier LLMs, analyzing 1,500 MMLU items across six domains. It reveals significant within-model variation, with Applied/Professional knowledge being the easiest and Formal Reasoning/Natural Science the hardest domains to monitor.
Read original β