← heapsort
RESEARCH27

OmniToM: Benchmarking Theory of Mind in LLMs via Explicit Belief Modeling

arXiv CS.AIΒ·May 27, 2026

OmniToM is a new benchmark designed to evaluate Theory of Mind in LLMs by explicitly modeling belief structures. This approach moves beyond end-point question answering, allowing for a deeper analysis of mental-state representations, including divergent or mistaken beliefs.

Read original β†—