OmniToM: Benchmarking Theory of Mind in LLMs via Explicit Belief Modeling
arXiv CS.AI · May 27, 2026
OmniToM is a new benchmark designed to evaluate Theory of Mind in LLMs by explicitly modeling belief structures. This approach moves beyond end-point question answering, allowing for a deeper analysis of mental-state representations, including divergent or mistaken beliefs.