
OSCToM: RL-Guided Adversarial Generation for High-Order Theory of Mind

arXiv cs.AI · May 21, 2026

This paper introduces OSCToM, an approach to modeling nested belief conflicts in LLM-based Theory of Mind tasks. It combines reinforcement learning with compositional surrogate models to generate these conflicts adversarially, and the OSCToM-8B model achieves the best results in the reported experiments.

LLMs · reinforcement learning · AI Research · Theory of Mind
