OSCToM: RL-Guided Adversarial Generation for High-Order Theory of Mind
arXiv CS.AI · May 21, 2026
This paper introduces OSCToM, an approach for modeling nested belief conflicts in LLM-based Theory of Mind (ToM) tasks. It uses reinforcement learning together with compositional surrogate models to generate these conflicts adversarially. In the paper's experiments, OSCToM-8B achieves the best results.