← heapsort
A Structural Threshold in Decision Capacity Governs Collapse in Self-Play Reinforcement Learning

arXiv CS.LG · May 19, 2026

This paper shows that a threshold in decision capacity governs collapse in self-play reinforcement learning agents under asymmetric rule perturbations. Eliminating all positive-reach contingent decisions causes rapid convergence to a deterministic exploitation attractor, while preserving even a single such decision prevents this collapse.
