RESEARCH29
Structural Instability of Feature Composition
arXiv CS.LGΒ·May 8, 2026
This paper presents a geometric framework to analyze the instability of feature unions in Sparse Autoencoders (SAEs), particularly concerning compositional steering. It derives an asymptotic compositional-collapse threshold under a spherical dictionary model.
Read original β