RESEARCHarXiv CS.LG·5/8/2026
Structural Instability of Feature Composition
This paper presents a geometric framework to analyze the instability of feature unions in Sparse Autoencoders (SAEs), particularly concerning compositional steering. It derives an asymptotic compositional-collapse threshold under a spherical dictionary model.
29