Structural Instability of Feature Composition
This paper presents a geometric framework for analyzing the instability of feature unions in sparse autoencoders (SAEs), with a focus on compositional steering. Under a spherical dictionary model, it derives an asymptotic compositional-collapse threshold.