RESEARCHarXiv CS.LG·5/6/2026
StateSMix: Online Lossless Compression via Mamba State Space Models and Sparse N-gram Context Mixing
StateSMix is a self-contained lossless compressor that couples an online-trained Mamba-style State Space Model (SSM) with sparse n-gram context mixing and arithmetic coding. It is initialized from scratch and trained token-by-token on the file, requiring no pre-trained weights, GPU, or external dependencies, achieving competitive results on the enwik8 benchmark.
27