← heapsort
RESEARCH27

StateSMix: Online Lossless Compression via Mamba State Space Models and Sparse N-gram Context Mixing

arXiv CS.LGΒ·May 6, 2026

StateSMix is a self-contained lossless compressor that couples an online-trained Mamba-style State Space Model (SSM) with sparse n-gram context mixing and arithmetic coding. It is initialized from scratch and trained token-by-token on the file, requiring no pre-trained weights, GPU, or external dependencies, achieving competitive results on the enwik8 benchmark.

Read original β†—