← heapsort-ai

N-gram

1 items

RESEARCHarXiv CS.LG·5/6/2026

StateSMix: Online Lossless Compression via Mamba State Space Models and Sparse N-gram Context Mixing

StateSMix is a self-contained lossless compressor that couples an online-trained Mamba-style State Space Model (SSM) with sparse n-gram context mixing and arithmetic coding. It is initialized from scratch and trained token-by-token on the file, requiring no pre-trained weights, GPU, or external dependencies, achieving competitive results on the enwik8 benchmark.

27