RESEARCH27
Toeplitz MLP Mixers are Low Complexity, Information-Rich Sequence Models
arXiv CS.LGΒ·May 11, 2026
The Toeplitz MLP Mixer (TMM) is a new transformer-like architecture that replaces attention with triangular-masked Toeplitz matrix multiplication, significantly reducing computational complexity to O(dn log n) time and O(dn) space. TMMs demonstrate superior training efficiency and better input information retention compared to traditional transformers, despite their simpler design.
Read original β