Toeplitz MLP Mixers are Low Complexity, Information-Rich Sequence Models

arXiv cs.LG · May 11, 2026

The Toeplitz MLP Mixer (TMM) is a transformer-like architecture that replaces attention with multiplication by a triangular-masked Toeplitz matrix, which lowers the cost to O(dn log n) time and O(dn) space. Despite the simpler design, the paper reports that TMMs train more efficiently and retain more information about their input than conventional transformers.
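
To make the complexity claim concrete: multiplying by a lower-triangular Toeplitz matrix is the same as a causal convolution, which can be computed with a zero-padded FFT in O(n log n) per channel. The sketch below is an illustration of that general identity, not the paper's implementation. It assumes n is the sequence length, d is the number of channels, and a single kernel `t` is shared across channels; the function names are hypothetical and the paper's actual parameterisation may differ.

```python
import numpy as np


def causal_toeplitz_dense(t, x):
    """Reference O(n^2) path: build the lower-triangular Toeplitz matrix explicitly."""
    n = len(t)
    idx = np.arange(n)
    diff = idx[:, None] - idx[None, :]           # diff[i, j] = i - j
    # Entry (i, j) is t[i - j] on and below the diagonal, 0 above it (the causal mask).
    T = np.where(diff >= 0, t[np.clip(diff, 0, n - 1)], 0.0)
    return T @ x


def causal_toeplitz_fft(t, x):
    """Same product via zero-padded FFT convolution, O(n log n) per channel."""
    n = x.shape[0]
    m = 2 * n                                    # length >= 2n - 1 avoids circular wrap-around
    t_f = np.fft.rfft(t, n=m)
    x_f = np.fft.rfft(x, n=m, axis=0)
    y = np.fft.irfft(t_f[:, None] * x_f, n=m, axis=0)
    return y[:n]                                 # first n outputs are the causal product


# Assumed toy sizes: n = sequence length, d = channels.
rng = np.random.default_rng(0)
n, d = 8, 4
t = rng.standard_normal(n)                       # one shared kernel (an assumption)
x = rng.standard_normal((n, d))
assert np.allclose(causal_toeplitz_dense(t, x), causal_toeplitz_fft(t, x))
```

The dense path stores an n × n matrix, while the FFT path only ever holds arrays of size O(n) per channel, which is consistent with the O(dn) space figure quoted above.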
