RESEARCH27
Efficient Matrix Implementation for Rotary Position Embedding
arXiv CS.LGΒ·April 14, 2026
This research proposes RoME, a novel and computationally efficient reformulation of Rotary Position Embedding (RoPE), a core component in modern Transformer architectures. By replacing vector-level operations with unified matrix transformations, RoME significantly reduces computational overhead and improves hardware utilization.
Matrix operationsRotary Position EmbeddingNPU optimizationComputational EfficiencyTransformer architectures
Read original β