← heapsort
RESEARCH27

The Impact of Vocabulary Overlaps on Knowledge Transfer in Multilingual Machine Translation

arXiv CS.CLΒ·May 7, 2026

This paper systematically investigates the impact of joint and disjoint vocabularies on knowledge transfer in multilingual neural machine translation (MNMT). Experiments show that extensive vocabulary overlaps, language relatedness, and domain-match lead to better performance, even in out-of-domain setups.

Read original β†—