RESEARCHarXiv CS.CL·27d ago
HEBATRON: A Hebrew-Specialized Open-Weight Mixture-of-Experts Language Model
Hebatron is a Hebrew-specialized open-weight large language model built on NVIDIA's Nemotron-3 Mixture-of-Experts (MoE) architecture. It achieves a 73.8% Hebrew reasoning average, outperforming competitors and offering significantly higher inference throughput by activating fewer parameters per pass.
27