RESEARCH

Distilling Genomic Models for Efficient mRNA Representation Learning via Embedding Matching

arXiv cs.LG · April 13, 2026

This paper introduces a distillation framework that makes large genomic foundation models for mRNA representation learning more efficient, reducing model size 200-fold. Trained with embedding-level distillation, in which the small model learns to reproduce the large model's embeddings, the smaller model achieves state-of-the-art performance on mRNA-related tasks, demonstrating an effective strategy for scalable biological AI.
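
As a rough illustration of what embedding-level distillation can look like, the sketch below computes a mean-squared-error loss between a teacher embedding and a linearly projected student embedding. The summary does not state the paper's actual objective, so the MSE loss, the linear projection, and the names `project` and `embedding_matching_loss` are all assumptions made for illustration only.

```python
from typing import List

Vector = List[float]


def project(student_emb: Vector, weights: List[Vector]) -> Vector:
    """Map a student embedding into the teacher's dimension.

    `weights` is a hypothetical (teacher_dim x student_dim) projection
    matrix; in practice it would be learned jointly with the student.
    """
    return [sum(w * x for w, x in zip(row, student_emb)) for row in weights]


def embedding_matching_loss(
    teacher_emb: Vector, student_emb: Vector, weights: List[Vector]
) -> float:
    """Mean squared error between teacher and projected student embeddings.

    Assumed objective: the summary only says "embedding-level distillation",
    so MSE is one plausible choice, not necessarily the paper's.
    """
    projected = project(student_emb, weights)
    if len(projected) != len(teacher_emb):
        raise ValueError("projection output must match the teacher dimension")
    squared_errors = [(t - p) ** 2 for t, p in zip(teacher_emb, projected)]
    return sum(squared_errors) / len(teacher_emb)


if __name__ == "__main__":
    # Toy values: a 4-dimensional teacher and a 2-dimensional student.
    teacher = [1.0, 0.0, 2.0, 1.0]
    student = [1.0, 2.0]
    weights = [[1.0, 0.0], [0.0, 0.0], [0.0, 1.0], [0.0, 0.0]]
    print(embedding_matching_loss(teacher, student, weights))
```

Minimizing a loss of this kind pushes the student's representation toward the teacher's without requiring task labels, which is why the student can be much smaller than the model it imitates.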

mRNA · Foundation Models · Model Distillation · Representation Learning · Genomic Models
