← heapsort-ai

Model Distillation

8 items

RESEARCHarXiv CS.LG·4/15/2026

Disposition Distillation at Small Scale: A Three-Arc Negative Result

This paper details an attempt to distill behavioral dispositions into small language models (0.6B-2.3B parameters) through a distillation pipeline. Initial reported gains were later falsified due to evaluation artifacts, resulting in a negative outcome for the core hypothesis and leading to three subsequent arcs of investigation.

27
RESEARCHarXiv CS.LG·4/13/2026

Distilling Genomic Models for Efficient mRNA Representation Learning via Embedding Matching

This paper introduces a distillation framework to make large genomic foundation models for mRNA representation learning more efficient, reducing model size by 200-fold. By using embedding-level distillation, the smaller model achieves state-of-the-art performance on mRNA-related tasks, demonstrating an effective strategy for scalable biological AI.

27