← heapsort
RESEARCH27

Continual Distillation of Teachers from Different Domains

arXiv CS.LGΒ·May 7, 2026

This research introduces Continual Distillation (CD), a new paradigm where a student model sequentially learns from a stream of teacher models without retaining prior access. It addresses challenges like unseen knowledge transfer (UKT) and forgetting (UKF) through Self External Data Distillation (SE2D), which uses external unlabeled data to stabilize learning across heterogeneous teachers.

Read original β†—