← heapsort

Distillation that keeps confidence honest

DEV.to AI · May 10, 2026

Traditional on-policy distillation (OPD) makes smaller student models overconfident because the larger teacher model has access to privileged context. New research formalizes this mismatch and proposes CaOPD to correct the resulting illusion of certainty without sacrificing accuracy gains.
