RESEARCH28

ReAD: Reinforcement-Guided Capability Distillation for Large Language Models

arXiv CS.CL·May 13, 2026

ReAD proposes a Reinforcement-guided Capability Distillation framework for Large Language Models, aiming to compress LLMs while preserving essential abilities for downstream tasks. It explicitly accounts for the interdependence of capabilities, optimizing token budget usage and mitigating degradation of useful abilities.

Model Compression Knowledge Distillation LLMs reinforcement learning learning

Read original ↗