RESEARCH27
Rotation-Preserving Supervised Fine-Tuning
arXiv CS.LGΒ·May 13, 2026
This paper introduces Rotation-Preserving Supervised Fine-Tuning (RPSFT) to improve out-of-domain generalization in large language models while mitigating the degradation caused by standard SFT. RPSFT penalizes changes in projected singular subspaces of pretrained weights, acting as an efficient proxy for Fisher-sensitive directions and outperforming standard SFT baselines.
Read original β