RESEARCH27
A Layer-wise Analysis of Supervised Fine-Tuning
arXiv CS.LGΒ·April 15, 2026
This research analyzes Supervised Fine-Tuning (SFT), revealing that instruction-following capabilities emerge distinctly across layers: middle layers are stable while final layers are highly sensitive. Leveraging this, the authors propose Mid-Block Efficient Tuning, which updates critical intermediate layers, outperforming standard LoRA with reduced parameter overhead.
Supervised Fine-TuningLayer-wise AnalysisCatastrophic Forgettinglarge language modelsEfficient Tuning
Read original β