← heapsort-ai

Catastrophic Forgetting

5 items

RESEARCHarXiv CS.LG·4/15/2026

A Layer-wise Analysis of Supervised Fine-Tuning

This research analyzes Supervised Fine-Tuning (SFT), revealing that instruction-following capabilities emerge distinctly across layers: middle layers are stable while final layers are highly sensitive. Leveraging this, the authors propose Mid-Block Efficient Tuning, which updates critical intermediate layers, outperforming standard LoRA with reduced parameter overhead.

27
RESEARCHarXiv CS.LG·11d ago

Mechanistic origins of catastrophic forgetting: why RL preserves circuits better than SFT?

This paper investigates the mechanistic origins of catastrophic forgetting in Large Language Models (LLMs), comparing Reinforcement Learning (RL) with Supervised Fine-Tuning (SFT). It reveals that RL preserves internal computational circuits more effectively, mitigating the forgetting of prior capabilities, unlike SFT which causes greater circuit disruption.

27