CoT

1 item

RESEARCH · arXiv cs.LG · 15d ago

The Readout Shortcut: Positional Number Copying Dominates Arithmetic CoT Readout in Small Language Models

This study finds that small instruction-tuned language models (LMs) using Chain-of-Thought (CoT) for arithmetic often rely on a positional shortcut: they copy whichever number occupies the trailing position before the answer delimiter. The shortcut dominates the readout even when the intermediate reasoning is correct, and it significantly affects answer accuracy.
