RESEARCH27

Linear Probes Detect Task Format, Not Reasoning Mode in Language Model Hidden States

arXiv CS.CL·June 3, 2026

This paper reveals that linear probes, often used to identify distinct reasoning representations in LLM hidden states, actually detect task format rather than reasoning modes. High accuracy observed on benchmarks with Qwen3-14B vanished when controlling for format variables, suggesting largely shared reasoning not functionally linked to hidden state geometry.

Benchmarking Natural Language Processing Model Analysis AI Research LLM

Read original ↗