RESEARCH28
Long-Context Reasoning Through Proxy-Based Chain-of-Thought Tuning
arXiv CS.CLΒ·May 21, 2026
Large language models struggle with complex long-context reasoning tasks despite supporting extensive inputs. ProxyCoT is a novel training framework designed to transfer reasoning capabilities from short proxy contexts to full long contexts, outperforming strong baselines.
Read original β