RESEARCH27
Adaptive Test-Time Compute Allocation with Evolving In-Context Demonstrations
arXiv CS.AIΒ·April 25, 2026
This work introduces an innovative framework for adaptive test-time compute allocation, jointly adjusting where computation is spent and how generation is performed. The method uses a warm-up phase to identify easy queries and then concentrates further computation on unresolved queries, reshaping generation distributions with evolving in-context demonstrations.
Read original β