RESEARCH27
Harnesses for Inference-Time Alignment over Execution Trajectories
arXiv CS.LGΒ·May 23, 2026
This research investigates harness engineering as an inference-time technique for large language model (LLM) agents, focusing on improving long-term performance via task decomposition and guided execution. It quantifies how design elements like workflow granularity and guidance impact performance, revealing common failure modes such as over-decomposition and hallucinated execution.
Read original β