DecisionBench: A Benchmark for Emergent Delegation in Long-Horizon Agentic Workflows
arXiv CS.AI · May 20, 2026
DecisionBench is a new benchmark for emergent delegation in long-horizon agentic workflows. It comprises a fixed task suite, a peer-model pool, and a multi-axis metric suite for evaluating delegation quality and cost.