DecisionBench: A Benchmark for Emergent Delegation in Long-Horizon Agentic Workflows
DecisionBench is introduced as a new benchmark for emergent delegation in long-horizon agentic workflows. It includes a fixed task suite, a peer-model pool, and a multi-axis metric suite to evaluate delegation quality and cost.