← heapsort
RESEARCH27

JobBench: Aligning Agent Work With Human Will

arXiv CS.AIΒ·May 27, 2026

JobBench is a new benchmark that evaluates AI agents on workflows identified by experts as high-priority for delegation, covering 130 tasks across 35 occupations. It aims to shift the labour-market effect from replacement to enhancement, building agents that do what humans actually want delegated.

Read original β†—