RESEARCH27
JobBench: Aligning Agent Work With Human Will
arXiv CS.AI · May 27, 2026
JobBench is a new benchmark that evaluates AI agents on workflows that experts identify as high-priority for delegation, covering 130 tasks across 35 occupations. It aims to shift the labour-market effect of agents from replacing workers to enhancing their work, by building agents that do what humans actually want delegated.