← heapsort

Can Generalist Agents Automate Data Curation?

arXiv CS.AI · June 4, 2026

Evaluated on the new Curation-Bench benchmark, generalist coding agents show potential for automating the labor-intensive process of data curation for AI development. The agents achieve strong baselines, but they exhibit an "execution-research gap": they primarily refine existing policies instead of exploring novel approaches.
