RESEARCH29
Exploring Autonomous Agentic Data Engineering for Model Specialization
arXiv CS.CLΒ·June 1, 2026
This paper introduces 'Autonomous Agentic Data Engineering,' a novel task to evaluate LLMs as autonomous data engineers for model specialization through end-to-end data curation. Experiments show autonomous LLM data engineers achieve substantial gains, with GPT-5.2 improving a student model by 57.29%.
Read original β