← heapsort-ai

machine learning training

1 items

RESEARCHarXiv CS.LG·4/23/2026

DR-Venus: Towards Frontier Edge-Scale Deep Research Agents with Only 10K Open Data

DR-Venus introduces a frontier 4B deep research agent for edge-scale deployment, trained effectively with only 10K open data. Its two-stage training recipe combines agentic supervised fine-tuning for basic capabilities and agentic reinforcement learning for improved execution reliability on long-horizon tasks, optimizing data quality and utilization.

28