← heapsort
RESEARCH27

Anchor: Mitigating Artifact Drift in Agent Benchmark Generation

arXiv CS.AIΒ·May 27, 2026

Anchor is a task-generation pipeline that addresses "artifact drift" in AI agent benchmark creation. It formalizes domain experts' specifications into constraint optimization programs, jointly producing consistent instructions, environments, solutions, and verifiers for business operations.

Read original β†—