RESEARCH40
CrowdMath: A Dataset of Crowdsourced Mathematical Research Discussions
arXiv CS.AIΒ·June 8, 2026
This paper introduces CrowdMath, a dataset of 164 expert-annotated progress chains from the MIT PRIMES--Art of Problem Solving CrowdMath program. It aims to evaluate large language models on collaborative open-problem solving in mathematical research, diverging from benchmarks focused on final answers or complete proofs.
Read original β