← heapsort
RESEARCH40

CrowdMath: A Dataset of Crowdsourced Mathematical Research Discussions

arXiv CS.AIΒ·June 8, 2026

This paper introduces CrowdMath, a dataset of 164 expert-annotated progress chains from the MIT PRIMES--Art of Problem Solving CrowdMath program. It aims to evaluate large language models on collaborative open-problem solving in mathematical research, diverging from benchmarks focused on final answers or complete proofs.

Read original β†—