RESEARCH27
CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing
arXiv CS.AIΒ·May 6, 2026
This paper introduces CreativityBench, a new benchmark to evaluate LLMs' creative reasoning abilities through affordance-based tool repurposing. It details the construction of a large-scale affordance knowledge base and the generation of 14K tasks requiring non-obvious yet physically plausible solutions.
Read original β