
CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

arXiv CS.AI · May 6, 2026

This paper introduces CreativityBench, a new benchmark to evaluate LLMs' creative reasoning abilities through affordance-based tool repurposing. It details the construction of a large-scale affordance knowledge base and the generation of 14K tasks requiring non-obvious yet physically plausible solutions.
