Evaluating Reasoning Models for Queries with Presuppositions
arXiv cs.CL · May 6, 2026
This research evaluates how large reasoning models handle user queries containing factually inaccurate presuppositions. It finds that while reasoning models show a slight improvement over non-reasoning models, they still fail to challenge a significant fraction of false assumptions.