Evaluating Reasoning Models for Queries with Presuppositions

arXiv CS.CL · May 6, 2026

This research evaluates how large reasoning models handle user queries containing factually inaccurate presuppositions. It finds that while reasoning models show a slight improvement over non-reasoning models, they still fail to challenge a significant fraction of false assumptions.
