← heapsort
RESEARCH27

Revealing Interpretable Failure Modes of VLMs

arXiv CS.AIΒ·May 14, 2026

Vision-Language Models (VLMs) can exhibit catastrophic failures in real-world situations despite their broad reasoning capabilities. REVELIO is introduced as a framework to systematically uncover interpretable failure modes in VLMs by combining diversity-aware beam search and Gaussian-process Thompson Sampling to map the failure landscape.

Read original β†—