
Revealing Interpretable Failure Modes of VLMs

arXiv cs.AI · May 14, 2026

Despite their broad reasoning capabilities, Vision-Language Models (VLMs) can fail catastrophically in real-world situations. REVELIO is a framework for systematically uncovering interpretable failure modes in VLMs. It combines diversity-aware beam search with Gaussian-process Thompson sampling to map the failure landscape.
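For readers unfamiliar with Gaussian-process Thompson sampling, the following is a minimal, generic sketch of the technique on a toy one-dimensional problem. It is not REVELIO's implementation: the summary gives no algorithmic details, so the function `toy_failure_rate`, the RBF kernel, the candidate grid, and every parameter value here are illustrative assumptions. The idea is to fit a Gaussian process to the failure rates observed so far, draw one random function from its posterior, and probe next wherever that draw predicts the most failures.

```python
import numpy as np


def rbf_kernel(a, b, length_scale=0.2):
    """Squared-exponential kernel between two 1-D arrays of points."""
    diff = a[:, None] - b[None, :]
    return np.exp(-0.5 * (diff / length_scale) ** 2)


def gp_posterior(x_obs, y_obs, x_cand, noise=1e-3):
    """Posterior mean and covariance of a zero-mean GP at x_cand."""
    k_oo = rbf_kernel(x_obs, x_obs) + noise * np.eye(len(x_obs))
    k_co = rbf_kernel(x_cand, x_obs)
    k_cc = rbf_kernel(x_cand, x_cand)
    mean = k_co @ np.linalg.solve(k_oo, y_obs)
    cov = k_cc - k_co @ np.linalg.solve(k_oo, k_co.T)
    return mean, cov


def toy_failure_rate(x):
    """Hypothetical stand-in for a model's failure rate under condition x."""
    return np.exp(-((x - 0.7) ** 2) / 0.01)


def thompson_search(n_rounds=25, seed=0):
    """Run GP Thompson sampling over a fixed grid of candidate conditions."""
    rng = np.random.default_rng(seed)
    x_cand = np.linspace(0.0, 1.0, 101)
    x_obs = np.array([0.0, 0.5, 1.0])  # assumed initial probes
    y_obs = toy_failure_rate(x_obs)
    for _ in range(n_rounds):
        mean, cov = gp_posterior(x_obs, y_obs, x_cand)
        # Symmetrize and add jitter so sampling is numerically stable.
        cov = 0.5 * (cov + cov.T) + 1e-6 * np.eye(len(x_cand))
        # One joint draw from the posterior over all candidates.
        sample = rng.multivariate_normal(mean, cov, check_valid="ignore")
        # Probe the candidate where the sampled function is largest.
        nxt = x_cand[np.argmax(sample)]
        x_obs = np.append(x_obs, nxt)
        y_obs = np.append(y_obs, toy_failure_rate(nxt))
    return x_obs, y_obs


if __name__ == "__main__":
    xs, ys = thompson_search()
    print("highest observed failure rate:", ys.max(), "at x =", xs[ys.argmax()])
```

Because each round acts on a random posterior draw rather than the posterior mean, the search balances probing uncertain regions against revisiting regions already known to fail. In a real setting the scalar `x` would presumably be replaced by some representation of an input condition, and the toy function by actual model evaluations.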

Tags: failure modes, AI models, VLMs, reliability, interpretable AI
