← heapsort
RESEARCH27

Breaking the Illusion: When Positive Meets Negative in Multimodal Decoding

arXiv CS.LGΒ·May 11, 2026

A new training-free inference framework, Positive-and-Negative Decoding (PND), is introduced to address object hallucination in Vision-Language Models (VLMs). PND enforces visual fidelity by using a dual-path contrast mechanism, leading to state-of-the-art performance without retraining.

Read original β†—