RESEARCH
MCBench: A Multicontext Safety Assessment Benchmark for Omni Large Language Models
arXiv CS.CL · June 5, 2026
MCBench is a new benchmark designed to assess the safety of Omni Large Language Models across vision, audio, and text inputs, revealing significant challenges in integrating multiple modalities for accurate safety judgments. It highlights that current Omni LLMs lack robust cross-modal reasoning in safety-critical settings.