RESEARCH27
MedStruct-S: A Benchmark for Key Discovery, Key-Conditioned QA and Semi-Structured Extraction from OCR Clinical Reports
arXiv CS.CLΒ·May 6, 2026
MedStruct-S is a new benchmark for semi-structured information extraction from OCR-derived clinical reports, addressing challenges like heterogeneous key representations and OCR noise. It aims to evaluate model robustness in real-world settings for key discovery, key-conditioned QA, and key-value pair extraction.
Read original β