clinical reports

1 item

RESEARCH · arXiv CS.CL · 5/6/2026

MedStruct-S: A Benchmark for Key Discovery, Key-Conditioned QA and Semi-Structured Extraction from OCR Clinical Reports

MedStruct-S is a new benchmark for semi-structured information extraction from OCR-derived clinical reports. It targets challenges such as heterogeneous key representations and OCR noise, and is designed to evaluate model robustness in real-world settings across three tasks: key discovery, key-conditioned QA, and key-value pair extraction.
