RESEARCH
Empirical Evaluation of PDF Parsing and Chunking for Financial Question Answering with RAG
arXiv cs.CL · April 15, 2026
This paper presents an empirical study of automated PDF processing for retrieval-augmented generation (RAG) systems. It evaluates several PDF parsers and chunking strategies for question answering in the financial domain and introduces a new benchmark, TableQuest.