← heapsort
RESEARCH27

Empirical Evaluation of PDF Parsing and Chunking for Financial Question Answering with RAG

arXiv CS.CLΒ·April 15, 2026

This paper addresses the challenges of automated PDF processing for AI, particularly with RAG systems, by proposing an empirical study. It evaluates various PDF parsers and chunking strategies for Question Answering in the financial domain, introducing a new benchmark called TableQuest.

Read original β†—