PDF parser benchmarking suite comparing extraction quality across PyMuPDF, pdfplumber, pypdf, and LiteParse
Content accuracy, speed, bounding boxes, and table extraction across 5 documents
Reading order, column handling, watermark filtering, table row integrity, and numeric precision across 6 documents
View source on GitHub