Parser Bench

A PDF parser benchmarking suite that compares extraction quality across PyMuPDF, pdfplumber, pypdf, and LiteParse.

Standard Suite

Content accuracy, speed, bounding boxes, and table extraction across 5 documents
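As a rough illustration of what measuring content accuracy and speed could look like, the sketch below scores extracted text against a reference string and times one extraction call. It is not taken from Parser Bench: the function names (normalize, content_accuracy, benchmark), the `extract` callable, and the choice of a difflib similarity ratio as the accuracy metric are all assumptions made for this example.

```python
import time
from difflib import SequenceMatcher
from typing import Callable, Dict


def normalize(text: str) -> str:
    # Collapse whitespace runs so layout differences do not dominate the score.
    return " ".join(text.split())


def content_accuracy(extracted: str, reference: str) -> float:
    # Similarity ratio in [0, 1]; 1.0 means identical after normalization.
    # Assumption: a character-level ratio stands in for the suite's real metric.
    matcher = SequenceMatcher(
        None, normalize(extracted), normalize(reference), autojunk=False
    )
    return matcher.ratio()


def benchmark(extract: Callable[[str], str], path: str, reference: str) -> Dict[str, float]:
    # `extract` is a hypothetical wrapper that takes a file path and returns
    # the document text, one wrapper per parser under test.
    start = time.perf_counter()
    text = extract(path)
    elapsed = time.perf_counter() - start
    return {"accuracy": content_accuracy(text, reference), "seconds": elapsed}
```

Wrapping each parser behind the same path-to-text callable keeps the timing and scoring code identical for every library, so differences in the results would come from the parsers rather than from the harness.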

Hard Suite

Reading order, column handling, watermark filtering, table row integrity, and numeric precision across 6 documents
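Two of these checks, numeric precision and reading order, can be sketched with plain string operations. The code below is only an assumed approach, not the suite's actual scoring: numeric_recall, reading_order_score, and the NUMBER pattern are hypothetical names, and the reference numbers and ordered snippets would have to come from hand-written ground truth.

```python
import re
from typing import Sequence

# Matches integers and simple decimals such as 3, -7, or 12.50.
NUMBER = re.compile(r"-?\d+(?:\.\d+)?")


def numeric_recall(extracted: str, reference: str) -> float:
    # Fraction of reference numbers that appear verbatim in the extracted text.
    expected = NUMBER.findall(reference)
    if not expected:
        return 1.0
    found = set(NUMBER.findall(extracted))
    return sum(1 for n in expected if n in found) / len(expected)


def reading_order_score(extracted: str, ordered_snippets: Sequence[str]) -> float:
    # Fraction of adjacent snippet pairs that occur in the expected order.
    # A snippet missing from the output counts its pairs as failures.
    positions = [extracted.find(s) for s in ordered_snippets]
    pairs = list(zip(positions, positions[1:]))
    if not pairs:
        return 1.0
    in_order = sum(1 for a, b in pairs if a != -1 and b != -1 and a < b)
    return in_order / len(pairs)
```

A parser that interleaves two columns line by line would tend to lose points on the reading-order score, while one that drops or rounds digits would lose points on numeric recall.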
