Robuta

https://pymupdf.io/ PyMuPDF: The Python library for Fast Document Processing with Semantic Data Analysis PyMuPDF provides fast and powerful tools for reading, manipulating, and extracting semantic data from PDF documents, including text, images, metadata, and... python library https://pymupdf.readthedocs.io/en/latest/archive-class.html Archive - PyMuPDF documentation archivepymupdfdocumentation https://pymupdf.readthedocs.io/en/latest/coop_low.html Working together: DisplayList and TextPage - PyMuPDF documentation working togetherpymupdfdocumentation