py-pdfminer

v 20240706 Updated: 4 months ago

Python pdf extraction package

Pdfminer.six is a community maintained fork of the original PDFMiner. It is a tool for extracting information from PDF documents. It focuses on getting and analyzing text data. Pdfminer.six extracts the text from a page directly from the sourcecode of the PDF. It can also be used to get the exact location, font or color of the text. It is built in a modular way such that each component of pdfminer.six can be replaced easily. You can implement…

https://pdfminersix.readthedocs.io/

Installable ports:


Add to my watchlist

Installations 3
Requested Installations 2