afewthingz.com - A few thingz

Description: Personal blog of Joseph Basquin. Music, math, software development and a few other things.

music (51679) software (33394) development (16059) audio (11180) python (3829) math (2147) dsp (524) vst (251) machinelearning (43) bigpicture (6)

Example domain paragraphs

There are many solutions to work on PDF files with Python. Depending on whether you need to read, parse data, extract tables, modify (split, merge, crop...), or create a new PDF, you will need different tools.

Here is a quick diagram of some common tools I have used:

PyPDF2 is a free and open-source pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files. It can also add custom data, viewing options, and passwords to PDF files. PyPDF2 can retrieve text and metadata from PDFs as well.