PDF Text Extraction With Python

Notes

Is your data locked up in Portable Document Format (PDF) files? In this talk, we’re going to explore methods for extracting text and other data from PDFs using readily available, open-source Python tools such as pypdf, as well as techniques such as optical character recognition (OCR) and table extraction. We will also discuss the philosophy of text extraction as a whole.
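As a rough illustration of the kind of extraction the talk covers, here is a minimal sketch using pypdf's `PdfReader` and `extract_text()`. The helper names, the `min_chars` threshold, and the idea of flagging near-empty pages as OCR candidates are assumptions made for this example, not material taken from the talk.

```python
def extract_page_texts(pdf_path: str) -> list[str]:
    """Return the extracted text of each page, in page order."""
    # Imported inside the function so the pure helper below can be used
    # even where pypdf is not installed.
    from pypdf import PdfReader

    reader = PdfReader(pdf_path)
    # Scanned, image-only pages typically have no text layer,
    # so extract_text() may yield an empty string for them.
    return [page.extract_text() or "" for page in reader.pages]


def pages_needing_ocr(page_texts: list[str], min_chars: int = 20) -> list[int]:
    """Return 1-based page numbers whose extracted text looks empty.

    min_chars is an arbitrary illustrative threshold, not a pypdf setting.
    """
    return [
        number
        for number, text in enumerate(page_texts, start=1)
        if len(text.strip()) < min_chars
    ]
```

Calling `extract_page_texts("example.pdf")` (a hypothetical file name) and passing the result to `pages_needing_ocr` gives a list of pages that probably need an OCR pass because direct extraction found little or no text.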

