Tag: tesseract

23 Jul

PyPDFOCR - Guide to Usage

  • Take a scanned PDF file and run OCR on it (using the Tesseract OCR software from Google), generating a searchable PDF
  • Optionally, watch a folder for incoming scanned PDFs and automatically run OCR on them
  • Optionally, file the scanned PDFs into directories based on simple keyword matching that you specify …

22 Jul

PyPDFOCR - A Python Script for Free OCR on Your PDFs using Tesseract

skip_better dictionary

Updates:

  • Anytime - Get the latest updates on PyPDFOCR
  • 10/28/13 - Adds uploading to Evernote notebooks based on keywords!
  • 10/25/13 - Supports filing to directories based on keyword search
  • 10/22/13 - Now on PyPI, so you can just do "pip install pypdfocr"! (For windows, I still recommend downloading …

© Virantha Ekanayake. Built using Pelican. Modified svbhack theme, based on theme by Carey Metcalfe