Overview

Performs OCR-based text extraction using Google Cloud Vision API. Supported formats: GIF, JPEG, PDF, PNG, TIFF.

Output generated as JSON file, loadable via json.load.

Requirements

pip3 install requirements.pip

env GOOGLE_APPLICATION_CREDENTIALS= ./textract.py --input <> --output <>.json

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENSE		LICENSE
README.md		README.md
textract.py		textract.py