Skip to content

yairl/textract

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

Overview

Performs OCR-based text extraction using Google Cloud Vision API. Supported formats: GIF, JPEG, PDF, PNG, TIFF.

Output generated as JSON file, loadable via json.load.

Requirements

pip3 install requirements.pip

How to run

env GOOGLE_APPLICATION_CREDENTIALS= ./textract.py --input <> --output <>.json

About

Text extraction from various input formats

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages