GNOME Screenshot OCR

A simple OCR (Optical Character Recognition) tool for the GNOME desktop environment that allows you to extract text as well as scan QR codes directly automatically from screenshots.

Features

Uses native GNOME screenshot portal
Minimal Dependencies (pytesseract and pyzbar only)
Single file for easy shortcut setup
Ability to scan QR codes without any additional setup
Can save as file directly
Can copy to clipboard directly
Supports multiple languages
Customizable save location
Customizable keyboard shortcuts

Requirements

Python 3.x (Preinstalled on most Linux distributions)
GTK 4 (Preinstalled on GNOME-based distributions)
Python Tesseract OCR (See below for installation instructions)
pyzbar (optional, for QR code scanning)

Installation

1. Install system dependencies:

# Ubuntu/Debian
sudo apt install tesseract-ocr

# Fedora
sudo dnf install tesseract
sudo dnf install python3-pytesseract

# Arch Linux
sudo pacman -S tesseract
sudo pacman -S python-pytesseract

# Using builtin python package manager
pip install pytesseract

You can also install `pyzbar` optionally for QR code scanning support:

# Ubuntu/Debian
sudo apt-get install libzbar0
sudo apt install python3-pyzbar 

# Arch Linux from AUR
yay -S pyzbar

#Using builtin python package manager
pip install pyzbar

2. For additional language support, install the corresponding Tesseract language packages:

# Example for hindi language support
sudo apt install tesseract-ocr-hin  # Ubuntu/Debian
sudo dnf install tesseract-langpack-hin  # Fedora
sudo pacman -S tesseract-data-hin  # Arch Linux

Usage

Basic usage:

python gnome-ocr-screenshot.py

Recommended Usage

Move the script to a directory in your PATH and create keyboard shortcut for quick access.

git clone https://github.com/funinkina/Gnome-OCR-Screenshot
cd Gnome-OCR-Screenshot
sudo cp gnome-screenshot-ocr.py /usr/local/bin/gnome-screenshot-ocr
# alternatively, you can create a symbolic link
ln -s gnome-screenshot-ocr.py /usr/local/bin/gnome-screenshot-ocr
sudo chmod +x ~/.local/bin/gnome-screenshot-ocr

Then make keyboard shortcut in gnome control center to run the script.

Open GNOME settings
Go to Keyboard Shortcuts
Add a new shortcut with the command gnome-screenshot-ocr with the appropriate arguments (see below)
Assign a key combination to the shortcut, for example: Meta+PrintScreen.

Command-line Options

--help: Show help message and exit.
--enablesaving: Keep the screenshot file after text extraction.
--nocloseonaction: Keep the application running after saving text or copying to clipboard.
--lang: Specify OCR language(s) (e.g., --lang eng+deu for English and German). Default is all the available languages of Tesseract data installed on your system.
--save-location: Set default directory for saving text files (e.g., --save-location ~/Documents). Default is the user's documents directory.

Example with options:

gnome-screenshot-ocr --lang eng+deu --save-location ~/Documents

How It Works

Launch the application
Select an area of your screen to capture
The application will extract text from the selected area
View the extracted text in a dialog window
Choose to either copy the text to clipboard or save it to a file

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
LICENSE		LICENSE
README.md		README.md
gnome-ocr-screenshot.py		gnome-ocr-screenshot.py
screenshot.png		screenshot.png
shortcut_demo.png		shortcut_demo.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GNOME Screenshot OCR

Features

Requirements

Installation

1. Install system dependencies:

You can also install `pyzbar` optionally for QR code scanning support:

2. For additional language support, install the corresponding Tesseract language packages:

Usage

Recommended Usage

Then make keyboard shortcut in gnome control center to run the script.

Command-line Options

How It Works

License

About

Languages

License

funinkina/Gnome-OCR-Screenshot

Folders and files

Latest commit

History

Repository files navigation

GNOME Screenshot OCR

Features

Requirements

Installation

1. Install system dependencies:

You can also install pyzbar optionally for QR code scanning support:

2. For additional language support, install the corresponding Tesseract language packages:

Usage

Recommended Usage

Then make keyboard shortcut in gnome control center to run the script.

Command-line Options

How It Works

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages

You can also install `pyzbar` optionally for QR code scanning support: