gImageReader is a simple Gtk/Qt front-end to Tesseract. Features include:
- Import PDF documents and images from disk, scanning devices, clipboard and screenshots
- Process multiple images and documents in one go
- Manual or automatic recognition area definition
- Recognize to plain text or to hOCR documents
- Recognized text displayed directly next to the image
- Post-process the recognized text, including spellchecking
- Generate PDF documents from hOCR documents

**Note**: This page is only a mirror for the downloads. Development happens on GitHub at https://github.com/manisandro/gImageReader, where release binaries are also posted.

User Ratings

  • 5 stars: 22
  • 4 stars: 3
  • 3 stars: 1
  • 2 stars: 0
  • 1 star: 1

  • Ease: 4 / 5
  • Features: 4 / 5
  • Design: 4 / 5
  • Support: 4 / 5

User Reviews

  • Stable, and a nice touch is the OCR editing facility, which enables manual correction of automated OCR errors. The program would be further enhanced by an option to output the input PDF image file as a PDF image file with the OCR result as a searchable text layer under the page image, instead of or in addition to the existing PDF output of OCR-only text without the image.

  • I find gImageReader a practical and stable frontend to Tesseract OCR. Spell checking and editing text work fine for normal purposes, where Tesseract OCR nowadays is very accurate. Unusual fonts such as Fraktur (used in the mastheads of many longstanding newspapers) are supported, and the Tesseract engine can (with effort) be trained on any uncommon typography. The gImageReader frontend is stable and uncomplicated to work with once set up, and the only improvement I can think of is a way to easily close the sources pane to maximize the head pane when reviewing and editing recognized text. A very useful program, thank you. Edit: glad to find that a toggle option for the suggested improvement already exists (I'm slightly visually impaired). The Tesseract Fraktur "language" I work with is fine but does not trigger the spell checker, so the output is shown with spelling-error underlines. This small problem (unrelated to gImageReader) could be solved by a spell checker on/off switch. Thank you again.

  • This is a great tool to do a proof of concept using tesseract.

  • I keep interesting newspaper articles by taking a picture of them with a pocket camera and running OCR on the image with my 6-year-old OCR program, which is no longer available, but to name it here would violate this site's antispam policy. I compared it to gImageReader on March 17, 2016. For my purposes, gImageReader is inferior. It could not handle the fact that the article was in 4 columns. This can easily be remedied by selecting the columns to be recognized one at a time. A photo is, however, not as clear as a scan. I could easily read this photo, and my unnameable program OCR-ed it with few errors, but gImageReader made quite a few more.

  • Great tool, saved me from professional disaster, thank you!

Additional Project Details

Languages

English

Intended Audience

End Users/Desktop

User Interface

Gnome, KDE, Qt, GTK+

Programming Language

C++

Registered

2009-11-29