gImageReader is a simple Gtk/Qt front-end to tesseract. Features include:
- Import PDF documents and images from disk, scanning devices, clipboard and screenshots
- Process multiple images and documents in one go
- Manual or automatic recognition area definition
- Recognize to plain text or to hOCR documents
- Recognized text displayed directly next to the image
- Post-process the recognized text, including spellchecking
- Generate PDF documents from hOCR documents

**Note**: This page is only a mirror for the downloads. Development is happening on github at https://github.com/manisandro/gImageReader, release binaries are also posted there.

Features

  • Import PDF documents and images from disk, scanning devices, clipboard and screenshots
  • Process multiple images and documents in one go
  • Manual or automatic recognition area definition
  • Recognize to plain text or to hOCR documents
  • Recognized text displayed directly next to the image
  • Post-process the recognized text, including spellchecking
  • Generate PDF documents from hOCR documents

Project Samples

Project Activity

See All Activity >

Follow gImageReader

gImageReader Web Site

Other Useful Business Software

Pyramid - The Complete Self-Service Analytics Platform Pyramid - The Complete Self-Service Analytics Platform Icon
Pyramid - The Complete Self-Service Analytics Platform Icon

Easily combine data from multiple sources to build dashboards and reports with our scalable solution built for mid-size to enterprise companies.

Pyramid combines analytic tools often found in separate applications. It features a graphically based ETL to model data and incorporate machine learning algorithms, intuitive data discovery tools that produce simple and advanced visualizations, a presentation builder for creating analytic applications and presentations, and an enterprise-grade report builder—all in a single, integrated platform that can be accessed from any browser or device.

Rate This Project

Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
22
3
1
0
1
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 4 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 4 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 4 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 4 / 5

User Reviews

  • I find gImage Reader a practical and stable frontend to Tesseract OCR.Spell checking and editing text works fine for normal purposes where Tesseract OCR nowadays is very accurate.Unusual fonts as Fraktur (in heads of many longstanding newspapers) is supported and the Tesseract engine will (with effort) train any uncommon typography.The gImage frontend is stable and uncomplicated to work with when set up, and the only improvement I can think of is a way to easily close the sources pane for maximizing the head pane when reviewing and editing recognised text.A very useful program, thank you. Edit: glad to find a toggle option for the suggested improvement already exist (I'm slightly visionary impaired).The Tesseract Fraktur "language" I work with is fine but not programmed to trigger the spell checker and the output therefore interpreted with spelling errors / understrikes.This small problem (unrelated to gImageReader) could be solved by a spell checker on/off switch.Thank you again.

  • This is a great tool to do a proof of concept using tesseract.

  • Great tool, saved me from professional disaster, thank you!

  • tested on MS Windows versión, very nice, free and funtional

  • En la version anterior habían problemas, pero el soporte de Sandro fue genial!!! Me ayudó en cada momento, de manera rápida y precisa. Lo instalé en OpenSuse 13.1 y funciona de 10!!! Lo recomiendo, en linux, el mejor programa de este tipo (OCR) que me ha funcionado!

Read more reviews >

Additional Project Details

Languages

English

Intended Audience

End Users/Desktop

User Interface

Gnome, KDE, Qt, GTK+

Programming Language

C++

Registered

2009-11-29