I'm using Tesseract in Ubuntu 6.10, and I like it's ability to recognize text.
I can't find anything on the Web showing how to scan more than one image at a time, though. I just scanned 15 pages of an old short story with xsane, but it looks like I have to do the OCR one page at a time.
Does anyone know how to get Tesseract to recognize text of a batch of images?
Tess only does one page at a time so you have to build your program around this.
Tess is now hosted at:
I just found a way to do this with ocube. I posted a how-to at <a href="http://ubuntuforums.org/showthread.php?t=404619&highlight=tesseract"> http://ubuntuforums.org/showthread.php?t=404619&highlight=tesseract</a>
I believe I saw ocube mentioned in this forum, but it took me quite a while to figure out how to install and use it.
Log in to post a comment.