Java Linux, Windows Sometimes I need to copy a text from a web page and it turns out to be an image, that can be copied as an image. The idea is to feed the copied image to an OCR and than paste the resulting text. We would probably rely on jocr to do th
Note as of 2013-09-13: I'm moving this project over to github due to this:
http://www.gluster.org/2013/08/how-far-the-once-mighty-sourceforge-has-fallen/
Feel free to rejoin the more updated versions on
https://github.com/mnott/PDFOCRWrapper
Thanks.
Matthias
--
This is a wrapper written in Java that allows to recursively iterate a directory structure and call an OCR engine on each found PDF on the condition that it hat not yet been called for that PDF. It works well with the ABBYY OCR Engine for Linux.
Java open source scanner for all platform. This application make the use of JSane. It also includes OCR for Thai and English characters. This project is supported and funded by Thai Life Insurance Company - A Thai Company for the Thai people (http://
Socr3 is a plugin-oriented, open source platform upon which I'm building an OCR suite. The name Socr3 stands for "Open Source Optical Character Recognition, Reading, Rendering, and Exporting", and is subject to change in the future.
Neuroph OCR - Handwriting Recognition is developed to recognize hand written letter and characters. It's engine derived's from the Java Neural Network Framework - Neuroph and as such it can be used as a standalone project or a Neuroph plug in.
SecureJDMS is the attempt to develop a secure DMS, based on an RCP and (even untrusted) DB for data storage. For now, its all about managing scanned documents and searching them by content (using OCR). All data sent and stored will be strongly encrypted.
An image postprocessor for the DIY Book Scanner described on instructables.com and diybookscanner.org. Gets images ready for OCR or for PDF. Written in Java based on a partial port of the Leptonica image processing library.
JOcrad is a graphical frontend for GNU/Ocrad written in Java.
GNU Ocrad is an OCR (Optical Character Recognition) program based on a feature extraction method.JOcrad supports italian and english languages, JPG,PNG and GIF images.
Joshi is a program that tries to recognize shapes on an image with a focus on OCR. It converts the image to vector graphics (polylines) and then tries to project these on stored vector graphics, calculating the best match.