Provides optical character recognition (OCR) solutions for the Vietnamese language.
Features
- Java & .NET GUI frontends for Tesseract OCR engine
- Supports all languages provided by Tesseract
- Supports automatic download and installation of language packs
- Supports PDF, TIFF, JPEG, GIF, PNG, and BMP image formats
- Paste image from clipboard
- Selection box for Region of Interest (ROI)
- File drag-and-drop
- Bulk & batch operations
- Text replacement postprocessing
- Integrated scanning support
- Spellcheck with Hunspell
License
Apache License V2.0
User Reviews
-
Very poor UX. You start the software... and THEN WHAT? How do you train Tesseract? That was the whole point of installing this software. Very very poor UX. No idea what this software even does.
-
Quan Nguyeng, I've been following your VietOCR project for 2 years. You constantly update your program. Using the 'bulk' option in version 6.3.1 (I'll update to 6.4.0) under Windows 11, VietOCR took less than 13 minutes to perfectly extract text from 584 .jpg files (396MB) to 584 .txt files (453Kb). Your VietOCR program is totally amazing! I've recommended VietOCR to relatives working in Laos and Myanmar. If there is any way that I could make a donation (PayPal address maybe?), please let me know. I'm very grateful for your hard work. namitutonka - Denver, Colorado
-
Dears, Great GUI :) The latest version, beta or not, can't fetch the tessdata *.traineddata files because Tesseract changed their address. I had to do it manually :) Please fix :)
-
Worked almost out of the box for Greek polytonic OCR. (Not an easy case, trust me.) I feel excited. Thank you "nguyeng"
-
A very good program for recognizing text from scanned documents. Easy to use. A drawback for some may be that the output format is a text file rather than "Word". Nevertheless, the program performs very well at character recognition (including texts in Polish).