Provides optical character recognition (OCR) solutions for Vietnamese language.

Features

  • Java & .NET GUI frontends for Tesseract OCR engine
  • Supports all languages provided by Tesseract
  • Supports automatic download and installation of language packs
  • PDF, TIFF, JPEG, GIF, PNG, BMP image formats
  • Paste image from clipboard
  • Selection box for Region of Interest (ROI)
  • File drag-and-drop
  • Bulk & batch operations
  • Text replacement postprocessing
  • Integrated scanning support
  • Spellcheck with Hunspell

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow VietOCR

VietOCR Web Site

Other Useful Business Software
Expand, Onboard, Manage & Pay global teams on a single platform. Icon
Expand, Onboard, Manage & Pay global teams on a single platform.

Thousands of companies partner with Atlas to manage Global Remote Teams.

For companies looking for a solution to expand across borders, onboard talent, manage compliance, and pay their global workforce. Our cloud-based HR platform connects core HR, payments management, flexible talent management, benefits administration, and people & country analytics to help you deliver exceptional employee experiences.
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
15
4
0
1
2
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 4 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 4 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 4 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 4 / 5

User Reviews

  • Quan Nguyeng, I've been following your VietOCR project for 2 years. You constantly update your program. Using the 'bulk' option in version 6.3.1 (I'll update to 6.4.0) under Windows 11, VietOCR took less than 13 minutes to perfectly extract text from 584 .jpg files (396MB) to 584 .txt files (453Kb). Your VietOCR program is totally amazing! I've recommended VietOCR to relatives working in Laos and Myanmar. If there is any way that I could make a donation (PayPal address maybe?), please let me know. I'm very grateful for your hard work. namitutonka - Denver, Colorado
  • Dears, Great GUI :) Latest version either beta or not can't fetch the tessdata *.traineddata files because tesseract changed their address. I had to do it manually :) please fix :)
  • Worked almost out of the box for greek polytonic OCR. (Not an easy case, trust me). I feel excited. Thank you "nguyeng"
  • Bardzo dobry program do rozpoznawania tekstu z zeskanowanych dokumentów. Łatwy w użyciu. Mankamentem dla niektórych może być fakt, że wyjściowym formatem jest plik tekstowy, a nie "Word". Niemniej program spisuje się bardzo dobrze przy rozpoznawaniu znaków (także tekstów w języku polskim).
  • Отличная программа для распознавания документов. На мой личный взгляд лучшая среди опенсоурсных. Наиболее качественно написана, то есть меньше всего ошибок возникает при работе.
Read more reviews >

Additional Project Details

Languages

Dutch, Persian, Polish, Lithuanian, Czech, Italian, Catalan, Vietnamese, English, Slovak, Turkish, Hindi, Japanese, Russian

Intended Audience

Developers, End Users/Desktop

User Interface

Java Swing, .NET/Mono

Programming Language

C#, Java

Registered

2008-06-09