I have a trial version of FineReader 12 pro but still cannot work out the following issue ; I need to OCR texts containing pinyin diacritics ( o ā ɑ̄ ē ī ō ū ǖ / Ā Ē Ī Ō Ū Ǖ /á ɑ́ é í ó ú ǘ / Á É Í Ó Ú Ǘ / ǎ ɑ̌ ě ǐ ǒ ǔ ǚ / Ǎ Ě Ǐ Ǒ Ǔ Ǚ / à ɑ̀ è ì ò ù ǜ / À È Ì Ò Ù Ǜ / a ɑ e i o u ü / A E I O U o ā ɑ̄ ē ī ō ū ǖ / á ɑ́ é í ó ú ǘ /ǎ ɑ̌ ě ǐ ǒ ǔ ǚ / à ɑ̀ è ì ò ù ǜ / a ɑ e i o u ü) which the software either does not recognize or even mix up.
In previous versions of the software I tried using languages that actually have such diacritics (such as Czech), training, creating user languages and adding them to their dictionaries etc, finding no success at all.
I would really appreciate some advice on how to solve this problem, if possible, as soon as possible. would Tesseract finally work it out?
Hope to hear news soon. Thanx in advance!
Log in to post a comment.
Sign up for the SourceForge newsletter:
You seem to have CSS turned off.
Please don't fill out this field.