I have a trial version of FineReader 12 pro but still cannot work out the following issue ; I need to OCR texts containing pinyin diacritics ( o ā ɑ̄ ē ī ō ū ǖ / Ā Ē Ī Ō Ū Ǖ /á ɑ́ é í ó ú ǘ / Á É Í Ó Ú Ǘ / ǎ ɑ̌ ě ǐ ǒ ǔ ǚ / Ǎ Ě Ǐ Ǒ Ǔ Ǚ / à ɑ̀ è ì ò ù ǜ / À È Ì Ò Ù Ǜ / a ɑ e i o u ü / A E I O U o ā ɑ̄ ē ī ō ū ǖ / á ɑ́ é í ó ú ǘ /ǎ ɑ̌ ě ǐ ǒ ǔ ǚ / à ɑ̀ è ì ò ù ǜ / a ɑ e i o u ü) which the software either does not recognize or even mix up.
In previous versions of the software I tried using languages that actually have such diacritics (such as Czech), training, creating user languages and adding them to their dictionaries etc, finding no success at all.
I would really appreciate some advice on how to solve this problem, if possible, as soon as possible. would Tesseract finally work it out?
Hope to hear news soon. Thanx in advance!
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi !
I have a trial version of FineReader 12 pro but still cannot work out the following issue ; I need to OCR texts containing pinyin diacritics ( o ā ɑ̄ ē ī ō ū ǖ / Ā Ē Ī Ō Ū Ǖ /á ɑ́ é í ó ú ǘ / Á É Í Ó Ú Ǘ / ǎ ɑ̌ ě ǐ ǒ ǔ ǚ / Ǎ Ě Ǐ Ǒ Ǔ Ǚ / à ɑ̀ è ì ò ù ǜ / À È Ì Ò Ù Ǜ / a ɑ e i o u ü / A E I O U o ā ɑ̄ ē ī ō ū ǖ / á ɑ́ é í ó ú ǘ /ǎ ɑ̌ ě ǐ ǒ ǔ ǚ / à ɑ̀ è ì ò ù ǜ / a ɑ e i o u ü) which the software either does not recognize or even mix up.
In previous versions of the software I tried using languages that actually have such diacritics (such as Czech), training, creating user languages and adding them to their dictionaries etc, finding no success at all.
I would really appreciate some advice on how to solve this problem, if possible, as soon as possible. would Tesseract finally work it out?
Hope to hear news soon. Thanx in advance!