OCR error yielding a double byte unicode

Quickly OCR part of the screen and save resulting text to clipboard

Brought to you by: cb4960

#179 OCR error yielding a double byte unicode

Milestone: 4.X

Status: closed

Owner: cb4960

Labels: OCR;quality;recognition (3)

Updated: 2022-03-19

Created: 2021-08-15

Creator: Sammy MoK

Private: No

I am running into intermittent OCR errors where the ' character resulted in a double byte unicode of 0x2019. The text I am running the OCR on is "MEISSNER'S CORPUSCLE". Any suggestions on what is the best way to circumvent this? At least a way to prevent double byte unicode characters in the result?

I have attached the bmp file of the source text image.

1 Attachments

OCR Error Source.bmp

Discussion

Sammy MoK - 2021-08-15

BTW, I am using Capture2Text_CLI.exe version 4.6.2 and invoking it using AutoHotKey script and getting result back in clipboard.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

cb4960 - 2022-03-19

status: open --> closed

assigned_to: cb4960
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

cb4960 - 2022-03-19

In v4.6.3, replaced Unicode single quote (’) with an ASCII single quote ('). Also replaced (“) and (”) with (").

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

OCR error yielding a double byte unicode

Quickly OCR part of the screen and save resulting text to clipboard

Milestone

Searches

Help

#179 OCR error yielding a double byte unicode

Discussion