Capture2Text / Tickets / #37 columns

#37 columns

Milestone: 4.X

Status: pending

Owner: nobody

Labels: enhancement (18)

Updated: 2022-07-13

Created: 2018-02-02

Creator: jon

Private: No

I'm OCRing a table of contents, in three columns: an item number, item name, and page number. I'd like to grab a bunch of rows at a time. What happens is that the program is assuming I'm reading a newspaper, and it reassembles all the item numbers in one row, followed by all the item names, and finally all the page number, requiring meticulous cutting and pasting to assemble the way I want. One work around would be to grab rows - an item number, item name, and page number - one at a time. Is there anyway to tell the program that I'm not reading a newspaper, but rather a table of contents? I bumped into the same problem when I OCRed some assembly language code.

Discussion

cb4960 - 2018-04-20

labels: --> enhancement

status: open --> pending

Milestone: 4.5.0 --> 4.X
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Quantillion - 2022-07-13

Tesseract Command Line offers tesseract images/2col.png - --psm 3
Would this work with Capture2Text_CLI.exe? Maybe add it to Options/Settings/Output format?
Great tool still!

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

columns

Quickly OCR part of the screen and save resulting text to clipboard

Milestone

Searches

Help

#37 columns

Discussion