Menu

#37 columns

4.X
pending
nobody
2022-07-13
2018-02-02
jon
No

I'm OCRing a table of contents, in three columns: an item number, item name, and page number. I'd like to grab a bunch of rows at a time. What happens is that the program is assuming I'm reading a newspaper, and it reassembles all the item numbers in one row, followed by all the item names, and finally all the page number, requiring meticulous cutting and pasting to assemble the way I want. One work around would be to grab rows - an item number, item name, and page number - one at a time. Is there anyway to tell the program that I'm not reading a newspaper, but rather a table of contents? I bumped into the same problem when I OCRed some assembly language code.

Discussion

  • cb4960

    cb4960 - 2018-04-20
    • labels: --> enhancement
    • status: open --> pending
    • Milestone: 4.5.0 --> 4.X
     
  • Quantillion

    Quantillion - 2022-07-13

    Tesseract Command Line offers tesseract images/2col.png - --psm 3
    Would this work with Capture2Text_CLI.exe? Maybe add it to Options/Settings/Output format?
    Great tool still!

     

Log in to post a comment.