Re: [gscan2pdf-help] gscan2pdf v1.2.6 released
From: Cédric M. <cma...@gm...> - 2014-10-07 09:07:24
Thank you very much for the new version!

> * + 'save hOCR', to save hOCR output, where available.

Great! It's exactly what I needed! Unfortunately, it doesn't work as expected.

If somebody knows how I can combine an hOCR file with an image to produce a PDF, just let me know. I tried hocr2pdf, but the result seems to be bad. I think the problem lies in hocr2pdf itself, which is why I didn't post about it here. If you are interested in the discussion, you can follow it here:
https://groups.google.com/forum/#!topic/tesseract-ocr/phSR1rCBtzg

To explain in a few words: I can't use gscan2pdf directly because my image has a colored background and the OCR result on it is poor (I use tesseract). With gscan2pdf, however, I can convert the picture to black and white, run the OCR, and save the result in hOCR format. When the image is black and white, the OCR with tesseract is good. The remaining problem is merging that hOCR with the original image that has the colored background.

If somebody has some hints, just let me know.

Regards,
Cédric
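For anyone trying the same merge by hand, here is a minimal sketch of the first step: reading the word bounding boxes out of the hOCR file so they can later be placed over the color image. It assumes the hOCR marks words as `<span class='ocrx_word' title='bbox x0 y0 x1 y1; ...'>`, as tesseract's hOCR output does, and that the black-and-white and color images have the same pixel dimensions so the boxes carry over unchanged. The names `HocrWords` and `hocr_words` are made up for this illustration; writing the text layer into a PDF would need a separate PDF library and is not shown.

```python
import re
from html.parser import HTMLParser

# hOCR stores the box in the title attribute, e.g. "bbox 10 10 60 40; x_wconf 90".
BBOX_RE = re.compile(r"bbox (\d+) (\d+) (\d+) (\d+)")


class HocrWords(HTMLParser):
    """Collect (text, (x0, y0, x1, y1)) for each ocrx_word span (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.words = []
        self._bbox = None   # bbox of the word span currently open, if any
        self._text = []     # text fragments seen inside that span

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if tag == "span" and "ocrx_word" in classes:
            match = BBOX_RE.search(attrs.get("title") or "")
            if match:
                self._bbox = tuple(int(v) for v in match.groups())
                self._text = []

    def handle_data(self, data):
        if self._bbox is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "span" and self._bbox is not None:
            text = "".join(self._text).strip()
            if text:
                self.words.append((text, self._bbox))
            self._bbox = None


def hocr_words(hocr):
    """Return a list of (word, bbox) pairs found in an hOCR string."""
    parser = HocrWords()
    parser.feed(hocr)
    parser.close()
    return parser.words
```

The boxes are pixel coordinates with the origin at the top-left corner of the page image, so a PDF writer that measures from the bottom-left would need the y values flipped against the image height.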