Just recently scanned a pretty lenghy book with NAPS2 latest version (300dpi; Color > Save to PDF; default). The PDF was generated without errors producing ~60 Mb PDF-file with ~75 pages. The problem is that quite a few (say ~20) of those output pdf pages seemed to save "as image" (= non-searchable; which is the problem) within the pdf-file. The physical page size is pretty small and so is the printed font (that might be part of the problem).
In OCR settings the checkbox for "Make text searchable..." is on and selected language is Finnish.
The "image" pages appear in the produced pdf "randomly" here and there, I can not figure out a pattern what's causing that to happen. And yes, perhaps if I'd had used 600dpi resolution it could help, but that produces way too big files and scanning is slow.
So my questions are:
1) What PDF compatibility mode (when saving to pdf) should I use?
2) And does it (1) have any impact on the problem described?
3) What could cause this (= several pages are not text searchable) to happen (scanned pages are pretty similar) and any ways to solve this?
I can also link a sample of those PDF pages (an OK page and non-OK pdf page) if that could help sorting things out :)
Appreciated.
//Timo
Last edit: Timo 2018-09-11
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi all,
Just recently scanned a pretty lenghy book with NAPS2 latest version (300dpi; Color > Save to PDF; default). The PDF was generated without errors producing ~60 Mb PDF-file with ~75 pages. The problem is that quite a few (say ~20) of those output pdf pages seemed to save "as image" (= non-searchable; which is the problem) within the pdf-file. The physical page size is pretty small and so is the printed font (that might be part of the problem).
In OCR settings the checkbox for "Make text searchable..." is on and selected language is Finnish.
The "image" pages appear in the produced pdf "randomly" here and there, I can not figure out a pattern what's causing that to happen. And yes, perhaps if I'd had used 600dpi resolution it could help, but that produces way too big files and scanning is slow.
So my questions are:
1) What PDF compatibility mode (when saving to pdf) should I use?
2) And does it (1) have any impact on the problem described?
3) What could cause this (= several pages are not text searchable) to happen (scanned pages are pretty similar) and any ways to solve this?
I can also link a sample of those PDF pages (an OK page and non-OK pdf page) if that could help sorting things out :)
Appreciated.
//Timo
Last edit: Timo 2018-09-11
Seems I have been able to isolate the problem:
Ben, could you look into this > Move this ticket into the Bugs side perhaps...?
//timo
Last edit: Timo 2018-09-11
In your installation folder, edit appsettings.xml and change OcrTimeout to 600.
Hugely appreciated, that did THE trick!
Could even be FAQ topic for ppl saving lenghty multi-page scans?
Thx Ben, the best gets even better (the upcoming release) :)
//timo