Thanks for docFetcher.
Feature request:
Would you be so kind as to integrate true PDF content search? The best tool I know of is https://pdfgrep.org/
I have PDF examples where DocFetcher 1.1.16 (and other tools such as grepWin) does not find the text content, while pdfgrep does.
P.S. There is an outdated pdfgrep version for Windows at http://soft.rubypdf.com/software/pdfgrep-windows-version
Last edit: phil 2015-09-16
What exactly is "true pdf content search"? If you're talking about searching for words in the contents of PDF files, DocFetcher already does that.
Only for some PDFs; that is what all the discussions I had with the PDF creator and the pdfgrep developers were about.
How can I send you a sample PDF where DocFetcher won't perform the content text search?
DocFetcher's PDF content extraction relies on a third-party component called Apache PDFBox, so if it doesn't work on certain files for some reason, there isn't much I can do about it anyway.
In any case, my (reversed) address is: users.sourceforge.net <- qforce@
Last edit: Nam-Quang Tran 2015-09-22
Hmm, sorry, it seems I built the index incorrectly. DocFetcher does indeed find text within searchable PDFs (contrary to grepWin), but you should still look at the new --warn-empty option from https://pdfgrep.org/news.html
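
For anyone wondering what a warning like --warn-empty is useful for: as far as I understand it, the idea is to flag PDFs that yield no searchable text at all (for example scanned, image-only files), so that a failed search is not mistaken for "the word is not in there". Below is a minimal Python sketch of that check. It is not pdfgrep's or DocFetcher's actual code; the function name `files_without_text` and the `sample` dictionary are hypothetical, and the text is assumed to have already been extracted by some other tool.

```python
def files_without_text(extracted):
    """Return the names of documents whose extracted text is empty or whitespace.

    `extracted` maps a file name to the text an extractor returned for it.
    This is a hypothetical input: no real PDF library is called here.
    """
    return sorted(
        name for name, text in extracted.items() if not (text or "").strip()
    )


# Hypothetical extraction results, made up for illustration.
sample = {
    "report.pdf": "Quarterly results ...",
    "scan.pdf": "   \n",   # only whitespace came out of the extractor
    "blank.pdf": "",       # nothing came out at all
}

for name in files_without_text(sample):
    print(f"warning: {name} contains no searchable text")
```

A desktop indexer could run a check of this kind at indexing time and list the affected files, so the user knows they would need OCR before they can be searched.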