I copied the configuration line for PDF indexing from the demo server.
I installed PDFTOTXT for windows and it works fine.
In the PDF indexing configuration I put this line :
C:\xampp\htdocs\pdftotext.exe -nopgbrk %s | sed -e's/[a-zA-Z0-9.]{1} / /g'
This create the txt file but nothing more.
I did some research and the "sed" command is not available on Windows. There is however the command "where" for PowerShell but php runs cmd.exe and not PowerShell.
How to use plaintxt indexing under windows?
Sincerely
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Thanks you for the replie. I have delete the sed commande and it's works.
But when i search a terms, he dont find any document.
I can see the txt file with the PDF extraction text.
Any idea?
Last edit: Bob 2018-05-22
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hello,
I copied the configuration line for PDF indexing from the demo server.
I installed PDFTOTXT for windows and it works fine.
In the PDF indexing configuration I put this line :
C:\xampp\htdocs\pdftotext.exe -nopgbrk %s | sed -e's/[a-zA-Z0-9.]{1} / /g'
This create the txt file but nothing more.
I did some research and the "sed" command is not available on Windows. There is however the command "where" for PowerShell but php runs cmd.exe and not PowerShell.
How to use plaintxt indexing under windows?
Sincerely
The sed call is required, it just filters out some terms not worth to index. I'd suggest to try it without the sed and see if it works.
Hello,
Thanks you for the replie. I have delete the sed commande and it's works.
But when i search a terms, he dont find any document.
I can see the txt file with the PDF extraction text.
Any idea?
Last edit: Bob 2018-05-22
Sorry dupplicated message xD
Last edit: Bob 2018-05-22
Up