Hi,
Is it possible to link your crawler to Solr, so that Solr indexes the crawled files/records?
Didier
Hi Didier,
I personally don't know anything about Solr, I'm sorry, so I can't give you instructions or hints.
But some users of phpcrawl mentioned that they are using phpcrawl and Solr together, so it seems
to be possible.
(Like here: https://sourceforge.net/projects/phpcrawl/forums/forum/307696/topic/5103273)
Maybe you could ask these users.
… I just did a quick Google search and found this project:
http://code.google.com/p/open-unsane-server/
They seem to deal with Solr and phpcrawl in their project (http://code.google.com/p/open-unsane-server/source/browse/trunk/unsanecrawl/classes/unsane.crawl.class.php), but I have no clue what's going on there (since I don't know Solr).
Maybe it helps you.
Hi Uwe,
Thank you for the information, I'm going to follow the proposed links.
I have just integrated the crawler into my interface. It's working, thank you.
Now I'm going to test the Solr integration.
Didier
Reading the post again, I think I need to ask my question in a different way.
Do you know of a class to search inside the content (HTML, PDF, ...) retrieved by your crawler?
Thank you,
Didier
Hi Didier,
What do you want to search for inside the code? URLs/links?