Hey,
I am trying to crawl some data after a search form has been sent.
-- search form
After the form has been sent, the result table shows up.
// "xy" stands in for the real start URL.
$crawler = new MyCrawler();
$crawler->setURL("xy");
// Form fields to send along with the search request.
$post_data = array("ausschreibungssuche_hf_0" => "", "searchString" => "", "publishDateRange" => "ALL", "submitButton" => 1);
// The first argument is a regular expression, so "." and "?" in the URL are escaped.
$crawler->addPostData("#http://www\.xy\.de/search\.html\?4-1\.IBehaviorListener\.0-html-body-searchForm-submitButton#", $post_data);
$crawler->addContentTypeReceiveRule("#text/html#");
// Skip images, stylesheets and scripts.
$crawler->addURLFilterRule("#\.(jpg|jpeg|gif|png)$# i");
$crawler->addURLFilterRule("#\.(css|js)$# i");
$crawler->enableCookieHandling(true);
$crawler->setUserAgentString('Mozilla/5.0 (Windows NT 6.1; WOW64; rv:33.0) Gecko/20100101 Firefox/33.0');
$crawler->setFollowMode(0);
$crawler->setFollowRedirects(true);
// Stop after roughly 1 MB of traffic.
$crawler->setTrafficLimit(1000 * 1024);
$crawler->go();
The crawler receives some data, but not from the result table. Something is wrong, but I have no idea what.
Any ideas? If you need the full URL, I will send it by email or PM.
Hi!
Could you send me the URL for a short test? (PM)
Hi!
I got your PM but haven't had time to run a test yet (lots to do). I will in the next few days. Just wanted to let you know.
Hi again,
I just tested the site you mentioned.
The site you are talking about does its POST via AJAX (JavaScript).
What you get in your script is an XML document containing the search result.
The JavaScript on the page fetches this XML after the POST, puts its content into the DOM of the search page, and displays it in an HTML table.
So there is no way for a crawler to get the contents of the result page directly (as far as I can see), but you do get the results as XML, which is even better, I'd say.
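Since the results arrive as XML, something along these lines might be enough to pull them out of the content your script receives. This is only a rough sketch: the sample XML, the element names (results, item, title, date) and the function name extractResults are all made up for illustration, so they have to be adapted to whatever the site really returns.

```php
<?php
// Hypothetical sample of the XML the script might receive. The real
// structure and element names on the site in question are unknown.
$xml = <<<XML
<?xml version="1.0" encoding="UTF-8"?>
<results>
  <item><title>Tender A</title><date>2014-11-01</date></item>
  <item><title>Tender B</title><date>2014-11-02</date></item>
</results>
XML;

// Turn the XML string into a plain array of result rows.
// Returns an empty array if the string is not well-formed XML.
function extractResults(string $xml): array
{
    // Collect parse errors internally instead of emitting PHP warnings.
    libxml_use_internal_errors(true);
    $doc = simplexml_load_string($xml);
    if ($doc === false) {
        return [];
    }

    $rows = [];
    foreach ($doc->item as $item) {
        $rows[] = [
            'title' => (string) $item->title,
            'date'  => (string) $item->date,
        ];
    }
    return $rows;
}

$rows = extractResults($xml);
```

In the crawler you would pass in the content received for the POST response instead of the sample string.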
Hope I could help!