I'm trying to crawl a site that has links like:
<a id="blah" href="javascript:__doPostBack('blah$erg$something$blah$sdsd','')">link text here</a>
How can i get that additional data that's returned via ajax from this link?
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I'm a PHPCrawl newbie, so this is just from the hip:
If you specifically know that those JS-style links are a part of the page, could you not just do a suitable regex in your handleDocumentInfo where you extend the PHPCrawl class and override the handleDocumentInfo()-method?
I'm trying to crawl a site that has links like:
<a id="blah" href="javascript:__doPostBack('blah$erg$something$blah$sdsd','')">link text here</a>
How can i get that additional data that's returned via ajax from this link?
Hi,
since phpcrawl doesn't have a impplemented js-interpreter, it's probably just not possible.
Sorry and best regards!
I'm a PHPCrawl newbie, so this is just from the hip:
If you specifically know that those JS-style links are a part of the page, could you not just do a suitable regex in your handleDocumentInfo where you extend the PHPCrawl class and override the handleDocumentInfo()-method?
And perhaps use https://sourceforge.net/projects/phpcrawl/forums/forum/307696/topic/5300676 to examine the new url?
:)