Menu

Recursive loop due to bad HTML code

Help
pmsteil
2013-03-28
2013-04-09
  • pmsteil

    pmsteil - 2013-03-28

    I have a problem in that when PHPCrawl gets to a certain page, there is an incorrectly formatted link which is causing an infinite loop…

    How can I warn PHPCrawl about a link it should ignore?

     
  • pmsteil

    pmsteil - 2013-03-29

    I figured out to use:

    $this->addURLFilterRule("#(clear\.gif)# i");

     
  • Uwe Hunfeld

    Uwe Hunfeld - 2013-04-08

    Hey Patrick,

    could you post the link which is causing the problem?
    So maybe i'm able to prevent the crawler from hanging in an infinite loop eventhough the link
    is corrupted, this should not happen.

    Thanks!

     

Anonymous
Anonymous

Add attachments
Cancel





Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.