
Page requested: http://www.highrankwebsites.com/custom-infographics/( (404)

Help
Mark
2013-10-15
2014-03-13
  • Mark

    Mark - 2013-10-15

    Page requested: http://www.testwebsite/test-page/( (404)

    The crawler is appending an opening parenthesis "(" to the end of some URLs, and the resulting requests return 404.

    Any idea what's causing this?

    Thanks,
    Mark

     
  • Anonymous

    Anonymous - 2014-03-13

    Bump! I have the same issue. There are no links ending in a parenthesis on my site.
    Page requested: http://www.mysite.com/en/canyoning/( (404)
    Referer-page: http://www.mysite.com/en/canyoning/
    Content received: 2753 bytes

    Any ideas?

     

    Last edit: Anonymous 2014-03-13
  • Uwe Hunfeld

    Uwe Hunfeld - 2014-03-13

    Hi!

    Did you try setting $crawler->enableAggressiveLinkSearch(false)?

     
  • Anonymous

    Anonymous - 2014-03-13

    Hi Uwe, thanks for the quick reply! Yep, that does the trick, thanks a lot!
    I find it quite odd that this is TRUE by default. Would it not make more sense for it to be the other way around? I'm curious to hear the explanation.
    Anyway, thanks a lot for a great product and great support!

     
  • Uwe Hunfeld

    Uwe Hunfeld - 2014-03-13

    Hey again,

    Yes, you are right! This setting is enabled by default mainly for compatibility reasons. In early versions of phpcrawl the crawler always searched for links "aggressively"; the option to turn this behaviour off was added later.

    In future versions this method will get removed/deprecated anyway and will be replaced with settings that let the user define, in more detail and more transparently, where the crawler should look for links within the source of an HTML page (script parts, CSS parts, uncommented HTML/script code and so on).
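
For anyone landing on this thread with the same 404s, the fix discussed above can be sketched roughly as follows. This is a minimal, untested sketch: it assumes a phpcrawl 0.8x install with the library reachable at libs/PHPCrawler.class.php, and the class name MyCrawler and the start URL are placeholders that do not come from this thread.

```php
<?php
// Assumption: phpcrawl is unpacked so that this path resolves.
require_once 'libs/PHPCrawler.class.php';

// Hypothetical subclass name, used only for this illustration.
class MyCrawler extends PHPCrawler
{
    // phpcrawl calls this for every document it requests.
    public function handleDocumentInfo($DocInfo)
    {
        // Report only failed requests, together with the page that linked to them.
        if ($DocInfo->http_status_code == 404) {
            echo "404: " . $DocInfo->url
               . " (referer: " . $DocInfo->referer_url . ")\n";
        }
    }
}

$crawler = new MyCrawler();
$crawler->setURL('http://www.example.com/'); // placeholder start URL

// Turn off the aggressive link search (it is on by default).
$crawler->enableAggressiveLinkSearch(false);

$crawler->go();
```

Running the crawl once with the call left out and once with it in place should show whether the bogus ".../(" requests come from the aggressive search on your pages.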

     
