Hi Uwe. thanks for the quick reply! Yep that does the trick, thanks a lot!
I find it quite odd that this is TRUE by default. Would it not make more sense to be other way around? I'm curious to hear the explanation..
anyway thanks a lot for a great product and great support!
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
yes, you are right! This setting is enabled by default mainly because of compatibility resaons. In early versions of phpcrawl the crawler was always searching for links "agressive", and later on this option was added to turn off this behaviour optionally.
In future versions this method will get removed/deprecated anyway and will be replaced with some settings that let's the user define more detailed ans transparently where the crawler should look for links within the source a html-page (like script-parts, css-parts, uncommented html/script-code and so on).
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Page requested: http://www.testwebsite/test-page/( (404)
The crawler is appending a left closing parenthesis "(" to the end of some URLs and returning 404.
Any idea what's causing this?
Thanks,
Mark
View and moderate all "Help" comments posted by this user
Mark all as spam, and block user from posting to "Forum"
Bump! i have same issue...there are no links ending in parenthesis on my site..
Page requested: http://www.mysite.com/en/canyoning/( (404)
Referer-page: http://www.mysite.com/en/canyoning/
Content received: 2753 bytes
Any ideas?
Last edit: Anonymous 2014-03-13
Hi!
Did you try to set $crawler->enableAggressiveLinkSearch(false) ?
View and moderate all "Help" comments posted by this user
Mark all as spam, and block user from posting to "Forum"
Hi Uwe. thanks for the quick reply! Yep that does the trick, thanks a lot!
I find it quite odd that this is TRUE by default. Would it not make more sense to be other way around? I'm curious to hear the explanation..
anyway thanks a lot for a great product and great support!
Hey again,
yes, you are right! This setting is enabled by default mainly because of compatibility resaons. In early versions of phpcrawl the crawler was always searching for links "agressive", and later on this option was added to turn off this behaviour optionally.
In future versions this method will get removed/deprecated anyway and will be replaced with some settings that let's the user define more detailed ans transparently where the crawler should look for links within the source a html-page (like script-parts, css-parts, uncommented html/script-code and so on).