Menu

Filter Issue

Help
Anonymous
2014-08-23
2014-12-24
  • Anonymous

    Anonymous - 2014-08-23

    I have noticed several instances where the system is still tries to index css documents if the site is using variables in the url.

    Example
    domain.com/css/style.css?id=1.2

     
  • Uwe Hunfeld

    Uwe Hunfeld - 2014-10-08

    Hi!

    Yes, but what's the problem?
    WHat do you mean with "still"?

    By default, phpcrawl tries to crawl everythinbg it finds.
    Just set a filter (or more filters) for the documents you don't want
    the crawler to receive.

     
  • Anonymous

    Anonymous - 2014-12-24

    I was having the same issue you just need to modify the Regular Expression;

    $crawler->addURLFilterRule("#.(css|js)?*# i");

    This will stop the query string and filter out anything that is .css?

    There is a good doc on regular expressions for analytics but it just runs through basic regular expressions - http://www.lunametrics.com/regex-book/Regular-Expressions-Google-Analytics.pdf

     

Anonymous
Anonymous

Add attachments
Cancel





Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.