Crawl only greek domains

Help
multidanze
2013-01-25
2013-04-09
  • multidanze

    multidanze - 2013-01-25

    Hi, i would like my crawler to crawl and find greek domains .gr

    I used this: $crawler->addURLFilterRule("#\.(com|net|org|me|ly|)$# i"); NO LUCK
    and this: //$crawler->addURLFollowRule("#\.(gr)$# i"); NO LUCK

    Is there any other way?

     
  • Nobody/Anonymous

    Hi!

    Your rules don't work because they only match at the  end of the URL, e.g. in the URL "http://domain.gr/finle.html" the ".gr" is in somewhere in the middle of it.

    Try $crawler->addURLFollowRule("#^https?://*\.gr# i"), (didnt't test it thought)

     


Anonymous

Cancel  Add attachments





Get latest updates about Open Source Projects, Conferences and News.

Sign up for the SourceForge newsletter:





No, thanks