PHPCrawl is a highly configurable web crawler/web spider library written in PHP. It supports filters, limiters, cookie handling, robots.txt handling, multiprocessing, and much more.
License
GNU General Public License version 2.0 (GPLv2)
User Reviews
-
***A*W*E*S*O*M*E***
-
Wow, this crawler has it all. Even as a point-zero release, it is faster, more mature, and more feature-rich than every other crawler I tried. Especially useful is the test interface, where you can try out all the parameters without coding. This is excellent work!
-
Great tool to crawl sites, excellent support
-
Could use a "Per domain page-limit" :)
-
Very good job. Hard to find better!