PHPCrawl is a highly configurable web crawler/web spider library written in PHP. It supports filters, limiters, cookie handling, robots.txt handling, multiprocessing and much more.
License
GNU General Public License version 2.0 (GPLv2)
User Reviews
-
***A*W*E*S*O*M*E***
-
Wow, this crawler has it all. Even at a point-zero release, it is faster, more mature, and more feature-rich than every other crawler I tried. Especially useful is the test interface, where you can try out all the parameters without coding. This is excellent work!
-
Great tool to crawl sites, excellent support
-
Could use a per-domain page limit :)
-
Very good job. Hard to find better!