Roach is a complete web scraping toolkit for PHP. It is a shameless clone heavily inspired by the popular Scrapy package for Python. Roach allows us to define spiders that crawl and scrape web documents. But wait, there’s more. Roach isn’t just a simple crawler, but includes an entire pipeline to clean, persist and otherwise process extracted data as well. It’s your all-in-one resource for web scraping in PHP. Roach doesn’t depend on a specific framework. Instead, you can use the core package on its own or install one of the framework-specific adapters. Currently, there’s a first-party adapter available to use Roach in your Laravel projects with more coming. Roach is built from the ground up with extensibility in mind. In fact, most of Roach’s built-in behavior works the exact same way that any custom extensions or middleware works.

Features

  • Roach is a complete web scraping toolkit for PHP
  • Roach is built from the ground up with extensibility in mind
  • Roach doesn’t depend on a specific framework
  • There’s a first-party adapter available to use Roach in your Laravel projects with more coming
  • Includes an entire pipeline to clean, persist and otherwise process extracted data as well
  • It’s your all-in-one resource for web scraping in PHP

Project Samples

Project Activity

See All Activity >

Categories

Web Scrapers

License

MIT License

Follow Roach

Roach Web Site

Other Useful Business Software
MongoDB Atlas runs apps anywhere Icon
MongoDB Atlas runs apps anywhere

Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Roach!

Additional Project Details

Programming Language

PHP

Related Categories

PHP Web Scrapers

Registered

2023-04-12