PHPScraper is a universal web-scraping utility for PHP, built with simplicity in mind. The goal is to make XPath selectors optional and to avoid commonly needed boilerplate code. Just create an instance of PHPScraper, go to a website, and start collecting data. All scraping functionality can be accessed either as a function call or as a property call. For example, the page title is available both as a property and as a method call. Many common use cases are already covered: there are prepared extractors for various HTML tags, including their most useful attributes, and you can filter and combine these to suit your needs. Some extractors offer both a simple and a detailed version. PHPScraper can also help collect feeds such as RSS feeds, sitemap.xml entries, and static search indexes. This is useful when deciding which page to crawl next or when building a list of the pages on a website.
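
The "function call or property call" idea can be sketched in plain PHP. The following is a minimal illustration only, not PHPScraper's own code: the class name MiniScraper, its constructor taking an HTML string, and the EXTRACTORS allowlist are all assumptions made for this example. It uses PHP's built-in DOM extension and the `__get` magic method to route property reads to the method of the same name.

```php
<?php
// Hypothetical illustration class; NOT part of PHPScraper.
final class MiniScraper
{
    // Names that may be read as properties (assumed for this sketch).
    private const EXTRACTORS = ['title'];

    private DOMXPath $xpath;

    public function __construct(string $html)
    {
        $dom = new DOMDocument();
        // Suppress the warnings libxml emits for imperfect real-world HTML.
        @$dom->loadHTML($html);
        $this->xpath = new DOMXPath($dom);
    }

    // Method-style access: $scraper->title()
    public function title(): ?string
    {
        $node = $this->xpath->query('//title')->item(0);
        return $node === null ? null : trim($node->textContent);
    }

    // Property-style access: $scraper->title is forwarded to title().
    public function __get(string $name)
    {
        if (in_array($name, self::EXTRACTORS, true)) {
            return $this->$name();
        }
        throw new LogicException("Unknown extractor: {$name}");
    }
}

$scraper = new MiniScraper(
    '<html><head><title> Lorem Ipsum </title></head><body></body></html>'
);

echo $scraper->title(), PHP_EOL; // method call
echo $scraper->title, PHP_EOL;   // property call, same value
```

Both lines print the same trimmed title. Restricting `__get` to an allowlist keeps internal methods such as the constructor from being reachable as properties.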

Features

  • Process RSS feeds, sitemap.xml, etc.
  • Process CSV, XML, and JSON files and URLs
  • Batteries included: metadata, links, images, headings, content, keywords
  • Flexible calling as an attribute or a method
  • Plenty of examples on the PHPScraper website and in the tests
  • Configurable proxy support
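
To make the sitemap.xml feature concrete, here is a minimal sketch of what collecting sitemap entries involves, written with PHP's built-in SimpleXML extension rather than PHPScraper itself. The function name sitemapUrls and the example.com URLs are hypothetical and exist only for this illustration; the library's own interface may differ.

```php
<?php
// Hypothetical helper; PHPScraper's own sitemap handling may differ.
// Returns the <loc> value of every <url> entry in a sitemap document.
function sitemapUrls(string $xml): array
{
    $doc = simplexml_load_string($xml);
    if ($doc === false) {
        return []; // input was not well-formed XML
    }

    $urls = [];
    foreach ($doc->url as $entry) {
        $urls[] = trim((string) $entry->loc);
    }
    return $urls;
}

// Sample sitemap with made-up URLs.
$xml = <<<XML
<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/</loc></url>
  <url><loc>https://example.com/about</loc></url>
</urlset>
XML;

print_r(sitemapUrls($xml));
```

The resulting list of URLs is the kind of input a crawler can use to decide which page to visit next.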

Categories

Web Scrapers

License

GNU General Public License version 3.0 (GPLv3)

Additional Project Details

Operating Systems

Linux

Programming Language

PHP

Related Categories

PHP Web Scrapers

Registered

2023-04-10