Web scraping (also called web harvesting or web data extraction) is data scraping used to extract data from websites.[1] Web scraping software may access the World Wide Web directly using the Hypertext Transfer Protocol, or through a web browser. Although web scraping can be done manually by a software user, the term typically refers to automated processes implemented using a bot or web crawler. It is a form of copying in which specific data is gathered from the web and copied, typically into a central local database or spreadsheet, for later retrieval or analysis.

Scraping a web page involves fetching it and then extracting data from it.[1][2] Fetching is the downloading of a page, which is what a browser does when a user views the page. Web crawling is therefore a main component of web scraping, since it fetches pages for later processing. Once a page has been fetched, extraction can take place: its content may be parsed, searched, or reformatted, its data copied into a spreadsheet, and so on.
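The fetch-then-extract split described above can be sketched in a few lines. This is only an illustration written in Python with the standard library, not code from the project itself (which is written in Perl). The function names `fetch`, `extract_links`, and `to_csv`, the `LinkExtractor` class, and the `example-scraper/0.1` user-agent string are all assumptions made for the example.

```python
import csv
import io
from html.parser import HTMLParser
from urllib.request import Request, urlopen


def fetch(url, timeout=10.0):
    """Fetching: download the raw HTML of one page over HTTP."""
    # The User-Agent value is a made-up placeholder for this sketch.
    request = Request(url, headers={"User-Agent": "example-scraper/0.1"})
    with urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


class LinkExtractor(HTMLParser):
    """Extraction: collect (href, link text) pairs from <a> elements."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None   # href of the <a> currently open, if any
        self._text = []     # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self._href = href
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html):
    """Parse an HTML string and return the links it contains."""
    parser = LinkExtractor()
    parser.feed(html)
    parser.close()
    return parser.links


def to_csv(rows):
    """Copy the extracted rows into spreadsheet-friendly CSV text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["href", "text"])
    writer.writerows(rows)
    return buffer.getvalue()
```

Under these assumptions, a call such as `to_csv(extract_links(fetch(url)))` would chain the steps for a single page. A crawler would repeat the fetch step for each link it discovers, ideally while respecting the site's terms and its robots.txt.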

License

MIT License, Apache License V2.0, Affero GNU Public License, GNU General Public License version 3.0 (GPLv3), GNU Library or Lesser General Public License version 3.0 (LGPLv3)

Perl Web Scraping Project Web Site

Additional Project Details

Operating Systems

Linux, Mac, Windows

User Interface

Tk

Programming Language

Perl

Related Categories

Perl Internet Software, Perl Web Scrapers

Registered

2017-10-12