Easyspider - Distributed Web Crawler Icon

Easyspider - Distributed Web Crawler

Easy Spider is a distributed Perl Web Crawler Project from 2006

5.0 Stars (1)
1 Download (This Week)
Last Update:
Download Easyspider - 01.11.2005.rar
Browse All Files
Windows Mac Linux

Screenshots

Description

Easy Spider is a distributed Perl Web Crawler Project from 2006. It features code from crawling webpages, distributing it to a server and generating xml files from it. The client site can be any computer (Windows or Linux) and the Server stores all data.

Websites that use EasySpider and Perl/PHP Backends:
https://www.buzzerstar.com/
https://www.buzzerstar.net/
https://www.buzzerstar.net/post.php
https://www.buzzerstar.com/post.php
https://www.buzzerstar.com/development/
https://www.facebook.com/BuzzerStar
http://sebastianenger.wordpress.com/
https://github.com/thecerial/iosec_addons
https://www.buzzerstar.net/jokes/
https://www.buzzerstar.com/kategorie/Entertainment
https://www.buzzerstar.net/games/mahjong/

Webcrawlers are mostly the first thing to start programming at if you start your programming career. It is fun to look at some code that is few years ago and to see how one has improved himself.

(c) Sebastian Enger 2005-2015

Easyspider - Distributed Web Crawler Web Site

Features

  • Client/Server Distributed Crawling
  • Perl Programming Language
  • Config File Support
  • PDF,DOC,XLS,PPT Extraction Support

KEEP ME UPDATED

User Ratings

★★★★★
★★★★
★★★
★★
1
0
0
0
0
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 4 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 4 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5
Write a Review

User Reviews

  • sebastianenger
    1 of 5 2 of 5 3 of 5 4 of 5 5 of 5

    Easyspider is a perl client/Server architecture to crawl the web for interessting webpages. The Server can be any box that has internet access and allows perl programms to run. The client connects to the server, gets its working task, fullfills it and give the resuts as xml stream back to the server. the server then can install that xml file into a oracle/mysql/mariadb etc database or can be parsed by the sphinxsearch.com fulltext indexer to generate searchable content for your webpage. Happy Hacking ;-)

    Posted 05/10/2014
Read more reviews

Additional Project Details

Languages

English, German

Intended Audience

Telecommunications Industry, System Administrators, Developers

User Interface

Console/Terminal, Command-line

Programming Language

Perl

Registered

2014-05-09
Screenshots can attract more users to your project.
Features can attract more users to your project.