Crawl-By-Example runs a crawl, which classifies the processed pages by subjects and finds the best pages according to examples provided by the operator. Crawl-By-Example is a plugin to the Heritrix crawler, and was done as a part of GSoC06 program.

Project Activity

See All Activity >

License

GNU Library or Lesser General Public License version 2.0 (LGPLv2)

Follow Crawl-By-Example (Heritrix plugin)

Crawl-By-Example (Heritrix plugin) Web Site

Other Useful Business Software
$300 in Free Credit Towards Top Cloud Services Icon
$300 in Free Credit Towards Top Cloud Services

Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
Get Started
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Crawl-By-Example (Heritrix plugin)!

Additional Project Details

Languages

English

Intended Audience

Advanced End Users, Developers, Science/Research

User Interface

Web-based

Programming Language

Java

Related Categories

Java Search Engines, Java Information Analysis Software

Registered

2007-02-12