Crawl-By-Example runs a crawl, which classifies the processed pages by subjects and finds the best pages according to examples provided by the operator. Crawl-By-Example is a plugin to the Heritrix crawler, and was done as a part of GSoC06 program.

Project Activity

See All Activity >

License

GNU Library or Lesser General Public License version 2.0 (LGPLv2)

Follow Crawl-By-Example (Heritrix plugin)

Crawl-By-Example (Heritrix plugin) Web Site

You Might Also Like
Free CRM Software With Something for Everyone Icon
Free CRM Software With Something for Everyone

216,000+ customers in over 135 countries grow their businesses with HubSpot

Think CRM software is just about contact management? Think again. HubSpot CRM has free tools for everyone on your team, and it’s 100% free. Here’s how our free CRM solution makes your job easier.
Get free CRM
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Crawl-By-Example (Heritrix plugin)!

Additional Project Details

Languages

English

Intended Audience

Science/Research, Advanced End Users, Developers

User Interface

Web-based

Programming Language

Java

Related Categories

Java Search Engines, Java Information Analysis Software

Registered

2007-02-12