Retriever is a light, extensible crawler packaged as a Java library that lets developers collect and manipulate documents reachable over a variety of protocols (e.g. HTTP, SMB). It can crawl documents shared on a LAN, on the Web, and on many other sources.

Categories

Search Engines

License

Apache License V2.0

Additional Project Details

Languages

English

Intended Audience

Developers

User Interface

Other toolkit

Programming Language

Java

Related Categories

Java Search Engines

Registered

2007-12-03