Retriever is a simple crawler packaged as a Java library that lets developers collect and manipulate documents reachable over a variety of protocols (e.g. HTTP, SMB). It can crawl documents shared on a LAN, on the Web, and on many other sources.

Categories

Search Engines

License

Apache License V2.0

Additional Project Details

Languages

English

Intended Audience

Developers

User Interface

Other toolkit

Programming Language

Java

Related Categories

Java Search Engines

Registered

2007-12-03