Retriever is a simple crawler packed as a Java library that allows developers to collect and manipulate documents reachable by a variety of protocols (e.g. http, smb). You'll easily crawl documents shared in a LAN, on the Web, and many other sources.

Project Activity

See All Activity >

Categories

Search Engines

License

Apache License V2.0

Follow Retriever: a light, extensible crawler

Retriever: a light, extensible crawler Web Site

Other Useful Business Software
Gen AI apps are built with MongoDB Atlas Icon
Gen AI apps are built with MongoDB Atlas

The database for AI-powered applications.

MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Retriever: a light, extensible crawler!

Additional Project Details

Languages

English

Intended Audience

Developers

User Interface

Other toolkit

Programming Language

Java

Related Categories

Java Search Engines

Registered

2007-12-03