Retriever is a simple crawler packaged as a Java library that lets developers collect and manipulate documents reachable over a variety of protocols (e.g. HTTP, SMB). You can easily crawl documents shared on a LAN, on the Web, and from many other sources.
License: Apache License V2.0