Digital Library Software
Archive your personal history
Self-hosted search engine with web service to share discoveries with
Decentralized Web Search Engine
Very configurable web downloader
An open source search engine with RESTFul API and crawlers
Framework for search and display of heterogenous document collections.
WebCollector is an open source web crawler framework based on Java.
Simple Semantic Web Architecture and Protocol
Data migration/conversion library based on STX and XSLT transformation