A threaded Web graph (Power law random graph) generator written in Python. It can generate a synthetic Web graph of about one million nodes in a few minutes on a desktop machine. It implements a threaded variant of the RMAT algorithm.
This is a collection of REST specifications, and implementations of those specs, for very low-level information sharing and workflow operations using REST actions over HTTP. Implementations are in various languages, mainly Java, Python, and Ruby.
TOPCLASS Online Periodical Content Library and Submission System (TOPCLASS) allows periodical publications to receive, catalog, and retrieve content submissions over the web. Among other technologies, TOPCLASS uses Django, MySQL, and AJAX.
Configurable wiki parser and XML based engine for easy generation of documentation.
Pure-Python search engine loosely based on Lucene.
Sjaman - a web server plugin for enabling the semantic web. Its main functionalities are part-of-speech tagging, automatic language detection and clustering of web pages. This can increase the quality of future (semantic) web search and applications.
Solrscan is a tool for posting Solr format Xml documents to a Solr Index. It has support for full or incremental mode and maintains a cache of the current state of an index.