Discontinued lightweight Desktop-Files/SMB/FTP crawler and search engine.
Roosster.org is a personal "on-demand" search engine. This means, it indexes only items/entries/files/URLs you explicitly tell it to index and provides a full-text-search over indexed items.
Arts is a collection of perl scripts that allow you to input text into a quick and dirty knowledge base. It's designed to save information that is generated by email away for safekeeping in an organized web index.
Deathwatch WebCrawler Personal search engine that runs on any Windows machine with the .NET Framework installed.
A platform & DB-independent method for receiving job or volunteer application online and subsequently evaluating and managing them. Developed by Oxfam Australia for the Humanitarian Relief Register: http://www.caa.org.au/helping/work/relief/index.html
JobClient downloads information from job-seeker sites, filters and sorts them against your skillset, and provides a GUI interface to browse and apply for jobs. Utilities are included for archiving, and screenscraping
Written in PHP and designed to maintain a personal database of bookmarks, Linkerdoodle is a simple link organizer.
Mac GoogleSeach is an OpenSource effort to implement the Google SOAP APIs on Mac OS X.
A Perl program that archives newsgroups and provides a web interface to the archive.
SpookShare is a protocol for posting and searching for messages over HTTP. It's main use is for file sharing. SWSpookShare is an implementation of the SpookShare protocol. It's [currently] written in perl5 and runs as a CGI on an real web server.
Satellite is a Perl website index/search package meant for indexing and searching medium size websites. Satellite currently supports text (.txt, .html etc) and pdf files. <br><br><a href=http://satellite2.sourceforge.net>Go here for a demo</a>
Tyriel is an open-source search engine written in python and designed to run within a small group of sites (but potentially extensible to a greater scope).
UTYP is a visual search service for pictures and an alternative challenge-response test to ensure that the response is not generated by a computer. Its framework is based on outsourcing visual recognition of images and picture to human playing games.
Values-based Document Analysis: I want to take some rudimentary Document Analysis work that I have done and make it more sophisticated and to use it to analyze (at least) all of the docuemnts of the web for (human) values priorities. The project woul
Websitemirror is a small program to download complete websites into a specific directory for offline viewing. Websitemirror ist ein kleines Programm welches eine komplette Webseite in einer Verzeichnis für offline browsen herunterläd.
Memephage is an automated web log (blog). It passively gathers and summarizes links from various places. Currently: IRC, social MUDs, e-mail, and web browsers. Uses the POE multitasking and networking framework for Perl.
Perl libraries to convert ebook formats and search ebook catalogs.
The main function of this script is to shorten long website-URL's -- converting long URL's into easy-to-remember, short ones. [htaccess, MOD_REWRITE, XHTML 1.0 strict, CSS 1, JS 1.2, PHP 5X, MySQL 4X]