An object-relational mapping (ORM) library for Java
A PHP search engine for your website and a web analytics tool. License: GNU GPL v3
Self-hosted search engine with web service to share discoveries with
Easy Spider is a distributed Perl web crawler project from 2006
Xaraya is a web application framework and CMS written in PHP.