MixDEM a web based ETL tools meant for Web integration, Data transformation and Mashup edition. It include MixDEM ETL Engine created using ZEND Framework, and MixDEM GUI Editor an AJAX IDE that enable developers to quickly and easily create applications.
PySMBSearch is a crawler and search engine for SMB shares. It consists of a crawler script, which creates an index and stores it in an SQL database, and a CGI script that can be used to extract queries from the database.
Vertical Web Extractor is a project to extract the data of products or something else in congeneric websites. Developed with java ,using SWT/JFace/RCP technology.Suitable for windows and linux.It's somehow like a vertical search engine plus data extract