WebCollector is an open source web crawler framework based on Java.
XML bindings and a GUI for creating and editing XBMC Scrapers
Simple application for downloading pictures from Zerochan.net
Robots.txt parsing library
Auto Rescanning - Search Terms - Regularly Updated With New Features
Interactive web-search.
Simple Semantic Web Architecture and Protocol
IRToolkit
Data migration/conversion library based on STX and XSLT transformation
Instant File Search
You can analyze a, img, h1, h2 tags in your site.