A Java implementation of a flexible and extensible web spider engine.
Optional modules allow functionality to be added (searching dead links, testing the performance and scalability of a site, creating a sitemap, etc ..
No progress on this, I suppose... Crawler4j(2008) and Nutch(2009) are the on going ones and have stable releases also ...
Seems outdated. The linkchecker doesn't work; results in exceptions.