WebSPHINX
Description
WebSPHINX is a Java class library for building web crawlers (robots, spiders), originally developed by Robert Miller of Carnegie Mellon University. Its features include multithreaded crawling, tolerant HTML parsing, URL filtering and page classification, pattern matching, mirroring, and more.
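To illustrate what URL filtering means in a crawler, here is a minimal, single-threaded sketch of a breadth-first crawl that only follows links accepted by a filter. It does not use the WebSPHINX API: the class `CrawlSketch`, the method `crawl`, the `shouldVisit` predicate, and the example URLs are all hypothetical, and an in-memory map of links stands in for fetching and parsing real pages.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.function.Predicate;

// Hypothetical sketch; this is NOT the WebSPHINX API.
final class CrawlSketch {

    // Breadth-first crawl over an in-memory link graph.
    // `links` maps a URL to the URLs found on that page (assumed stand-in
    // for downloading and parsing HTML); `shouldVisit` is the URL filter.
    static List<String> crawl(String root,
                              Map<String, List<String>> links,
                              Predicate<String> shouldVisit) {
        List<String> visited = new ArrayList<>();   // pages in visit order
        Set<String> seen = new HashSet<>();         // URLs already queued
        Deque<String> frontier = new ArrayDeque<>();
        seen.add(root);
        frontier.add(root);
        while (!frontier.isEmpty()) {
            String url = frontier.remove();
            visited.add(url);
            for (String next : links.getOrDefault(url, List.of())) {
                // Queue a link only if the filter accepts it and it is new.
                if (shouldVisit.test(next) && seen.add(next)) {
                    frontier.add(next);
                }
            }
        }
        return visited;
    }

    public static void main(String[] args) {
        Map<String, List<String>> links = Map.of(
            "http://example.com/",
                List.of("http://example.com/a", "http://other.example/x"),
            "http://example.com/a",
                List.of("http://example.com/", "http://example.com/b"));
        // Filter: stay on the starting host.
        List<String> order = crawl("http://example.com/", links,
            u -> u.startsWith("http://example.com/"));
        System.out.println(order);
    }
}
```

A real crawler would fetch pages over the network and, as in WebSPHINX, could do so from several threads; the sketch keeps a single thread so the role of the filter and the visited set is easy to follow.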
WebSPHINX Web Site
User Ratings
User Reviews
- Easy to install and use.
- Great visualization shown... good work done on the GUI part also... WebSphinx crawls all the links on the given url, and crawls along. Everything is configurable.