FOX is a Java library for parsing documents in the Open Financial Exchange (OFX) format.
HtmlClient provides an SGML/HTML/XHTML parser and a connection client, making web spidering as easy for developers as surfing the web with a ready-made browser. It is based on Apache's HttpClient.
SAX-like API for SGML (SGML parser for Java).
SGML parser for Java, based on OpenSP.
A library that gives access to the TREC (Text REtrieval Conference) collections iteratively, document by document.
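
As an illustration of the document-by-document access pattern described in the last entry, here is a minimal Java sketch. It is not the API of the library listed above. It assumes the common TREC layout in which each document is wrapped in <DOC> ... </DOC> tags on their own lines and carries an identifier in <DOCNO>. The class name TrecDocIterator and the helper docNo are hypothetical.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.UncheckedIOException;
import java.util.Iterator;
import java.util.NoSuchElementException;

/** Hypothetical sketch: yields one raw <DOC>...</DOC> block per call to next(). */
class TrecDocIterator implements Iterator<String> {
    private final BufferedReader in;
    private String next; // document buffered for the next call, or null at end of input

    TrecDocIterator(Reader reader) {
        this.in = new BufferedReader(reader);
        this.next = readDoc();
    }

    // Reads lines until one complete <DOC>...</DOC> block has been collected.
    private String readDoc() {
        try {
            StringBuilder doc = null;
            String line;
            while ((line = in.readLine()) != null) {
                String trimmed = line.trim();
                if (doc == null) {
                    if (!trimmed.startsWith("<DOC>")) {
                        continue; // skip anything between documents
                    }
                    doc = new StringBuilder();
                }
                doc.append(line).append('\n');
                if (trimmed.endsWith("</DOC>")) {
                    return doc.toString();
                }
            }
            return null; // end of input; an unterminated trailing block is dropped
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    @Override
    public boolean hasNext() {
        return next != null;
    }

    @Override
    public String next() {
        if (next == null) {
            throw new NoSuchElementException();
        }
        String current = next;
        next = readDoc(); // only one document is held in memory at a time
        return current;
    }

    /** Extracts the trimmed text between <DOCNO> and </DOCNO>, or null if absent. */
    static String docNo(String doc) {
        int start = doc.indexOf("<DOCNO>");
        int end = doc.indexOf("</DOCNO>");
        if (start < 0 || end < start) {
            return null;
        }
        return doc.substring(start + "<DOCNO>".length(), end).trim();
    }
}
```

Wrapping a FileReader in this iterator lets a caller loop with hasNext() and next() over a large collection file without loading it whole. Real TREC collections vary in compression and tag conventions, so a production reader would need to handle those cases.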