Pamda is an MDA tool (UML to code generator). It uses UML API to parse XMI and thus, unlike some other MDA tools, is lenient to XMI extensions. Implemented as Ant task and uses Velocity template engine. Tested with XMI produced by Enterprise Architect.
Chaperon is a LALR(1) parser, which parse structured text documents and
generate XML documents as output. It includes a parser generator like yacc
and a regex scaner like lex. As input use Chaperon a grammar written in XML.
Ripsaw is a desktop application that allows you to create, edit, group and de/activate a regular expressions and then use them to parse input from a file or pasted in source. Ripsaw is built in Java and uses SWT for fast responsive forms.
Java API to process or parse HTML documents.
If your Java application needs or would like to be able to process some text in HTML format, you'd probably find this API interesting.
The Personal Data Interchange (PDI) standard was initially developed by Versit for the communication of data including the popular vCard and vCalendar types.
This Java package allows a programmer to take an InputStream and parse its PDI data.
WhiteBeer is a new programming paradigm that is motivated by Noam Chomsky's Minimalist Program. It uses feature checking mechanism to parse programs so that it can provide word-order-independence like natural language processing.
DISCONTINUED!
StoffelLOC is a tiny tool to count Lines of Code. It can parse a single file or whole structures and it can be used to count more than just .java files. I started it because needed it to count the lines in Java Serverpages.
At the moment
Java library for Schema based file parser.
Csv and flat file parser and writer.
This is a java library that enables to parse and write Csv and flat files based on a schema.
XMLStruct is a code generation tool that produces Java classes to parse XML based on a schema description such as XMLSchema and a set of attached actions.
This date parser for Java will parse the input from human people as date value. Differently to the SimpleDateFormat it will completely missing part with the current date. It will support all available locales (i18n). It has a very small foot print.
The Microsoft Bytecode Engineering Library is a library written completely in Java that allows the user to parse, create, edit, and rewrite .NET assemblies. The code originally came from the University of Arizona. http://www.cs.arizona.edu/projects/mbel/