This project aims to build a suite of Natural Language Processing tools. Modules will include corpus indexing and access tools, a part-of-speech tagger, tokenisers, text classification software, etc.
The JSocket Wrench is a JDK 5.0 library (with a thin JNI layer) that provides support for low-level internet protocols in the Java programming language through subclasses of java.net.SocketImpl and java.net.DatagramSocketImpl.
EBML, or Extensible Binary Meta-Language, is a simple XML like binary language for describing data in structured style. EBML was originally designed for use in the Matroska project, but the developers saw that EBML was very flexible and extensible.