glark offers grep-like searching of text files, with very powerful, complex regular expressions (e.g., "/foo\w+/ and /bar[^\d]*baz$/ within 4 lines of each other"), as well as highlighting of matches.
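The "within 4 lines of each other" example can be illustrated with a minimal Python sketch. This is not glark's implementation or its command-line syntax; the function name `proximity_matches` and the sample lines are hypothetical, and the exact distance semantics glark uses are an assumption here.

```python
import re

def proximity_matches(lines, pattern_a, pattern_b, distance):
    """Return (i, j) pairs of 0-based line indices where pattern_a matches
    line i, pattern_b matches line j, and the two lines are at most
    `distance` lines apart. Hypothetical helper, not part of glark."""
    rx_a = re.compile(pattern_a)
    rx_b = re.compile(pattern_b)
    hits_a = [i for i, line in enumerate(lines) if rx_a.search(line)]
    hits_b = [j for j, line in enumerate(lines) if rx_b.search(line)]
    return [(i, j) for i in hits_a for j in hits_b if abs(i - j) <= distance]

# Made-up sample input for illustration only.
sample = [
    "call food()",
    "nothing here",
    "bar and then baz",
    "filler",
    "filler",
    "filler",
    "filler",
    "bar 12 baz",   # contains a digit between bar and baz, so it does not match
]
pairs = proximity_matches(sample, r"foo\w+", r"bar[^\d]*baz$", 4)
```

Collecting the hits for each pattern separately keeps the sketch simple; a real tool would more likely stream the input and keep only a sliding window of recent lines.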
(XSLT transformer/editor)
A text editor that allows the loading and editing of an XML document and an XSLT document at the same time. It can also apply the XSLT to the XML and display the output for further editing/saving. Pluggable XML and XSLT parsers.
Mediawiki-PDF is a MediaWiki extension to convert wiki articles into PDF documents. The extension uses HTMLDOC to convert the wiki pages from plain HTML into PDF.
JLatex provides a unified API for generating documents in LaTeX format. The framework consists of Java classes intended to provide 1:1 compatibility with LaTeX features.
JReplace is a search & replace utility that searches and replaces multi-line text in files in a directory or subdirectories. Perfect for modifying a whole website at once. Files can be filtered by extension. Written in Java.
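The idea of a multi-line search and replace over a directory tree, filtered by extension, might be sketched as follows. JReplace itself is written in Java; this Python sketch is only an illustration, and the function name `replace_in_tree`, its parameters, and the UTF-8 assumption are all hypothetical.

```python
import re
from pathlib import Path

def replace_in_tree(root, pattern, replacement, extensions):
    """Apply a regex replacement to every file under `root` (including
    subdirectories) whose suffix is in `extensions`. Each file is read as a
    whole, so the pattern may span several lines. Returns the changed paths.
    Hypothetical helper; assumes the files are UTF-8 text."""
    rx = re.compile(pattern, re.DOTALL)
    changed = []
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file() or path.suffix not in extensions:
            continue
        text = path.read_text(encoding="utf-8")
        new_text, count = rx.subn(replacement, text)
        if count:
            path.write_text(new_text, encoding="utf-8")
            changed.append(path)
    return changed
```

A call such as `replace_in_tree("site", r"old\nfooter", "new footer", {".html"})` would rewrite only the `.html` files below a hypothetical `site` directory. Because the files are modified in place, working on a copy or under version control is advisable.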
C.E.L. Pad is a lightweight, cross-platform text editor written in Java. It lets the user change the font, font size, and Look & Feel.
Querylog is a console tool for performing SQL queries on (log) files. Lines from one or more text files or stdin are matched using regular expressions and loaded into an in-memory database on which SQL queries can be performed.
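The approach described for Querylog (regular expressions feeding an in-memory database that is then queried with SQL) can be sketched in Python with the standard `sqlite3` module. This is not Querylog's code or interface; the log format, the `LINE_RX` pattern, the table layout, and the `load_lines` helper are assumptions made up for illustration.

```python
import re
import sqlite3

# Hypothetical log format: "<timestamp> <LEVEL> <message>".
LINE_RX = re.compile(r"^(?P<ts>\S+) (?P<level>[A-Z]+) (?P<message>.*)$")

def load_lines(lines):
    """Parse lines with LINE_RX and load the matches into an in-memory
    SQLite table named `log`. Lines that do not match are skipped."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE log (ts TEXT, level TEXT, message TEXT)")
    rows = []
    for line in lines:
        m = LINE_RX.match(line.rstrip("\n"))
        if m:
            rows.append((m["ts"], m["level"], m["message"]))
    conn.executemany("INSERT INTO log VALUES (?, ?, ?)", rows)
    return conn

# Made-up sample input; real input could come from files or sys.stdin.
sample = [
    "2024-01-01T10:00:00 INFO service started",
    "2024-01-01T10:00:05 ERROR disk full",
    "not a log line",
    "2024-01-01T10:00:09 ERROR disk full",
]
conn = load_lines(sample)
counts = conn.execute(
    "SELECT level, COUNT(*) FROM log GROUP BY level ORDER BY level"
).fetchall()
```

Named groups in the regular expression map naturally onto table columns, which is why they are used here; any file-like iterable of lines can be passed to `load_lines`.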
Docyrus is a web application for creating articles from imported chapters or articles. You can store local documents (doc, wiki, HTML, RSS, OpenOffice) online, manage them there, and export them in many formats (PDF, OpenOffice, HTML, ...).
EsTexte is a text-to-HTML converter based on an intuitive text format akin to various wiki formats and ASCII text files. Written in Java, it can be used from the command line or from other Java programs.
The PSP Bookreader allows you to read eBooks on your PlayStation Portable. OK, it isn't really a reader and doesn't even run on the PSP itself, but the name is catchier than "text to JPEG converter". Features preview, configurable output, and an easy-to-use GUI.
A student of the Franklin W. Olin College of Engineering wrote his own extremely customizable, extraordinarily functional, tabbed text editor in Python and pygtk. Works on Windows and POSIX-compliant systems. For the scripter and excessive customizer.
Strip out useless tags and other junk from HTML files. Shrink files, enhance readability of HTML source, promote privacy, and clean HTML exported from Microsoft Word (MS-Word). Run HTMLStrip as-is or customize it with your own regular expressions.
Java library to convert FCK Editor XHTML into PDF, using iText. The goal is to provide an implementation through an API and also via a Java Servlet, and to embed a PDF Preview into FCK Editor (as in the HTML Preview), referring to a Servlet URL.
JTF (Java Text Formatter) is a simple formatter for plain (Latin) text. JTF formats the input text into well-formed text, taking into account line width, justification, tables, cells, padding, and other formatting parameters.
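Two of the parameters mentioned, line width and justification, can be illustrated with a short Python sketch built on the standard `textwrap` module. JTF is a Java tool and its API is not shown here; the function `format_text` and its parameters are hypothetical, and tables, cells, and padding are left out.

```python
import textwrap

def format_text(text, width, justify=False):
    """Wrap `text` to `width` columns. With justify=True, every line except
    the last is padded with extra spaces between words so that it is exactly
    `width` characters long. Hypothetical helper, not JTF's API."""
    lines = textwrap.wrap(text, width=width)
    if not justify:
        return lines
    out = []
    for line in lines[:-1]:
        words = line.split()
        if len(words) == 1:
            out.append(line)  # a single word cannot be stretched
            continue
        gaps = len(words) - 1
        spaces = width - sum(len(w) for w in words)
        base, extra = divmod(spaces, gaps)
        parts = []
        for k, word in enumerate(words[:-1]):
            # The leftmost `extra` gaps receive one additional space.
            parts.append(word + " " * (base + (1 if k < extra else 0)))
        parts.append(words[-1])
        out.append("".join(parts))
    out.extend(lines[-1:])  # the last line stays left-aligned
    return out

text = "the quick brown fox jumps over the lazy dog"
justified = format_text(text, 16, justify=True)
```

Leaving the final line left-aligned follows common typesetting practice; whether JTF does the same is not stated in its description.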
XiMoL is a non-validating XML reader/writer library written in C++. It is an iostream-oriented library based on the STL, not a SAX or DOM library (like Xerces, expat, ...). Each object has its own reader/writer (operator<< and operator>>).