- License: Apache License V2.0 ×
Text Processing
Showing page 1 of 2.
-
DITA Open Toolkit The DITA Open Toolkit is an implementation of the OASIS DITA XML Specification. The Toolkit transforms DITA content into many deliverable formats. See http://dita.xml.org/wiki/the-dita-open-toolkit for information about releases and download packages The source code and issue trackers have been moved to https://github.com/dita-ot/dita-ot
251 weekly downloads -
MindRaider MindRaider is a personal notebook and outliner. Where do you keep private remarks like ideas, plans, gift tips and howtos? Loads of documents and remarks spread around the file system? Can you find a remark when you need it? No? Try MindRaider!
71 weekly downloads -
Shared Questionnaire System Shared Questionnaire System(SQS) is a full-functional Optical Mark Reader(OMR) form processing system implemented in Java-Swing, XSL-FO and AJAX with straightforward GUIs. It is aimed at developing social platform to share knowledge about questionnaire.
9 weekly downloads -
HyFo extended pattern hyphenation HyFo provides hyphenation services in Java 5 for developers in multiple languages. It extends TeX-style hyphenation to directly support re-spelling hyphenations such as TeX's discretionary hyphens.
0 weekly downloads -
LineFold - TeX line-breaking for Java 6 LineFold is a Java 6 implementation of the TeX paragraph line-breaking algorithm.
0 weekly downloads -
DITA2wiki DITA2wiki is a toolkit that enables you to publish DITA content (maps and topics) to a wiki.
2 weekly downloads -
Articlefox Articlefox is a workflow system that can be used to prepare the articles of a small journal.
1 weekly downloads -
Leseratte Leseratte is a Java parser for German written language. Currently, it contains a German lexicon (based on the Wiktionary), inflexion rules, a grammar and a parser. (Semantics component planned.) Usable as a Java library, also provides a graphical UI.
0 weekly downloads -
wordaxe wordaxe (formerly deco-cow): A hyphenation library for Python. Several hyphenation algorithms: - the pattern-based from TeX/OOO, - by decomposition of compound words for German language. Includes support for paragraph line-breaking with ReportLab.
7 weekly downloads -
Tubaina Tubaina is a book generator. Given a text written in afc syntax, a markup language, an html or pdf output is generated. This project has been moved to Github: http://github.com/caelum/tubaina
1 weekly downloads -
SchemaWalker The SchemaWalker is a Java application able to read a any schema and produce XForms web pages for user selected nodes grouped into webpages to allow editing of XML data files.
0 weekly downloads -
Pergamon Pergamon is a java library for extracting metadata and structured text from a variety of file types.
1 weekly downloads -
OpenDarkRoom OpenDarkRoom is an open source cross platform Swing-based Java application that attempts to mimic the OSX-native WriteRoom. OpenDarkRoom allows you to focus on the writing, away from any distractions.
1 weekly downloads -
FlatPack Java API For Flat Files Simple Java delimited and fixed width file parser. Handles CSV, Excel CSV, Tab, Pipe delimiters, just to name a few. Maps column positions in the file to user friendly names via XML. See "FlatPack Feature List" under News for complete feature list.
32 weekly downloads -
MemoryML MemoryML is a project for creating XML-based markup for personal memories and histories.
0 weekly downloads -
Jeckit A Java-based spellchecker which focuses on automatic spelling correction by incorporating lingustic and statistical approaches. Development is done by ASV (Abteilung Automatische Sprachverarbeitung) of Leipzig University.
0 weekly downloads -
FigTeX FigTeX manages images and their easy inclusion in LaTeX documents. Similar to BibTex, the image information is stored in an external file and is imported into the document as needed. It comes with a comfortable GUI for managing the image library.
3 weekly downloads -
Fiction Book to Palm ZTxt converter The simple converter form Fiction Book files (www.fictionbook.org) to Palm ZText pdb format.
1 weekly downloads -
codemod Codemod is a tool/library to assist you with large-scale codebase refactors that can be partially automated but still require human oversight and occasional intervention.
0 weekly downloads -
The Guide The Guide is a tree-based information management tool. It lets you to organize information as nodes in a tree. (A two-pane rich-text outliner for Windows.)
160 weekly downloads -
Piccolo XML Parser for Java Piccolo is the fastest SAX parser for Java, supporting SAX1, SAX2, and JAXP (SAX only). Piccolo is different from other parsers in that it was developed using parser generators. It weighs 160K including XML APIs. See http://piccolo.sf.net for more info.
18 weekly downloads -
Yatti Yatti (rhymes with "patty") is a Java-based template engine. Given a template file with plain text and Java commands, Yatti will interpret the file and produce the appropriate output.
0 weekly downloads -
Twepuo Twepuo is a generic notebook in a form of web-application.
0 weekly downloads -
XMLObject About XMLObject XMLObject is a simple library to covert objects into XML and create objects from XMLs.
0 weekly downloads -
Excel To Java Bean This project is for converting excel file into Java Bean objects.
0 weekly downloads