Text Processing
Showing page 1 of 2.
-
PDFBox PDFBox is a Java PDF Library. This project will allow access to all of the components in a PDF document. More PDF manipulation features will be added as the project matures. This ships with a utility to take a PDF document and output a text file.
276 weekly downloads -
RText RText is a customizable programmer's text editor written in Java. Some of its features include: syntax highlighting, editing multiple documents at once, printing and print preview, find/replace/find in files dialogs, undo/redo, and online help.
93 weekly downloads -
jPod intarsys PDF library jPod is a rich PDF manipulation and rendering framework. A complete rendering library based on jPod is available here at "jPodRenderer". To see jPod & jPodRenderer at work, have a look at www.cabaret-solutions.com
37 weekly downloads -
FMPP - FreeMarker-based PreProcessor Command-line/Ant-task/embeddable text file preprocessor. Macros, flow control, expressions. Recursive directory processing. Extendable in Java to display data from any data sources (as database). Can generate complete homepages (tree of HTML-s, image
33 weekly downloads -
farsi-commons A Java toolbox with commonly used Farsi Language functions. Includes functions for text manipulation, standardization, normalization, search, replace and changing words and ligatures. Fixing White space problems, Jalai date and Calendar, etc...
75 weekly downloads -
DocDiff: Compare text word by word DocDiff compares two text files and shows the difference.
23 weekly downloads -
Morfologik Polish morphological analyzer and Java libraries interfacing it. First completely open-source and comprehensive morphological tools and finite-state technology for Polish and other languages.
63 weekly downloads -
latex-mk LaTeX-Mk is a collection of makefile fragments for managing small to large LaTeX based documentation projects. The idea is that especially large documents, there may be many many steps required to typeset the document (export modified figures to postscr
24 weekly downloads -
Conversion of other file formats to PDF xtopdf: Tools to convert other formats (x) to PDF; x as in math. - solve for x :-) Currently x == {.txt, .DBF}. Others to follow. Benefits: all those of PDF (better cross-platform viewing/printing, read-only, etc.)
7 weekly downloads -
NaNoWriTool NaNoWriTool is a text editor with features specifically geared towards NaNoWriMo, the National Novel Writing Month. It contains a live word counter, daily word count target, a timer for word wars and automatic backup feature.
8 weekly downloads -
wordaxe wordaxe (formerly deco-cow): A hyphenation library for Python. Several hyphenation algorithms: - the pattern-based from TeX/OOO, - by decomposition of compound words for German language. Includes support for paragraph line-breaking with ReportLab.
9 weekly downloads -
Java Text Processing Framework A framework that allows textprocessing to be integrated into any Java application in a generic manner. It represents an approbiate abstraction of the necessary elements and offers a generic interface to the application that needs a textprocessing service.
3 weekly downloads -
Wiki2TEI Convert wiki pages to TEI.
3 weekly downloads -
Markout Markout is a pure-Java lightweight wiki markup parser based on John Gruber's Markdown.
2 weekly downloads -
Notepage A +featured text editor based on Java.
2 weekly downloads -
RCodeLeveler A Ruby file parser/interpreter/preprocessor that comments lines of code based on conditions at the time the file is required. Very handy to implement debugging logs and code that has to be commented (not just dynamically switched off).
2 weekly downloads -
jtr Java library that emulates the Perl 5 "transliterate" operation on a given string. Most Perl 5 features are supported, including all the standard modifiers and most Perl escape sequences. Patterns are compiled for speed, and runtime performance is fast.
2 weekly downloads -
TexBeans a set LaTex plugins for Netbeans with full project management (multiple files allowed), editor, code completion (Ctrl-Space), build and view support (latex, bibtex and xdvi, linux), code injection (Alt-Enter), spellcheck, error and warning handling ....
1 weekly downloads -
D2AC-A2DC From March 2011, this project has moved to: rcrl.sourceforge.net
1 weekly downloads -
DocBook Publishing Utilities The DocBook Publishing Utilities tools, which make creation and publishing of DocBook easier. The tools are: Maven plug-in to Transform HTML into XML (use after docbkx); Eclipse DocBook table editor; Eclipse wizards for initial DocBook files.
1 weekly downloads -
Doco Doco is a simple but feature rich and powerful markup language for converting text documents into highly-presentable and navigable web content.
1 weekly downloads -
Hierarchical Project Tree This tool is designed to help break a project down into smaller and smaller chunks, allowing you to go into fine detail without losing sight of the big picture. Particularly good for certain types of dyslexia.
1 weekly downloads -
JNotePad JNotePad is a very flexible text editor. With lots of modules, cou can create your own user-friendly editor. By choosing only the modules YOU need, you get a very productive editor.
1 weekly downloads -
Java Transliterator translit is a J2EE web application written in Java to execute convertion between different encodings.
1 weekly downloads -
Pyana Pyana is a extension module that allows Python programs to interface with the Apache Software Foundation's Xalan XSLT transformation engine.
1 weekly downloads