Text Processing
-
ADAPRO Free alternative to Crick Software's Clicker
167 weekly downloads -
ASTL Automata Standard Template Library ASTL Automata Standard Template Library (Vincent Le Maout - Dominique Revuz) is a set of generic and efficient C++ components for automata manipulation.
5 weekly downloads -
Anaphraseus Anaphraseus is a CAT (Computer Aided Translation) tool, an OpenOffice.org 2-3 macro set similar to the well-known Wordfast. It works with the Wordfast Translation Memory format (*.TXT) and supports text segmentation.
136 weekly downloads -
Apolda Apolda is a plugin for the Gate framework (see http://sourceforge.net/projects/gate/) that annotates texts with labels of concepts from an arbitrary OWL-ontology.
1 weekly downloads -
CKEditor FCKeditor (retired)
1,856 weekly downloads -
CSV-O-MATIC A Python script that uses wxWidgets. View or edit delimited data.
3 weekly downloads -
Colorer Library Colorer provides source text syntax highlighting services. It colorizes source code in editor systems (more than 200 syntaxes). It uses the powerful HRC format (XML, RE, context-free grammars), which allows it to support any language. Available as an Eclipse plugin.
410 weekly downloads -
DJVUEd GUI for the djvused program from the DjVuLibre package. Can help avoid some mistakes when editing annotations, outlines, and hidden text, and when naming pages.
3 weekly downloads -
Diffuse Diffuse is a graphical tool for comparing and merging text files. It can retrieve files for comparison from Bazaar, CVS, Darcs, Git, Mercurial, Monotone, RCS, Subversion, and SVK repositories.
666 weekly downloads -
DocBook Publishing Utilities The DocBook Publishing Utilities are tools that make creating and publishing DocBook easier. The tools are: a Maven plug-in to transform HTML into XML (use after docbkx); an Eclipse DocBook table editor; and Eclipse wizards for initial DocBook files.
1 weekly downloads -
ElixirFM Functional Arabic Morphology
10 weekly downloads -
EpiDoc: Epigraphic Documents in TEI XML XML text markup for ancient documents
6 weekly downloads -
FAR - Find And Replace Search and replace operations on file content across multiple files. Recursive operations within entire directory trees. FAR comes with support for regular expressions (regex) over multiple lines, automatic backup and various character encodings.
294 weekly downloads -
GATE GATE (General Architecture for Text Engineering) is an architecture, framework and development environment for developing, evaluating and embedding Human Language Technology. See http://gate.ac.uk for full details.
597 weekly downloads -
Galician dictionaries This project offers Galician dictionaries for several spell checkers: "Ispell", "Myspell", "Aspell", "Spell Checker for Edit Boxes" and "Excalibur".
3 weekly downloads -
Guiguts Guiguts is a Perl/Tk text editor designed for editing and formatting public domain material for inclusion at Project Gutenberg (www.gutenberg.org). Features are provided for editing text files produced by Distributed Proofreaders (www.pgdp.net). For help or to contact the developers, see http://www.pgdp.net/phpBB2/viewtopic.php?t=46944
14 weekly downloads -
HISPACEDIC Downloadable and open source Chinese-Spanish vocabulary inspired by the CEDICT and EDICT dictionaries. It is distributed in a plain Unicode text file that can be easily ported to other formats or used by different applications.
2 weekly downloads -
JEncConv Encoding Converter Convert the encoding of text files, e.g. subtitles. Detect valid encodings. Preview (in different encodings) before converting. Customizable error behavior (fail, replace, ignore). Use plugins to change the text (and easily write new ones).
6 weekly downloads -
JGloss Add readings and translations to Japanese text
2 weekly downloads -
JSesh JSesh is an ancient Egyptian hieroglyphic text processor, currently used by professionals and amateurs alike. It runs on all platforms supporting Java (Mac, Windows, Linux). It can also be used as a library in your own software.
170 weekly downloads -
Java Dict API A 100% Java client for the DICT protocol (RFC2229). This provides access to lexicons, translating dictionaries, thesauri and similar databases over a TCP/IP protocol.
3 weekly downloads -
Jed Modes Repository A collection of S-Lang extension scripts (modes) for the Jed text editor, contributed by Jed users. Browse the repository at http://jedmodes.sf.net/
1 weekly downloads -
Jeddy An editor for Java source files which includes support for Unicode and autoformatting.
1 weekly downloads -
Jericho HTML Parser Jericho HTML Parser is a Java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
198 weekly downloads -
KOMA-Script3 KOMA-Script3 became KOMA-Script with additional features and a new optional user interface. KOMA-Script is a versatile bundle of LaTeX classes and packages. It is available from CTAN, and information can be found at The TeX Catalogue.
301 weekly downloads