Parser generator to read, process, or translate structured text
Stanford CoreNLP, a Java suite of core NLP tools
One hundred command-line tools in a small and portable binary.
ANT4DOCBOOK is an Ant task for DocBook
GNU sed with PCRE2 regular expressions
XML text markup for ancient documents
EBook Generation Tools - scripts to create ebook formats EPUB, DOC
An editor for structured documents
A collection of indexing and search tools for corpus linguists
This project is a quick way of applying macros to a portion of text.
Simple SQL-like syntax on top of Perl text processing
PDF Library for Developers
pykte is a simple text editor with Unicode support.
A command-line tool to extract, transform, and get metadata for ISBNs