IMPORTANT NOTICE: AS OF 2016-03-14, ITEXT DOWNLOADS AND SOURCE CODE WILL ONLY BE AVAILABLE ON GITHUB: http://github.com/itext iText is a PDF library that allows you to CREATE, ADAPT, INSPECT and MAINTAIN documents in the Portable Document Format (PDF): - Generate documents and reports based on data from an XML file or a database - Create maps and books, exploiting numerous interactive features available in PDF - Add bookmarks, page numbers, watermarks, and other features to existing PDF documents - Split or concatenate pages from existing PDF files - Fill out interactive forms - Serve dynamically generated or manipulated PDF documents to a web browser iText is used by Java, .NET, Android and GAE developers to enhance their applications with PDF functionality. Several iText engineers are actively supporting the project on StackOverflow: http://stackoverflow.com/questions/tagged/itext
Utilities for general- and special-purpose documentation. Includes reStructuredText, the easy to read, easy to use, what-you-see-is-what-you-get plaintext markup language.
Diffuse is a graphical tool for comparing and merging text files. It can retrieve files for comparison from Bazaar, CVS, Darcs, Git, Mercurial, Monotone, RCS, Subversion, and SVK repositories.
A Swiss Army Knife GUI application for PDF documents: combine, split, rotate, reorder (n-up, booklet), watermark, edit bookmarks/fileinfo/pagetransition, compress, encrypt, decrypt, sign, repair, edit attachments and more.
Pdftohtml is a tool based on the Xpdf package which translates pdf documents into html format.
General-Purpose PDF Library for Java and .NET
PDF Clown is a general-purpose Java and .NET library for manipulating PDF files through multiple abstraction layers, rigorously adhering to PDF 1.7 specification (ISO 32000-1). This project aims to provide a universal access to PDF files (creation, reading, editing, rendering...) through an accurate and elegant object-oriented API. * Features: http://pdfclown.org/overview/features/ * Overview: http://pdfclown.org/overview/architecture/ * Website: http://pdfclown.org/ * Blog: http://www.pdfclown.org/blog/ * Twitter: https://twitter.com/PDFClown
Vim-like editing in Eclipse
Vrapper is an eclipse plugin which acts as a wrapper for existing eclipse text editors to provide a Vim-like input scheme for moving around and editing text. Eclipse Update Site: http://vrapper.sourceforge.net/update-site/stable
Colorer provides source text syntax highlighting services. It colorizes source codes in editor systems (more than 200 syntaxes). Uses powerful HRC format(XML, RE, context free grammas), allowing to support any language. Available as Eclipse plugin.
The XSD editor is a cross-platform XML editor. Although it can be used to edit any type of XML file, the editor is specifically designed to allow easy creation, editing, and validation of XML Schema (XSD) files.
XML Differencing and Patching tools. XML based tools to mimic the functionality of traditional line based diff and patch utils, except operating on the hierarchical structure of XML.
Third-party plugins for OmegaT (https://sourceforge.net/projects/omegat)
A translation tools suite for Computer-Aided Translation / Computer-Assisted Translation (CAT). A translation processor with translation memory, machine translation and project support, bitext aligner/converter, TMX validator, and others.
Camomile is a Unicode library for ocaml. Camomile provides Unicode character type, UTF-8, UTF-16, UTF-32 strings, conversion to/from about 200 encodings, collation and locale-sensitive case mappings, and more.
Diff-ext is an extension for filemanagers such as Windows Explorer and Nautilus that allows to launch diff/merge tools on selected files.
regexxer is a nifty GUI search/replace tool featuring Perl-style regular expressions. If you need project-wide substitution and you're tired of hacking sed command lines together, then you should definitely give regexxer a try.
A free computer-aided translation / computer-assisted translation (CAT) tool to align and converter bitext into TMX translation memory format to be used in other CAT tools by translators and other language professionals.
Early Access iText, a PDF generation library in Java
PyRTF is a pure python module for the efficient creation of RTF documents.
(XSLT transformer/editor) A text editor that allows the loading and editing of an XML document and an XSLT document at the same time. It also can apply the XSLT to the XML and display the output for further editing/saving. Plugable XML and XSLT parsers
Queequeg is an English grammar checker for non-native English speakers.
XSL Transformation stylesheets. These stylesheets output LaTeX code from a DocBook (c) XML document. A recent version of LaTeX2e is required.
XML text markup for ancient documents
The EpiDoc Collaborative is developing specifications and tools for standards-based, digital publication and interchange of scholarly and educational editions of documentary and literary texts like inscriptions and papyri. The link below will take you to the EpiDoc home page on this site.
Download websites as e-book: pdf, txt, epub.
This application allows user to download chapters from website in 3 ways: - from table of contents; - from range: first chapter address, last chapter address; - by crawling from first chapter to n; In settings you can customize language, input(website encoding) for simplicity output is in the same encoding. If you want your language add new class into strings package, and new fields into Settings class and GUI menu(initialize method).
LaTeX package for geophysical publications
a collection of indexing and search tools for corpus linguists
DWDS/Dialing Concordance (DDC) - a collection of index and search tools for corpus linguists