The free computer aided translation (CAT) tool for professionals
OmegaT is a free and open source multiplatform Computer Assisted Translation tool with fuzzy matching, translation memory, keyword search, glossaries, and translation leveraging into updated projects.
This application lets users download chapters from a website in three ways:
- from the table of contents;
- from a range: first chapter address, last chapter address;
- by crawling from the first chapter to the n-th.
In the settings you can choose the interface language and the input (website) encoding; for simplicity, the output uses the same encoding. To add your own language, add a new class to the strings package and new fields to the Settings class and the GUI menu (initialize method).
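A minimal sketch of what adding a language might look like, assuming the strings package holds one class per language behind a common interface. Every name here (Strings, EnglishStrings, GermanStrings, the fields and methods of Settings) is hypothetical and chosen only for illustration; the real project's classes may be organized differently.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;

class LanguageSketch {

    /** Hypothetical contract shared by every language class in the strings package. */
    interface Strings {
        String downloadButton();
        String settingsTitle();
    }

    /** Existing language (assumed). */
    static class EnglishStrings implements Strings {
        public String downloadButton() { return "Download"; }
        public String settingsTitle() { return "Settings"; }
    }

    /** The new class you would add for your own language. */
    static class GermanStrings implements Strings {
        public String downloadButton() { return "Herunterladen"; }
        public String settingsTitle() { return "Einstellungen"; }
    }

    /** Hypothetical stand-in for the Settings class: one new entry per language. */
    static class Settings {
        private final Map<String, Strings> languages = new LinkedHashMap<>();
        private String current = "en";

        Settings() {
            languages.put("en", new EnglishStrings());
            languages.put("de", new GermanStrings()); // entry added for the new language
        }

        /** A GUI menu's initialize method could build one item per code returned here. */
        Set<String> availableLanguages() {
            return languages.keySet();
        }

        void setLanguage(String code) {
            if (!languages.containsKey(code)) {
                throw new IllegalArgumentException("Unknown language: " + code);
            }
            current = code;
        }

        Strings strings() {
            return languages.get(current);
        }
    }

    public static void main(String[] args) {
        Settings settings = new Settings();
        for (String code : settings.availableLanguages()) {
            settings.setLanguage(code);
            System.out.println(code + ": " + settings.strings().downloadButton());
        }
    }
}
```

Keeping the language classes in a map means the menu does not need to change again when another language is registered; that is a design choice of this sketch, not something the project description states.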
Analyze text. Skim-read by subject, predicate, and object. Search other PDFs.
...
- Divide plain text: subject, predicate, object.
- Count words: stemming.
- Search for similar content: PDFs.
Outputs the subject, predicate, and object of sentences in PDF and plain-text files. Provides a comfortable GUI. Automatic language detection.
iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK.
With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more.
The latest...
JPDF Tools is a Java GUI program built on the JPDF Export library. Its main aim is to create PDF files by inserting text, images, or tables.
Users can also merge PDF files, split PDF files, merge images into PDF files, and will soon be able to convert to and from PDF files.
JCopist is a template-based document generation server based on OpenOffice.org.
Its templates are regular OpenDocuments enhanced with the FreeMarker scripting language.
A wide range of formats is available, e.g. ODT, PDF, RTF, HTML, MS Word, and MS Excel.
A free, handy text editor for both plain-text and formatted-text files, with printing support. Since release 1.2.0 it includes a tool for converting to PDF. It is written in Java, so it runs on many operating systems.
Tubaina is a book generator. Given a text written in afc, a markup language, it generates HTML or PDF output. This project has moved to GitHub: http://github.com/caelum/tubaina
JSESOFT-DB2PDF provides a converter from a limited (but expanding) subset
of DocBook to PDF. The transformation from DocBook is done via iText directly
to PDF. Priority is given to predictability and stability rather than to
completeness.