Filecmp is a command-line application that gets two filenames as argument and outputs the comparison between them - e.g. if they are the same or not... it may look irrelevant but sometime it's very useful, specially inside scripts.
A collection of open source libraries and tools that provide solutions for common problems in processing Arabic text, especially in web applications. text normalization, phrase segmentation, text indexing, stop word lists, common spelling mistakes.
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.
You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
Track changes in LaTeX documents. The goal is to provide editing facilities as known from word processors like Microsoft Word or OpenOffice Writer for LaTeX. The project comprises a LaTeX package and additional software to accept/reject changes etc.
xml2txt is a text formatter for XMl in the same way the FO is a PDF formatter. It uses python to convert an XML document to well-formatted text, wtih borders, indents, and tables.
NiMC provides an Instant Messaging server and client that runs on a custom protocol that is implemented in python. The project was written as a learning experiment by its author, to get experience with TCP/IP. Great example of how to do TCP/IP wrong
TransHelp is designed to assist in checking the consistency of Chinese-English translations in a translation project. It is written in php and python. It is especially useful in collaborative translation projects.
Java library for reading and writing of flat files. CSV, FLR (fixed length record) or mixed structures. Tree-style processing API. Adapters for SAX, Stax and XStream for transformation, data binding or serialization.
LaTeX-Mk is a collection of makefile fragments for managing small to large LaTeX
based documentation projects. The idea is that especially large documents, there may be many many steps required to typeset the document (export modified figures to postscr
A software for creative writers with some new aspects. Scene/Strand based, interactive story-devolepment in an intuitive way. Database oriented on creative writing technics. Full editor integration. Statistics. -> And a beta-reader client!
Requirement Heap is a web based requirement management /business analysis application. It allows to enter requirement in rich text, supports versioning and the management of requirements. It also handles use cases, interviews and test cases. It allows multiple projects. Stakeholders and glossaries can be handled per project or globally.
...It suports automatica detection of next engines to be installed
- cuneiform with its languages
- tesseract with language database files
- gocr
Supports
- adding custom engines
- bach processing of images
- text postprocessing
TextConverter is a graphical text editor allowing the user to encrypt/decrypt the textual contents displayed on the screen using a 128-bit AES (Advanced Encryption Standard) cipher.
Provides a set of tools for processingtext, such as text extraction and classification. Classification implementations to be implemented include: Bayesian and Statistical (N-gram).
A small Java application that helps write texts in most languages on any keyboard that supports typing at least all ASCII characters. A systray application for Windows (written in C#) is available as well.
The XSD editor is a cross-platform XML editor. Although it can be used to edit any type of XML file, the editor is specifically designed to allow easy creation, editing, and validation of XML Schema (XSD) files.
Infocard Organizer is a Java application/outliner that enables easy editing of infocards (InfoML). InfoML is a XML standard for storing and organizing "chunks" of information, along with metadata. It meets most users' needs but can also be customi
iTeXMac is an integrated software which components are:
- a text editor dedicated to D.E. Knuth's TeX typesetting system
- a frontend to a teTeX like distribution, that includes TeX engines and tools
- a PDF viewer
An Artificial Intelligence based software written in Java, deployed as an EJB / WebService application and implementing neural networks for data processing. It aims to be the brain of the web by serving text classification, mood detectors, etc.