creole/c is a Wiki Creole parser and a HTML converter. It implements Wiki Creole 1.0 and almost all of its additions. The parser is written in C++ and has a simple event driven plain C API. The converter is a stand-alone console application.
The Scheme Natural Language Toolkit (S-NLTK) is a Scheme R6RS library for language and textprocessing, and various tasks related to symbolic and statistical analysis of language data.
This project is a compilation of tools/libraries to help with tasks related to Text Analytics mainly in Java. These tools range from simple wrappers to sophisticated mining tasks that can improve the productivity of researchers and engineers.
BBLib - The fast and easy library written on PHP. The library makes fast parsing and replacement BB-Tags. It is really easy to customise and add own tags. Supports simple-closing and autolinks. Smiles cooming soon...
Why is it quicker to express simple equations in plaintext that it is to write the equivalent LaTeX? Easylatex is a preprocessor to make writing LaTeX much quicker. Project activity mode (http://bayleshanks.com/pamv1 ): sporadic
jSmaTeP - JAVA Smart Text Processor Library for processing import and export data by configuring data structure and generating a value object representation for a described record. ()
Sanchay is a collection of tools and APIs for language researchers. It has some implementations of NLP algorithms, some flexible APIs, several user friendly annotation interfaces and Sanchay Query Language for language resources.
OpenDMAP (Open Source Direct Memory Access Parser) is a natural language processing (text mining) application: a semantic parser for information extraction.
HTMLtools includes several Java HTML tools for preparing Web pages. The HTMLtools program automates batch conversion of tab-delimited spreadsheet text files to HTML Web-page files, file & table editing, keyword mapping, templates, and more.
KeyBindings is a plugin for gedit written in python. It lets you use (and eventually define) key combinations for usual actions such as switching tabs, closing tabs, etc.
PDML is an informal markup language written in PHP that is similar to HTML. It allows for the creation of complex PDF documents and can also be used in conjunction with PHP, to define templates which can generate dynamic PDF documents.
Provide a robust and efficient implementation of n-gram based classifiers to Java. N-Gram algorithms have shown to be surprisingly good at tasks like guessing the language/encoding from an arbitrary text file. And there are many more applications.
Command line spell checking script that checks single word. User doesn't have to open file or word processor just to check single word. Simply type: "wordcheck <word>" where <word> is the word you want to check. Uses GNU Aspell.
Shared Questionnaire System(SQS) is a full-functional Optical Mark Reader(OMR) form processing system implemented in Java-Swing, XSL-FO and AJAX with straightforward GUIs. It is aimed at developing social platform to share knowledge about questionnaire.
Simple text editor for editing multiple language files in parallel. Corresponding translations are automatically highlighted in order to aid reviewing against multiple sources. Supports gettext PO and key = value format. Highly recommended for polyglots.
This project'll be the core engine of many voice based platforms,which can be implemented into your projects,websites...etc to provide an Arabic speech service, where your servers can interact with the clients through Arabic Speech Recognition.