The Parenthesis Classifier takes the contents of a set of parentheses and classifies it into one of several categories. It includes a parenthesized-data extractor and the classifier.
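The extraction step can be illustrated with a minimal sketch. This is not the project's code: the function name `extract_parenthesized` is hypothetical, and the sketch assumes only top-level parenthesized spans are wanted, with nested parentheses kept inside their enclosing span. The classification step is not described in the blurb and is not attempted here.

```python
def extract_parenthesized(text):
    """Return the contents of each top-level parenthesis pair, in order.

    Hypothetical helper for illustration; unbalanced parentheses are ignored.
    """
    spans = []
    depth = 0
    start = None
    for i, ch in enumerate(text):
        if ch == "(":
            if depth == 0:
                start = i + 1  # content begins after the opening parenthesis
            depth += 1
        elif ch == ")" and depth > 0:
            depth -= 1
            if depth == 0:
                spans.append(text[start:i])
    return spans


if __name__ == "__main__":
    print(extract_parenthesized("BRCA1 (breast cancer 1) binds p53 (see (Fig. 2))"))
```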
HanNanum is a Korean morphological analyzer and POS tagger. The new Java version adopts a plug-in, component-based architecture for flexible use. Workflows are provided for morphological analysis, POS tagging, noun extraction, etc.
Contact:
kschoi@kaist.ac.kr
hjjeong@world.kaist.ac.kr
jWords is a Java port of WORDS, a free Latin-to-English dictionary program written in Ada by William Whitaker. In addition, the dictionary will be translated into German.
The Simple Semantic Classifier classifies short chunks of natural language text into broad semantic classes that correspond to the OBO ontologies provided as input.
A free Spanish-English translator for Linux. It translates a phrase (via the Internet) or a single word (using a built-in dictionary). It can learn new words and is smart enough to recognize plural and feminine forms. Written in Python/GTK and released under the GPL.
CORPSE (CORPus SEarch) is a powerful search engine written in Java. The aim is to provide an efficient implementation of a word level inverted index search with various cool functions that can be used on very large corpora.
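For readers unfamiliar with the term, a word-level inverted index maps each word to the documents that contain it. The following is a minimal sketch of that general technique, not CORPSE's implementation; the names `build_index` and `search`, the whitespace tokenization, and the sample documents are all assumptions made for illustration.

```python
from collections import defaultdict


def build_index(docs):
    """Map each lowercased word to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in text.lower().split():  # naive whitespace tokenization
            index[word].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every word in the query."""
    words = query.lower().split()
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


if __name__ == "__main__":
    docs = {1: "the cat sat", 2: "the dog sat", 3: "a cat ran"}
    index = build_index(docs)
    print(sorted(search(index, "cat")))
    print(sorted(search(index, "the sat")))
```

A real engine for very large corpora would also store word positions and keep the index on disk; this sketch keeps everything in memory for brevity.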
Java program to create a (potentially multilingual) glossary of the unique words in any given Lojban text.
Note that the SourceForge page for this project has been superseded by the Bitbucket repository: https://bitbucket.org/pretoriusjf/vlastezba/overview
Any further updates will be made there.
A linguistic tool to aid in the study of linguistics/phonology, specifically the distinctive features of possible language sounds. It comprises both a Visual C++ .NET version and a Java-based web applet version. The C++ version has all but been ab
LexBase is a configurable lexical database manager. It reads lexical and semantic information from WordNet, allows flexible querying of the database, and supports programmatic addition and deletion of terms, word senses, and relations.
BD-1 is a configurable database manager designed to provide efficient search and natural representations of annotated text, storing key-value pairs, triples, or n-tuples of text or binary data. It runs memory-resident or from disk.
Migrated to GitHub at: https://github.com/christiangda/es-ve
The best option for checking and correcting the grammar of your LibreOffice documents written in Spanish. The extension includes a spell checker, a thesaurus of synonyms, and hyphenation.
Made in Venezuela!
Its main features are:
More than 87,000 lemmas and their respective conjugations.
Contains the updated word list (Lemario) of the RAE (Real Academia Española).
Financial and computing terms...
NooJ is used by linguists to describe linguistic phenomena and to apply formalized morphological, syntactic, or semantic rules to corpora. It is also used by non-linguists in fields such as psychology, sociology, history, and literary studies.
PyAnnotation is a Python Library to access and manipulate linguistically annotated corpus files. Supported file formats are Kura XML, Elan XML and Toolbox files. A Corpus Reader API is provided to support statistical analysis within the NLTK.
WordNetLMF converts WordNet (http://wordnet.princeton.edu/) lexicographer files into KYOTO-LMF, the LMF dialect used in the KYOTO project (http://www.kyoto-project.eu/).
Editor for formal grammars. Attempts to be universal: customizable for any grammatical formalism and any syntax. Provides features such as syntax checking and highlighting, transformations (refactoring), and an advanced rule editor.
Vtgrep stands for Visual Tree Grep and is a GUI for tgrep. It allows the user to build graphical representations of tree structures and then translates them into tgrep syntax. It provides search functionality, as well as search and result logging.
CIDIAN is a very simple offline Chinese-English dictionary written in Gambas2. Look up any character or an entire text. Almost 100,000 entries. Based on the CC-CEDICT project.
Coptic-English and Coptic-Czech dictionary related to Crum's Coptic dictionary, written in C++, based on MySQL, with a Qt GUI. It is developed as part of the Marcion project and contains only the Coptic data, without the study environment.