One platform to build, fine-tune, and deploy ML models. No MLOps team required.
Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
Try Free
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
ASTL Automata Standard Template Library (Vincent Le Maout - Dominique Revuz) is a set of generic and efficient C++ components for automata manipulation.
SCons is a make replacement providing a range of enhanced features such as automated dependency generation and built in compilation cache support. SCons rule sets are Python scripts so as well as the features it provides itself SCons allows you to use the full power of Python to control compilation.
This is a SCons extension (tool) which enables usage of the dvipdfm program to convert dvi files to pdf.
Transliterator between any Language files - Map Fonts, Create Encoding Scheme, Input Phonetic, Indian, Roman, Tamil, Hindi, English, French, German, Spanish or Any World Language Keyboard. Ex: [Phonetic Input]-[Any World Language Output] or ViceVersa.
Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.
Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
Redundancy due to cut-paste operations in text creates bias in machine learning for NLP.
This module takes a directory and produces a subset of the files in that directory (in a list) with an upper bound on similarity between two files.
OmniHelp is a cross-platform, browser-independent, tri-pane help viewer built in pure JavaScript and CSS with HTML 4. Some functions (such as help embedding) may in the future be in Java, C, or C++; CSH is fully supported. All code is under the LGPL.
PTools is a set of useful tools written in Pascal. It includes: scientific calculator, archiver, text editor, remote adminitration and more. It is designed to be portable across operating systems, specially Java-based mobiles, Windows and Unixes.
LM - Report Asistent will be an utility, that helps dataminers, who use LISP-Miner, with writing analytical repostrs. Report Asisten will connect LISP-Miner with MS Word (R) editor.
Mavscript ermöglicht es in einem Textdokument Berechnungen durchzuführen. Die eigentliche Berechnung verarbeitet das Algebraprogramm Yacas (oder der Java-Interpreter BeanShell).
SemaRule Navigator is an Integrated Suite of Open-Source and Free-License Software, placing Semantic and Text Analysis Technologies in the toolbox of Researchers, Students, and Enterprises.
This is a Java-based project for complex event extraction from text and co-reference resolution. Currently the code can read BioNLP shared task format (http://2011.bionlp-st.org/) and i2b2 Natural Language Processing for Clinical Data shared task format (https://www.i2b2.org/NLP/DataSets/Main.php). Event extraction includes finding events and the parameters for an event in a text.
The method is based on SVM but other ML algorithms can be adopted.
Apolda is a plugin for the Gate framework (see http://sourceforge.net/projects/gate/) that annotates texts with labels of concepts from an arbitrary OWL-ontology.
TextMarker is now developed and hosted at Apache UIMA (http://uima.apache.org/textmarker.html). TextMarker is a UIMA-based tool for information extraction and more. The full featured editor of the rule language and the build process of UIMA descriptors are complemented with components for visualization, explanation, testing and rule learning.
transpose is a fast and efficient command-line tool that transposes (the rows become the columns, the columns become the rows) or reshapes a matrix in plain text. Inputs and outputs can be files or streams.
A lyrical analysis and classification tool focused specifically on rhyming style in rap lyrics. Functions include phonetic transcription, rhyme visualization, and rapper classification.
Java Data Processing Project aims to manipulate text file containing scientific data such as time/intensity scan or spectrum. Jdpp provide large flexibility to read all possible time duration and date format. It is originally design to treat MS data.
A C (and JavaScript) library providing predictive text functions. The API is very simple and provides dictionary autocomplete and partial/full matching. Sample cellphone-like examples are included.
CRF decoder is the simplified version of CRF++, only for decoding the sequential data. It removes the training component and its correspondent codes from CRF++, which makes CRF decoder more reabable and understandable for freshman.
The Scheme Natural Language Toolkit (S-NLTK) is a Scheme R6RS library for language and textprocessing, and various tasks related to symbolic and statistical analysis of language data.
This project is a compilation of tools/libraries to help with tasks related to Text Analytics mainly in Java. These tools range from simple wrappers to sophisticated mining tasks that can improve the productivity of researchers and engineers.
Why is it quicker to express simple equations in plaintext that it is to write the equivalent LaTeX? Easylatex is a preprocessor to make writing LaTeX much quicker. Project activity mode (http://bayleshanks.com/pamv1 ): sporadic
Sanchay is a collection of tools and APIs for language researchers. It has some implementations of NLP algorithms, some flexible APIs, several user friendly annotation interfaces and Sanchay Query Language for language resources.