MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
Start Free
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.
You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
Web-as-corpus tools in Java.
* Simple Crawler (and also integration with Nutch and Heritrix)
* HTML cleaner to remove boiler plate code
* Language recognition
* Corpus builder
This project is a compilation of tools/libraries to help with tasks related to Text Analytics mainly in Java. These tools range from simple wrappers to sophisticated mining tasks that can improve the productivity of researchers and engineers.
JUNG provides a common and extendible language for the modeling, analysis, and visualization of data that can be represented as a graph or network.
New version now available on GitHub: https://github.com/jrtom/jung/releases/tag/jung-2.1
Full access to Enterprise features. No credit card required.
What if You Could Automate 90% of Your Repetitive Tasks in Under 30 Days? At ServoDesk, we help businesses like yours automate operations with AI, allowing you to cut service times in half and increase productivity by 25% - without hiring more staff.
OpenDMAP (Open Source Direct Memory Access Parser) is a natural language processing (text mining) application: a semantic parser for information extraction.
NLP4J library is a toolset written in Java for Natural Language Processing. This version is oriented to Document Classification and uses Naive Bayes, TF-IDF, etc. There are also pre-processing tools.
Thank you for your interest in Speedy Composer. Speedy Composer is an automated application for composing melodies for Speedy Net members. We recently made changes to the source code of Speedy Net, and converted it into the Python language and Django framework. Since Speedy Composer was originally written in PHP, it is not adapted to work with Speedy Net in its current form. So unfortunately we were forced to temporarily close the app Speedy Composer. But don't worry, we kept backups of all...
OpenRLL is a Clojure-based, open-source reimplementation of the interesting bits of Russell Greiner and Douglas Lenat's Representation Language Language.
WebAPSEE is an open source project which aims to provide automated support for software process management. The users list is: http://groups.google.com/group/webapsee-usuarios
Augmented Syntax Diagrams are an alternative to natural language grammar rules expressed as strings. They are also suitable for left-corner bottom-up parsing. Prof. Jim Mason developed an editor (ASDEditor) in Java to create and edit ASD grammars.
Facilitates data mining/natural language processing experiments to be executed on weblogs, such as classification, clustering and rating. As part of these experiments, it is possible to apply Latent Semantic Analysis.
NLPTools-ES is a Spanish plugin for GATE (General Architecture for Text Engineering). It includes a tokenizer, sentence splitter, gazetteer, pos tagger.
A modular language generator based on the theory of Functional Grammar (FG) by Simon C. Dik. Implemented using Java for the user interface, ANTLR for the input format parser and Prolog for the grammar and lexicon module, treating underlying linguistic st
ANNJ, Another Neural Network for Java is a neural network framework for the Java programming language. It is still in an early development stage, currently supporting only feed-forward type networks, but will soon be able to handle many other types.
PDDL4J is an open source library to facilitate java implementation of planners based on the PDDL (Planning Description Language). The library contains a parser on the last version of PDDL 3.0 and an example of graphplan implementation.
A voice recognition application which detect Quran (Islamic Holy book) pronunciation and result in a detection success percentage for each word of the Quran statements, when complete it can be considered a base for Arabic language recognition.
This project aims to develop a parser for Finnish. The parser and the system of grammatical competence assumed herein, are based on the minimalist theory of grammar (Chomsky 1995, 2005) and Bare Lexicon Hypothesis (Brattico, in prep).
proGPUKLT is a library for the Processing programming language and environment that wraps a GPU-implementation of the Kanade-Lucas-Tomasi feature tracker used for computer vision applications.
OO jDREW is an open source deductive reasoning engine for the RuleML web rule language. OO jDREW implements the object oriented extensions to RuleML which include: Order Sorted Types, Slots, and Object identifiers.
QuickAI (pronounced, "quickeye", or just "Quick" for short) is a return to the fundamental goals of creating an artificial intelligence. The priorities are to implement core models of knowledge and knowing, a reasoning engine, and a simple interface.
TBLTools is a set of GATE processing resources that implements the Fast Transformation Based Learning Algorithm. You can train it to learn rules for NLP tasks such as Named Entity Recognition and Shallow parsing.
JWNL is a Java API for accessing the WordNet relational dictionary. WordNet is widely used for developing NLP applications, and a Java API such as JWNL will allow developers to more easily use Java for building NLP applications.