JTextPro: A Java-based text processing tool that includes sentence boundary detection (using a maximum entropy classifier), word tokenization (following Penn conventions), part-of-speech tagging (using CRFTagger), and phrase chunking (using CRFChunker).
Crawl-By-Example runs a crawl that classifies the processed pages by subject and finds the best pages according to examples provided by the operator. Crawl-By-Example is a plugin for the Heritrix crawler and was developed as part of the GSoC06 program.
Qualiweb aims to provide semantic web metrics for modeling a website visitor's needs according to a given taxonomy or document classification. The web metrics provided by Qualiweb indicate how successful each of the website's topics has been.
A web-based repository for UIMA-compliant information analysis components, with a web-based interface for humans and a plugin interface for IDEs. More information is available at this project's website.
vyasa is a digital library application that incorporates the functions of digital asset and document management systems. It facilitates information retrieval and knowledge discovery by providing comprehensive metadata generation and semantic analysis.
Azureus plug-in that maps the IP addresses of peers to the country and city they belong to and visualizes that data on a world map or in statistics. This product includes GeoLite data created by MaxMind, available from http://www.maxmind.com/.
RunCC is a new kind of parser generator that generates parsers and lexers at runtime. Source generation is optional. It features the absence of any cryptography. Although intended for small languages, it comes with Java and XML example parsers.
A complete survey administration and data collection system. A fully featured replacement for Quancept, supporting CAPI, Web, CATI, PDA and Paper survey modes. Sonar is the reference implementation of JCaiF for CAPI and Web survey interviewing. Try it!
BabyTALK aims to add another brick in the wall of natural language learning. The baby needs to structure a corpus of texts as its tutor points to and talks about a particular part of the corpus. The baby should also be able to describe any selected part of the corpus.
The Text Annotation Environment (tae) can be used to annotate natural language text manually or automatically (UIMA Annotator) with meta information (tokens, part-of-speech, named entities, ...). Tae is based on Eclipse and IBM's UIMA.
The Kinship Algebra Modeller is a suite of Java applications that assist in developing an algebra to describe a given kinship terminology, and that support models and simulations of social processes based on relating people using this algebra.
CRFChunker: Conditional Random Fields Phrase Chunker (Phrase Chunking Tool) for English. The model was trained on sections 01..24 of the WSJ corpus, using section 00 as the development test set (F1-score of 95.77). Chunking speed: 700 sentences/s.
CRFTagger: Conditional Random Fields Part-of-Speech (POS) Tagger for English. The model was trained on sections 01..24 of the WSJ corpus, using section 00 as the development test set (accuracy of 97.00%). Tagging speed: 500 sentences/s.
MASQOT-GUI is an open-source, platform-independent application for two-channel microarray spot quality control. Included is a set of tools for gridding, segmentation, quantification, multivariate spot quality assessment and data visualization.
SurveyForge is a survey definition and execution tool for statisticians that runs on the JEE platform, with special emphasis on easy data entry, use of existing standards (Triple-S, Metanet, DDI), and reuse of classifications, whether standard or not.
PROMPT is a system for retrieval, analysis, mapping and comparison of proteins. It allows easy mapping of different types of sequence IDs, data retrieval and integration, a multitude of analysis and comparison algorithms, and a full-blown, easy-to-use GUI.
This project intends to create an indexing search engine for knowledge management. The primary objective is to apply an information retrieval core and to implement knowledge discovery techniques such as data mining and text mining algorithms.
The Cornell Tree-Ring Analysis System. A program for several aspects of dendrochronology: measuring, indexing, crossdating, graphing, summing masters, and even drawing maps of locations of sites.
TopicMapTools allows the creation, manipulation and visualization of XML Topic Maps. It is based on the TM4J framework. The editor can be used in genealogy for the visualisation of e.g. family trees.
A set of software components used to investigate and implement searching of music and audio. This covers content-based retrieval and meta-data based solutions, segmentation and content selection (summarisation) of audio and music (MIDI, MP3 and WAV).