Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Start Free
$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.
Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
It is an application for Bluetooth-enabled mobile phones that allows you to connect to other BluetunA users in range and share music recommendations. Mobile music, metadata sharing, Bluetooth applications, proximity-based interactions, social awareness
The Unstructured Information Management Architecture (UIMA) Framework is an open, scalable and extensible platform for building analytic solutions that process unstructured information to find latent meaning, relationships and relevant facts.
The Genomic Diversity and Phenotype Data Model (GDPDM) captures molecular and phenotypic diversity data. MySQL databases are used to implement the schema. This project develops software tools (written in Java, Perl, etc.) associated with this model.
The Microdata Management Toolkit is a collection of tools for documenting, disseminating and preserving survey and census microdata The project is sponsored by the International Household Survey Network with financial support from the World Bank.
The Word Vector Tool is a simple but flexible Java library to create word vector representations of text documents. Word vectors can be used for various text processing tasks, as text classification, text clustering or information retrieval.
Flesh is a Java application designed to analyze a document (plain text, rich text, Word documents, and PDFs) and display the difficulty associated with comprehending using the Flesch-Kincaid Grade Level and the Flesch Reading Ease Score.
Design and develop Recommendation and Adaptive Prediction Engines to address eCommerce opportunities. Build a portfolio of engines by creating and porting algorithms from multiple disciplines to a usable form. Try to solve NetFlix and other challenges.
Open Source Semantic Web Search Engine Software: If two machines anywhere on the web can agree on the same definition of a digital service or digital good, then machine to machine transactions can use this lingua franca to transact on the users behalf.
FORce based Cluster Editing (FORCE) is a Java software heuristically solving the graph cluster editing problem on weighted edges using BLAST E-values. It further provides a training mode for heuristic parameter estimation.
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
geolocate is a front-end java program that works with google maps to provide dynamic maps to users. Combined with the flexibility of XML and the power of javascript, users can see various relationships on their map to draw conclusions.
JVnSegmenter is a Java-based and open-source Vietnamese word segmentation tool. The segmentation model was trained on about 8,000 sentences using Conditional Random Fields (FlexCRFs). This tool would be useful for Vietnamese NLP community.
K-automaton is a new parsing (syntactic analysis) machine isomorphous to language. Implemented in Java. Can generate Java code from grammars described in EBNF.
Bitnets instantiates and operates on graphs and subgraphs of large complex networks, such as kinship networks. Bitnets consists mainly of a java library, a number of use examples and an interactive interpreted language interface.
Crawl-By-Example runs a crawl, which classifies the processed pages by subjects and finds the best pages according to examples provided by the operator. Crawl-By-Example is a plugin to the Heritrix crawler, and was done as a part of GSoC06 program.
JTextPro: A Java-based Text Processing tool that includes sentence boundary detection (using maximum entropy classifier), word tokenization (following Penn conventions), part-of-speech tagging (using CRFTagger), and phrase chunking (using CRFChunker).
Qualiweb aims at providing semantic web metrics for modeling a website visitors needs according to a given taxonomy or document classification. Web metrics provided by Qualiweb give an indication of how successful each of the website topics have been.
A web-based repository for UIMA-compliant information analysis components, with a web-based interface for humans and a plugin interface for IDEs. More information is available at this project's website.
vyasa is a digital library application that incorporates the functions of digital asset and document management systems. It facilitates information retrieval and knowledge discovery by providing comprehensive metadata generation and semantic analysis.
RunCC is a new kind of parsergenerator that generates parsers and lexers at runtime. Source generation is only optional. It features the absence of any cryptography. Although intended for small languages, it comes with Java and XML example parsers.
A complete survey administration and data collection system. A fully featured replacement for Quancept, supporting CAPI, Web, CATI, PDA and Paper survey modes. Sonar is the reference implementation of JCaiF for CAPI and Web survey interviewing. Try it!
Azureus Plug-In that allocates the ip adresses of the peers to the country and the city they belong to and visualize that data on a world map or in statistics. This product includes GeoLite data created by MaxMind, available from http://www.maxmind.com/.
The Text Annotation Environment (tae) can be used to annotate natural language text manually or automatically (UIMA Annotator) with meta information (tokens, part-of-speech, named entities, ...). Tae is based on Eclipse and IBM's UIMA.
CRFChunker: Conditional Random Fields Phrase Chunker (Phrase Chunking Tool) for English. The model was trained on sections 01..24 of WSJ corpus and using section 00 as the development test set (F1-score of 95.77). Chunking speed: 700 sentences/s
CRFTagger: Conditional Random Fields Part-of-Speech (POS) Tagger for English. The model was trained on sections 01..24 of WSJ corpus and using section 00 as the development test set (accuracy of 97.00%). Tagging speed: 500 sentences/s.