JWebPro: A Java tool that can interact with Google search and then process the returned Web documents in a couple of ways. The outputs can serve as inputs for NLP, IR, information extraction, Web mining, and online social network extraction/analysis applications.
A visualization of the Fourier transform applied to sound data. Analyzes a time-domain sound sample using the Fourier transform, and graphically displays the results and sound in real time. Attempts to convert audio into sheet music.
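As a rough illustration of the analysis step described above, the following sketch computes a naive discrete Fourier transform of a real-valued sample buffer and picks the strongest frequency bin. It is not this project's code: the class name `DftSketch`, its methods, and the synthetic sine input are all hypothetical, and a real-time tool would most likely use an FFT rather than this O(n²) loop.

```java
// Hypothetical sketch: naive DFT of a real-valued signal (not the project's code).
final class DftSketch {

    /** Returns the magnitude of every frequency bin of the input samples. */
    static double[] magnitudes(double[] samples) {
        int n = samples.length;
        double[] mags = new double[n];
        for (int k = 0; k < n; k++) {
            double re = 0.0;
            double im = 0.0;
            for (int t = 0; t < n; t++) {
                double angle = 2.0 * Math.PI * k * t / n;
                re += samples[t] * Math.cos(angle);
                im -= samples[t] * Math.sin(angle);
            }
            mags[k] = Math.hypot(re, im);
        }
        return mags;
    }

    /** Index of the strongest bin below the Nyquist limit, skipping the DC bin 0. */
    static int dominantBin(double[] samples) {
        double[] mags = magnitudes(samples);
        int best = 1;
        for (int k = 2; k < mags.length / 2; k++) {
            if (mags[k] > mags[best]) {
                best = k;
            }
        }
        return best;
    }

    /** Synthetic test input: a sine wave completing `cycles` periods in n samples. */
    static double[] sine(int n, int cycles) {
        double[] out = new double[n];
        for (int t = 0; t < n; t++) {
            out[t] = Math.sin(2.0 * Math.PI * cycles * t / n);
        }
        return out;
    }

    public static void main(String[] args) {
        double[] signal = sine(64, 5);
        System.out.println("dominant bin = " + dominantBin(signal));
    }
}
```

A bin index k corresponds to a frequency of k × sampleRate / n, which is the value a transcription step would then have to map to a musical pitch.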
Java Expert Rule Based Inference Language. Jerbil is an open source rule processing engine written in Java. Currently Jerbil supports a full set of processing functions with text-based and XML interfaces; a Java interface is planned.
Facelift is visualization and analysis software for online social networking services. It displays a given community as a node-link diagram and provides several search and filtering functions as well as cluster analysis features.
BluetunA is an application for Bluetooth-enabled mobile phones that allows you to connect to other BluetunA users in range and share music recommendations. Keywords: mobile music, metadata sharing, Bluetooth applications, proximity-based interactions, social awareness.
The Unstructured Information Management Architecture (UIMA) Framework is an open, scalable and extensible platform for building analytic solutions that process unstructured information to find latent meaning, relationships and relevant facts.
The Genomic Diversity and Phenotype Data Model (GDPDM) captures molecular and phenotypic diversity data. MySQL databases are used to implement the schema. This project develops software tools (written in Java, Perl, etc.) associated with this model.
The Microdata Management Toolkit is a collection of tools for documenting, disseminating and preserving survey and census microdata. The project is sponsored by the International Household Survey Network with financial support from the World Bank.
The Word Vector Tool is a simple but flexible Java library to create word vector representations of text documents. Word vectors can be used for various text processing tasks, such as text classification, text clustering or information retrieval.
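To make the idea of a word vector concrete, here is a minimal sketch that builds raw term-frequency vectors and compares two documents by cosine similarity. It does not use the Word Vector Tool's API: the class `WordVectorSketch`, its method names, and the simple lowercase ASCII tokenizer are assumptions made purely for illustration.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch: term-frequency word vectors (not the Word Vector Tool API).
final class WordVectorSketch {

    /** Maps each lowercase alphanumeric token in the text to its occurrence count. */
    static Map<String, Integer> termFrequencies(String text) {
        Map<String, Integer> tf = new HashMap<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!token.isEmpty()) {
                tf.merge(token, 1, Integer::sum);
            }
        }
        return tf;
    }

    /** Cosine similarity of two sparse vectors; 0.0 if either vector is empty. */
    static double cosine(Map<String, Integer> a, Map<String, Integer> b) {
        double dot = 0.0;
        double normA = 0.0;
        double normB = 0.0;
        for (Map.Entry<String, Integer> e : a.entrySet()) {
            normA += (double) e.getValue() * e.getValue();
            Integer other = b.get(e.getKey());
            if (other != null) {
                dot += (double) e.getValue() * other;
            }
        }
        for (int v : b.values()) {
            normB += (double) v * v;
        }
        if (normA == 0.0 || normB == 0.0) {
            return 0.0;
        }
        return dot / Math.sqrt(normA * normB);
    }

    public static void main(String[] args) {
        Map<String, Integer> d1 = termFrequencies("The cat sat on the mat.");
        Map<String, Integer> d2 = termFrequencies("The dog sat on the log.");
        System.out.println("similarity = " + cosine(d1, d2));
    }
}
```

Classification, clustering and retrieval can all be built on such a similarity measure, usually after weighting the raw counts (for example with TF-IDF) instead of using them directly.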
Flesh is a Java application designed to analyze a document (plain text, rich text, Word documents, and PDFs) and display how difficult it is to comprehend, using the Flesch-Kincaid Grade Level and the Flesch Reading Ease Score.
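For reference, both scores are simple functions of three counts: words, sentences and syllables. The sketch below applies the standard published formulas; the class `ReadabilitySketch` and its method names are hypothetical, and it assumes the counts have already been obtained, since the listing does not say how Flesh tokenizes text or counts syllables.

```java
// Hypothetical sketch of the standard Flesch formulas (not Flesh's own code).
final class ReadabilitySketch {

    /** Flesch Reading Ease: higher scores indicate easier text. */
    static double readingEase(int words, int sentences, int syllables) {
        requirePositive(words, sentences);
        double wordsPerSentence = (double) words / sentences;
        double syllablesPerWord = (double) syllables / words;
        return 206.835 - 1.015 * wordsPerSentence - 84.6 * syllablesPerWord;
    }

    /** Flesch-Kincaid Grade Level: approximate US school grade needed to read the text. */
    static double gradeLevel(int words, int sentences, int syllables) {
        requirePositive(words, sentences);
        double wordsPerSentence = (double) words / sentences;
        double syllablesPerWord = (double) syllables / words;
        return 0.39 * wordsPerSentence + 11.8 * syllablesPerWord - 15.59;
    }

    /** Both formulas divide by these counts, so zero or negative values are rejected. */
    private static void requirePositive(int words, int sentences) {
        if (words <= 0 || sentences <= 0) {
            throw new IllegalArgumentException("words and sentences must be positive");
        }
    }

    public static void main(String[] args) {
        // Example counts chosen for illustration only: 100 words, 5 sentences, 150 syllables.
        System.out.println("reading ease = " + readingEase(100, 5, 150));
        System.out.println("grade level  = " + gradeLevel(100, 5, 150));
    }
}
```

With 100 words, 5 sentences and 150 syllables, the averages are 20 words per sentence and 1.5 syllables per word, giving a Reading Ease of about 59.6 and a Grade Level of about 9.9.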
Design and develop Recommendation and Adaptive Prediction Engines to address eCommerce opportunities. Build a portfolio of engines by creating and porting algorithms from multiple disciplines to a usable form. Try to solve the Netflix challenge and others.
geolocate is a front-end Java program that works with Google Maps to provide dynamic maps to users. Combined with the flexibility of XML and the power of JavaScript, it lets users see various relationships on their map and draw conclusions.
Open Source Semantic Web Search Engine Software: If two machines anywhere on the web can agree on the same definition of a digital service or digital good, then machine-to-machine transactions can use this lingua franca to transact on the user's behalf.
FORce based Cluster Editing (FORCE) is Java software that heuristically solves the graph cluster editing problem on weighted edges using BLAST E-values. It also provides a training mode for heuristic parameter estimation.
JVnSegmenter is a Java-based, open-source Vietnamese word segmentation tool. The segmentation model was trained on about 8,000 sentences using Conditional Random Fields (FlexCRFs). This tool should be useful for the Vietnamese NLP community.
K-automaton is a new parsing (syntactic analysis) machine isomorphous to language. Implemented in Java. Can generate Java code from grammars described in EBNF.
vyasa is a digital library application that incorporates the functions of digital asset and document management systems. It facilitates information retrieval and knowledge discovery by providing comprehensive metadata generation and semantic analysis.
Bitnets instantiates and operates on graphs and subgraphs of large complex networks, such as kinship networks. Bitnets consists mainly of a Java library, a number of usage examples and an interactive interpreted language interface.
Crawl-By-Example runs a crawl that classifies the processed pages by subject and finds the best pages according to examples provided by the operator. Crawl-By-Example is a plugin for the Heritrix crawler and was developed as part of the GSoC06 program.
Qualiweb aims to provide semantic web metrics for modeling a website's visitors' needs according to a given taxonomy or document classification. The web metrics provided by Qualiweb indicate how successful each of the website's topics has been.