HanNanum is a Korean morphological analyzer and POS tagger. The new Java version adopts a plug-in, component-based architecture for flexible use. It provides workflows for morphological analysis, POS tagging, noun extraction, etc.
Contact:
kschoi@kaist.ac.kr
hjjeong@world.kaist.ac.kr
ELIA (Eyegaze Language Integration Analysis) supports the analysis of eye-tracking data for studies in language processing. ELIA eases early analysis of data to enable iterative development of experiments in response to spoken language.
A cross-platform Java program that turns a simple Geiger counter into the equivalent of a state-of-the-art digital model, and then some, using your laptop computer.
Cougar Squared is a new Java library for machine learning and data mining research, supporting research needs of the community. It is written by researchers for researchers. It extends the WEKA and YALE machine learning frameworks.
The system searches for synonyms (and related words) in Wikipedia. WikIDF generates an index database of Wikipedia (for Russian, English, and German). The continuation of this project is "wikokit" at code.google.com.
An executable program that measures sizes and other properties of colonies arrayed in a grid format (intended for 768, 384, or 96 colonies on agar plates) from JPEG images.
The NITE XML Toolkit supports the creation, analysis, and browsing of annotated multimodal, text, or spoken language corpora, and represents both timing and rich linguistic structure. It contains libraries for developers and some end user tools.
A regexp testing tool that applies a group of regexps to huge arrays of data (millions or so) in order to investigate the search or search/replace possibilities of the regexp group.
A library for capturing, storing, and visualizing timeseries data.
JTimeSeries has moved to GitHub.
Please go to https://github.com/JTimeSeries/jtimeseries
The SourceForge copy has not been maintained since Sep 2012.
A Java library to assist with capturing and storing timeseries data/metrics. It provides facilities to publish timeseries data across a network, a lightweight server to persist series data, and client user interface components for real-time visualization.
JGraph is the most powerful, lightweight, feature-rich, and thoroughly documented open-source graph component available for Java. See the project homepage at www.jgraph.com for information and downloads.
A lyrical analysis and classification tool focused specifically on rhyming style in rap lyrics. Functions include phonetic transcription, rhyme visualization, and rapper classification.
Ex-Crawler is divided into 3 subprojects (crawler daemon, distributed GUI client, and web search engine), which together provide a flexible and powerful search engine supporting distributed computing. More information: http://ex-crawler.sourceforge.net
Optex Analyzer is software for analyzing and comparing algorithms that approximately solve optimization problems. Its GUI lets you select a set of input files containing raw algorithm results. The analysis is presented in tables and charts.
The PARSEC CEE is the primary achievement of several years of effort at NASA's Marshall Space Flight Center. The CEE was developed to allow engineers in the Advanced Concepts Department to rapidly prototype launch vehicle and spacecraft concepts.
GmanDA is GPL-licensed software for performing qualitative data analysis on mailing lists and mboxes. It was developed to ease work with large-scale mailing-list archives taken from Gmane.org.
The DimReduction project provides an open-source, multiplatform (Java) graphical environment for bioinformatics problems that supports many feature selection algorithms, pattern recognition techniques, criterion functions, and graphic visualization tools.
Blogspread is an open platform for the development of applications that analyze data from websites such as blogs and forums. Blogspread is jointly developed by the University of Mannheim, Germany, the UFPE, Recife, Brazil, and the UFAL, Maceio, Brazil.
SLEDRIDE: Simplified Learning about Expression Data Running in a Desktop Environment. It provides a general workbench for pipelining microarray gene expression data from supervised learning results into unsupervised learning methods.
JMulTi is interactive software designed for univariate and multivariate time series analysis. It has a Java graphical user interface that uses an external engine for statistical computations. It uses the JStatCom framework.