The Text Annotation Environment (tae) can be used to annotate natural language text with meta information (tokens, part-of-speech, named entities, ...), either manually or automatically through a UIMA Annotator. Tae is based on Eclipse and IBM's UIMA.
CRFChunker: Conditional Random Fields Phrase Chunker (Phrase Chunking Tool) for English. The model was trained on sections 01..24 of the WSJ corpus, with section 00 used as the development test set (F1-score of 95.77). Chunking speed: 700 sentences/s.
CRFTagger: Conditional Random Fields Part-of-Speech (POS) Tagger for English. The model was trained on sections 01..24 of the WSJ corpus, with section 00 used as the development test set (accuracy of 97.00%). Tagging speed: 500 sentences/s.
The Kinship Algebra Modeller is a suite of Java applications that assist in developing an algebra to describe a given kinship terminology, and that support models and simulations of social processes based on relating people using this algebra.
Equalculator is an equation calculator. It is currently in the "pre-alpha" stage of development, so don't expect it to do miracles. You type in your equation, it asks you for the variables, and voilà, it gives you the answer.
Graph representation of parapsychological (scientific) models and propositions. The graph helps to visualize the relations between the different hypotheses and to estimate the level of confidence of a given hypothesis or model.
A collection of Matlab functions and scripts for computing the saliency map for an image, for determining the extent of a proto-object, and for serially scanning the image with the focus of attention.
Ocular is a spreadsheet written entirely in Python. Cell contents are evaluated by Python after any standard spreadsheet coordinates are parsed. This allows the full Monty from Python to be implemented in a visual environment.
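The evaluation scheme described for Ocular (parse the spreadsheet coordinates, then let Python evaluate the rest) could look roughly like the sketch below. This is not Ocular's code: the `CELL_REF` pattern, the `evaluate` function, and the dict-based sheet are hypothetical names chosen for illustration, and the sketch assumes trusted input because it relies on `eval()`.

```python
import re

# Hypothetical pattern for standard spreadsheet coordinates such as A1 or BC12.
CELL_REF = re.compile(r"\b[A-Z]+[0-9]+\b")

# Names that cell expressions may call (an assumption for this sketch).
ALLOWED_NAMES = {"sum": sum, "min": min, "max": max, "abs": abs}


def evaluate(cell, sheet, _seen=frozenset()):
    """Evaluate `cell` in `sheet`, a dict mapping coordinates to source strings."""
    if cell in _seen:
        raise ValueError(f"circular reference at {cell}")
    source = sheet.get(cell, "0")  # empty cells count as 0 in this sketch

    def substitute(match):
        # Replace a coordinate with the literal value of the referenced cell.
        return repr(evaluate(match.group(0), sheet, _seen | {cell}))

    expression = CELL_REF.sub(substitute, source)
    # eval() executes arbitrary Python, so this is only safe for trusted sheets.
    return eval(expression, {"__builtins__": {}}, ALLOWED_NAMES)


sheet = {
    "A1": "2",
    "A2": "3",
    "B1": "A1 * A2 + 1",
    "C1": "max(A1, B1) ** 2",
}
print(evaluate("B1", sheet))
print(evaluate("C1", sheet))
```

One limitation of this approach is that the coordinate pattern is applied to the whole cell source, so text inside a string literal that happens to look like a coordinate would be substituted as well.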
A complete survey administration and data collection system. A fully featured replacement for Quancept, supporting CAPI, Web, CATI, PDA and Paper survey modes. Sonar is the reference implementation of JCaiF for CAPI and Web survey interviewing. Try it!
PyPotrace is a Python binding for Peter Selinger's Potrace raster-to-vector conversion algorithm. No separate Potrace library or application required!
Narrative is a Mac OS X native plotting framework written with the Cocoa API. It is already capable (see, e.g., www.trade-strategist.com), but will evolve into a complete OO solution to plotting in Cocoa and GNUstep.
MASQOT-GUI is an open-source, platform-independent application for two-channel microarray spot quality control. Included is a set of tools for gridding, segmentation, quantification, multivariate spot quality assessment and data visualization.
Crawls a LiveJournal-based blog hosting site for friends data. LiveJournal is a blog hosting engine that allows its users to list others as friends. This tool can download most of the data about friendship relations between users for later processing.
Benetech Human Rights Program Utility Library: Collection of utilities used by the Benetech Human Rights programming for processing human rights data in preparation for statistical analysis. Library handles DSV files, name canonicalization, binarray, et
SurveyForge is a survey definition and execution tool for statisticians that runs on the JEE platform, with special emphasis on easy data entry, use of existing standards (Triple-S, Metanet, DDI), and reuse of classifications, whether standard or not.
The project aims to make publishing wildlife survey data easier using PHP Mapscript. After uploading survey or sightings data, the user is presented with a generalization map based on the Quarter Degree Grid Cell system.
This project aims to create an indexing search engine for knowledge management. The primary objective is to apply an information retrieval core and to implement knowledge discovery techniques such as data mining and text mining algorithms.