Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Start Free
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.
Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
Moara is a biological text mining tool and consists of a Java library and some auxiliary MySQL databases for gene/protein training and extraction of mentions and its further normalization and disambiguation.
.NET wrapper for 7Zip, based on 7-Zip-JBinding, produced with IKVM. Extracts (password protected, multi-part) 7z Zip Rar Tar Split Lzma Iso HFS GZip Cpio BZip2 Z Arj Chm Lhz Cab Nsis Deb Rpm Udf archives. Archive creation/more formats coming soon.
OpenDMAP (Open Source Direct Memory Access Parser) is a natural language processing (text mining) application: a semantic parser for information extraction.
Cairo (Complex Archive Ingest for Repository Objects) is a tool for processing digital archives prior to submitting them to archival storage for long-term preservation; among other features, this includes format identification and metadata extraction.
Web Time Entry is a Web based project time entry and client billing system. It can track, approve, and manage employee time incurred for projects and clients you define. It can schedule projects to employees and track time incurred on project schedul
The Cornell Web Lab Collaboration Server is a suite of tools and services for GUI-based extraction, analysis and sharing of archived web data. See http://weblab.infosci.cornell.edu/ and http://www.cs.cornell.edu/~weigel for details about the project.
JOcrad is a graphical frontend for GNU/Ocrad written in Java.
GNU Ocrad is an OCR (Optical Character Recognition) program based on a feature extraction method.JOcrad supports italian and english languages, JPG,PNG and GIF images.
The Hanalyzer is a tool designed to help biologists explain results observed in genome-scale experiments and to generate new hypotheses. It combines information extraction, semantic data integration, reasoning, and visualization.
User friendly PDB (Protein Data Bank) file editor with graphic user interface for protein crystallographers to expedite selective parallel edit / data extraction / analysis of their PDB files
Scan, the Semantic Content ANnotator, is a semantic pipeline that helps connecting information extraction tools to semantic database. UIMA-based, it allows easy plugin-writing: information extraction, ontology control, store in RDF Repositories.
T-Rex (Trainable Relation Extraction) is a highly configurable machine learning-based Information Extraction from Text framework, which includes tools for document classification, entity extraction and relation extraction.
Extraction of binding sites of ligands and search for unknown binding sites on apoproteins. Some call it docking. Some call it binding site detections. Some call it knowledge based.
MutationFinder is a biomedical natural language processing (NLP) system for extracting mentions of point mutations from free text. MutationFinder achieves high performance (99% precision, 81% recall on blind test data) as an information extraction system
jumbles (Java Unified Metadata Basic Library for Extracting and Storing) is a library that enables the extraction and storing of multimedia metadata. Currently wraps "jaudiotagger" (MP3 ID3 tags) and "metadata extractor" (EXIF, et al.).
OMEN (On-demand Metadata Extraction Network) is a tomcat based system that allows external users (music information researchers) to request from participating libraries the extraction of features from music archives without violating copyright.
JWebPro: A Java tool that can interact with Google search and then process the returned Web documents in a couple of ways. The outputs can serve as inputs for NLP, IR, infor extraction, Web mining, online social network extraction/analysis applications.
<IMG SRC="http://sourceforge.net/dbimage.php?id=3181" width="256" height="34"><BR>Java Extraction Transformation Service for Transmitting Records & Exchanging Application Metadata: a Java-based ETL/EAI tool.
This project aims to distribute a facial animation system with speech, developed to brazilian portuguese case. This system is composed by many modules: movement extraction, facial animation and speech, through a text-to-speech system.
StringExtractor replaces String literals with the usage of an a configurable JAVA ResourceBundle (see. current jdk api docs at java.sun.com). This works with unlimited number of files at a time. Extraction is done due to a semimanual selection.