Page 3 | text analysis free download

Java Text Categorizing Library

The Java Text Categorizing Library (JTCL) is a pure java implementation of libTextCat which in turn is "a library that was primarily developed for language guessing, a task on which it is known to perform with near-perfect accuracy."

1 Review

Downloads: 0 This Week

Last Update: 2013-04-08

See Project

LACE (Lucene compatible Analyzer)

LACE means "Lucene Analyzer for CJK (Chinese/Japanese/Korean) & English". It's a simple tokenizer that can handle English-CJK mixed text. Chinese words are handled using a dictionary based method.

Downloads: 0 This Week

Last Update: 2013-04-05

See Project

hypKNOWsys

hypKNOWsys aims at developing a Java-based workbench for knowledge discovery and knowledge management. Currently, hypKNOWsys has released two intermediate tools: DIAsDEM Workbench (text mining for semantic tagging) and WUMprep (Web mining pre-processing)

Downloads: 0 This Week

Last Update: 2013-04-15

See Project

BRUTUS

The UIMA Annotator (called BRUTUS - Business Rules from Unstructured Text and Unstructured Sources) is a component for the UIMA Framework that allows for capturing business knowledge formalized in Structured English syntax (based on OMG's SBVR) with MOF

Downloads: 3 This Week

Last Update: 2013-03-22

See Project

BWPGazetteer

An approximate gazetteer for GATE (General Architecture for Text Engineering), based on Levenshtein's Distance. Strings can be matched and found even in texts with noise and errors. More Info: http://bruno-wp.blogspot.com/search/label/Software

Downloads: 0 This Week

Last Update: 2015-08-06

See Project

Jerbil

Java Expert Rule Based Inference Language. Jerbil is an open source rule processing engine written in Java. Currently Jerbil supports a full set of processing functions with text-based and XML interfaces; a Java interface is planned.

Downloads: 0 This Week

Last Update: 2013-04-17

See Project

Word Vector Tool

The Word Vector Tool is a simple but flexible Java library to create word vector representations of text documents. Word vectors can be used for various text processing tasks, as text classification, text clustering or information retrieval.

Downloads: 1 This Week

Last Update: 2013-04-08

See Project

Flesh

Flesh is a Java application designed to analyze a document (plain text, rich text, Word documents, and PDFs) and display the difficulty associated with comprehending using the Flesch-Kincaid Grade Level and the Flesch Reading Ease Score.

2 Reviews

Downloads: 1 This Week

Last Update: 2013-04-03

See Project

JTextPro: A Java Text Processing Toolkit

JTextPro: A Java-based Text Processing tool that includes sentence boundary detection (using maximum entropy classifier), word tokenization (following Penn conventions), part-of-speech tagging (using CRFTagger), and phrase chunking (using CRFChunker).

Downloads: 0 This Week

Last Update: 2013-03-13

See Project

Text Annotation Environment

The Text Annotation Environment (tae) can be used to annotate natural language text manually or automatically (UIMA Annotator) with meta information (tokens, part-of-speech, named entities, ...). Tae is based on Eclipse and IBM's UIMA.

Downloads: 0 This Week

Last Update: 2016-04-24

See Project

Namboo KDD

This project intends to create an indexing search engine, for knowledge management. The primary object is to apply an information retrieval core. And implement a knowledge data discovery theory such as data mining algorithm, text mining.

Downloads: 0 This Week

Last Update: 2013-03-13

See Project

AlgebraSql

This Software takes an text with algebra symbols and converts it to real sql, which then may be run with an extra database backend like mysql or postgresql.

Downloads: 0 This Week

Last Update: 2013-04-17

See Project

Sentensa

SENTENSA Knowledge Miner is a platform independent tool for searching any text. SENTENSA uses robust methods of indexing and searching text, leveraging on experience from more than 20 years of information retrieval.

Downloads: 0 This Week

Last Update: 2013-04-05

See Project

DotPlot

DotPlot is an Eclipse plug-in to graphically compare word sequences of any type of text. Matches will be plotted as dots on a graph. Similarities in thousands of lines of text or code will result in typical textures and diagonals in the plot.

Downloads: 0 This Week

Last Update: 2013-05-03

See Project

thetis

Thetis is a Java (OS-independent) application written to allow the linguistic and statistical analysis of the Homeric and Hesiodic Epic. Current achievement is the creation of a complete (and free) Thesaurus of the Homeric and Hesiodic poems.

Downloads: 0 This Week

Last Update: 2013-03-20

See Project

AutoSummary Semantic Analysis Engine

AutoSummary uses Natural Language Processing to generate a contextually-relevant synopsis of plain text. It uses statistical and rule-based methods for part-of-speech tagging, word sense disambiguation, sentence deconstruction and semantic analysis.

1 Review

Downloads: 0 This Week

Last Update: 2013-03-25

See Project

Interlingua Translator for Java

It is an universal language translator and written in Java. All languages are translated to an unique language (interlingua) and generate any native language from the interlingua. The wordbooks are XML. It use the context of a text, rules and a grammar.

Downloads: 0 This Week

Last Update: 2013-03-13

See Project

Text Processing Tool Kit

A tool kit for multiplexing annotations and management of features for textual annotation.

Downloads: 0 This Week

Last Update: 2014-04-06

See Project

UCECS

The "Universal Content Evaluation and Categorisation Software" is a program for analysing a websites, or more generally, a texts content. The text is arranged in dozens of categories, permitting more efficient web searches and information processing.

Downloads: 0 This Week

Last Update: 2013-03-07

See Project

MMOpenGraph

MMOpenGraph is a set of JAVA-Classes to represent graphs within java. It can load and save graphs to serialized or text-based files and analyze graphs to find shortest paths.

Downloads: 0 This Week

Last Update: 2013-03-07

See Project

TLGView

a cross-platform application to decode, search, browse, view, print, and export TLG/PHI BetaCode texts. Project is currently being ported from wxWindows to Java. (For more info, see the project homepage at http://wxtlg.sourceforge.net)

1 Review

Downloads: 0 This Week

Last Update: 2013-04-11

See Project

Stynalyser

Stynalyser is a tool that analyses the style of a text. It extracts several variables measuring the text style and displays the results. Export to other analysis programs possible. Programmed in Perl and Java.

Downloads: 0 This Week

Last Update: 2013-03-20

See Project

Integradata

Integradata is a plugable, rules-based, declarative data validation system written in Java

Downloads: 0 This Week

Last Update: 2013-03-13

See Project

ContextTree

A ContextTree is a way of dynamically forming relationships between information: the same information can be viewed in different ways, depending on what you want from it.

Downloads: 0 This Week

Last Update: 2013-02-22

See Project

OpenQDA

Free Tool for qualitative data analysis.

Downloads: 0 This Week

Last Update: 2016-04-25

See Project

Search Results for "text analysis" - Page 3

Showing 78 open source projects for "text analysis"

Java Text Categorizing Library

LACE (Lucene compatible Analyzer)

hypKNOWsys

BRUTUS

BWPGazetteer

Jerbil

Word Vector Tool

Flesh

JTextPro: A Java Text Processing Toolkit

Text Annotation Environment

Namboo KDD

AlgebraSql

Sentensa

DotPlot

thetis

AutoSummary Semantic Analysis Engine

Interlingua Translator for Java

Text Processing Tool Kit

UCECS

MMOpenGraph

TLGView

Stynalyser

Integradata

ContextTree

OpenQDA

Search Results for "text analysis" - Page 3

Showing 78 open source projects for "text analysis"

Java Text Categorizing Library

LACE (Lucene compatible Analyzer)

hypKNOWsys

BRUTUS

BWPGazetteer

Jerbil

Word Vector Tool

Flesh

JTextPro: A Java Text Processing Toolkit

Text Annotation Environment

Namboo KDD

AlgebraSql

Sentensa

DotPlot

thetis

AutoSummary Semantic Analysis Engine

Interlingua Translator for Java

Text Processing Tool Kit

UCECS

MMOpenGraph

TLGView

Stynalyser

Integradata

ContextTree

OpenQDA

Related Searches

Related Categories