Search Results for "text analysis" - Page 5

Showing 154 open source projects for "text analysis"

1

ZBNF-parser

Java classes for parsing text and either converting it to XML or evaluating it in Java. The parser is controlled by a textual script whose syntax is close to Backus-Naur Form, named ZBNF. Also includes some conversion routines: C header or Java to XMI, XML documentation generation,

Downloads: 0 This Week

Last Update: 2014-07-07
See Project
2

DawNLITE

DawNLITE is a Natural-Language-based Image Transmoding Engine. The software transforms an image to a video as recorded by a virtual camera panning and zooming over the image, following a natural language text description of the image.

Downloads: 0 This Week

Last Update: 2013-04-18
See Project
3

Grammar Browser

Provides a GUI for the grammatical structure and relations (as parsed by the Stanford Parser) of any text. Contains a grammatical relation editor to modify, import, and export grammatical relation definitions (tregex patterns and features).

Downloads: 0 This Week

Last Update: 2013-04-03
See Project
4

VoDoo/Stream

The Vodoo/Stream project lets users define transducers dedicated to document analysis. Such transducers describe how fragments are matched and transformed. A document can be an XML fragment, free text, or something else, depending on extensions

Downloads: 0 This Week

Last Update: 2013-04-24
See Project
5

FIBER

The Fiber project seeks to create a modular open source text mining tool that provides a contextual foundation for analysis in the dissemination of large quantities of text data.

Downloads: 0 This Week

Last Update: 2015-05-20
See Project
6

Trainable Relation Extraction framework

T-Rex (Trainable Relation Extraction) is a highly configurable, machine-learning-based framework for information extraction from text. It includes tools for document classification, entity extraction, and relation extraction.

Downloads: 0 This Week

Last Update: 2013-05-02
See Project
7

Smart Tail

Like Unix tail, but it runs with or without a GUI; can suspend and resume tailing at runtime; can monitor a set of files; prints output to a text field, stdout, or a file; also runs in "Grep" mode (reads files once); and supports (almost) the same options as Unix tail.

Downloads: 0 This Week

Last Update: 2016-06-17
See Project
8

AMA Text Tool

The main purpose of AMATOOL is to provide an application for semiautomatic markup of text using XML tags. Typical texts are archaeological reports or Middle Ages text scripts. It is a semiautomatic editor.

Downloads: 0 This Week

Last Update: 2014-07-17
See Project
9

DuMP3 - duplicate & similar file finder

DuMP3 is a duplicate and similar file finder. It finds exact duplicate binaries by hash, similar text files by substring content, images (JPG, BMP, GIF, PNG, etc.) by color, and audio files (MP3, WAV, OGG, etc.) by wave data. Planned for the future: fonts and video.

5 Reviews

Downloads: 0 This Week

Last Update: 2012-07-19
See Project
10

GoldenOrb

GoldenOrb is a Java library, released under the Apache License 2.0, for correlation, summarization, and clustering of text information.

Downloads: 0 This Week

Last Update: 2013-04-19
See Project
11

iDocs

iDocs is an intellectual document workflow project with text mining options.

Downloads: 0 This Week

Last Update: 2014-04-08
See Project
12

Java Text Categorizing Library

The Java Text Categorizing Library (JTCL) is a pure Java implementation of libTextCat, which in turn is "a library that was primarily developed for language guessing, a task on which it is known to perform with near-perfect accuracy."

1 Review

Downloads: 0 This Week

Last Update: 2013-04-08
See Project
13

LACE (Lucene compatible Analyzer)

LACE stands for "Lucene Analyzer for CJK (Chinese/Japanese/Korean) & English". It is a simple tokenizer that can handle mixed English-CJK text. Chinese words are handled using a dictionary-based method.

Downloads: 0 This Week

Last Update: 2013-04-05
See Project
14

hypKNOWsys

hypKNOWsys aims at developing a Java-based workbench for knowledge discovery and knowledge management. Currently, hypKNOWsys has released two intermediate tools: DIAsDEM Workbench (text mining for semantic tagging) and WUMprep (Web mining pre-processing)

Downloads: 0 This Week

Last Update: 2013-04-15
See Project
15

BRUTUS

The UIMA Annotator (called BRUTUS - Business Rules from Unstructured Text and Unstructured Sources) is a component for the UIMA Framework that allows for capturing business knowledge formalized in Structured English syntax (based on OMG's SBVR) with MOF

Downloads: 2 This Week

Last Update: 2013-03-22
See Project
16

BWPGazetteer

An approximate gazetteer for GATE (General Architecture for Text Engineering), based on Levenshtein distance. Strings can be matched and found even in texts with noise and errors. More info: http://bruno-wp.blogspot.com/search/label/Software

Downloads: 0 This Week

Last Update: 2015-08-06
See Project
17

Jerbil

Java Expert Rule Based Inference Language. Jerbil is an open source rule processing engine written in Java. Currently Jerbil supports a full set of processing functions with text-based and XML interfaces; a Java interface is planned.

Downloads: 0 This Week

Last Update: 2013-04-17
See Project
18

bios sequential tagger

Bios is a suite of syntactic and semantic analyzers that includes the most common tools needed for the shallow analysis of English text.

Downloads: 0 This Week

Last Update: 2013-03-25
See Project
19

Word Vector Tool

The Word Vector Tool is a simple but flexible Java library for creating word vector representations of text documents. Word vectors can be used for various text processing tasks, such as text classification, text clustering, or information retrieval.

Downloads: 1 This Week

Last Update: 2013-04-08
See Project
20

Flesh

Flesh is a Java application designed to analyze a document (plain text, rich text, Word documents, and PDFs) and display how difficult it is to comprehend, using the Flesch-Kincaid Grade Level and the Flesch Reading Ease score.

2 Reviews

Downloads: 2 This Week

Last Update: 2013-04-03
See Project
21

Tools for Field Linguistics

This site is devoted to the collaborative creation of tools, protocols and procedures for field linguistics and language analysis. We are especially interested in tools for annotating or manipulating text, audio and video-based language archives.

Downloads: 0 This Week

Last Update: 2013-03-22
See Project
22

Text Annotation Environment

The Text Annotation Environment (tae) can be used to annotate natural language text manually or automatically (UIMA Annotator) with meta information (tokens, part-of-speech, named entities, ...). Tae is based on Eclipse and IBM's UIMA.

Downloads: 1 This Week

Last Update: 2016-04-24
See Project
23

JTextPro: A Java Text Processing Toolkit

JTextPro is a Java-based text processing tool that includes sentence boundary detection (using a maximum entropy classifier), word tokenization (following Penn conventions), part-of-speech tagging (using CRFTagger), and phrase chunking (using CRFChunker).

Downloads: 0 This Week

Last Update: 2013-03-13
See Project
24

Language Generator by Instil (LGI)

Linear-time, dynamic API for lexical analysis and parser generation. Allows text-based specification of formal languages using the well-known regular-expression approach, with Parsing Expression Grammars as the underlying engine.

1 Review

Downloads: 0 This Week

Last Update: 2014-06-27
See Project
25

Namboo KDD

This project intends to create an indexing search engine for knowledge management. The primary objective is to apply an information retrieval core and to implement knowledge data discovery techniques such as data mining algorithms and text mining.

Downloads: 0 This Week

Last Update: 2013-03-13
See Project
