Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Scientific/Engineering
Information Analysis Software
Search Results

Search Results for "text processing"

x

Sort By:

Relevance

Clear All Filters

OS

ChromeOS 19
BSD 19
Linux 19
More...
Mac 19
Windows 19

Category

Scientific/Engineering 19
Artificial Intelligence 10
Text Editors 10
Education 3
Internet 3
Software Development 3
Business 2
Religion and Philosophy 2
Social sciences 2

License

OSI-Approved Open Source 18
Creative Commons Attribution License 1
Other License 1

Translations

Programming Language

Java 17
C++ 2
JavaScript 1
Perl 1
More...
Prolog 1
Python 1
XSL (XSLT/XPath/XSL-FO) 1

Status

Beta 9
Alpha 4
Production/Stable 4
Planning 2
More...
Pre-Alpha 1
Mature 1
Inactive 1

19 projects for "text processing" with 2 filters applied:

Information Analysis ChromeOS Clear Filters & Widen Search

AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.

Start Free
1

GATE

NOTE THAT THE SOURCE CODE AND ISSUE TRACKER HAVE NOW MOVED TO GITHUB. FIND US AT https://github.com/GateNLP/ GATE (General Architecture for Text Engineering) is an architecture, framework and development environment for developing, evaluating and embedding Human Language Technology. See http://gate.ac.uk for full details.

8 Reviews

Downloads: 1 This Week

Last Update: 2026-04-07
See Project
2

Java Data Mining Package

The Java Data Mining Package (JDMP) is a library that provides methods for analyzing data with the help of machine learning algorithms (e.g. clustering, classification, graphical models, neural networks, Bayesian networks, text processing, optimization).

Downloads: 0 This Week

Last Update: 2015-08-19
See Project
3

BioNLP UIMA Component Repository

The BioNLP UIMA Component Repository provides UIMA wrappers for novel and well-known 3rd-party NLP tools used in biomedical text prosessing, such as tokenizers, parsers, named entity taggers, and tools for evaluation.

Downloads: 0 This Week

Last Update: 2014-07-09
See Project
4

FALCON - Text Search Java Project

JSON based text search Java Project

----------------- - What is it? - ----------------- The "Falcon Search" is a JAVA API and tool to search inside the documents. It was originally started to search the content in pdf files under the project "HAWK Search". Searching with this tool is query-based not word-based as in most of the document search tools OR document readers. It also takes care of jumbling of words within query and spelling mistakes. Commonly used techniques in this project are Natural Language...

Downloads: 0 This Week

Last Update: 2014-04-18
See Project
Earn up to 16% annual interest with Nexo.
More flexibility. More control.

Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.

Get started with Nexo.
5

HAWK - PDF Text Search Java Project

No more support for this project - TAKE A LOOK AT FALCONSEARCH

No more support for this project - TAKE A LOOK AT FALCONSEARCH "https://sourceforge.net/projects/falcontextsearch/"

Downloads: 0 This Week

Last Update: 2014-04-19
See Project
6

ASTL Automata Standard Template Library

ASTL Automata Standard Template Library (Vincent Le Maout - Dominique Revuz) is a set of generic and efficient C++ components for automata manipulation.

Downloads: 0 This Week

Last Update: 2013-04-26
See Project
7

Apolda

Apolda is a plugin for the Gate framework (see http://sourceforge.net/projects/gate/) that annotates texts with labels of concepts from an arbitrary OWL-ontology.

Downloads: 0 This Week

Last Update: 2013-04-26
See Project
8

TextMarker

TextMarker is now developed and hosted at Apache UIMA (http://uima.apache.org/textmarker.html). TextMarker is a UIMA-based tool for information extraction and more. The full featured editor of the rule language and the build process of UIMA descriptors are complemented with components for visualization, explanation, testing and rule learning.

1 Review

Downloads: 3 This Week

Last Update: 2013-04-29
See Project
9

YagpoOCRUnicode c++library

OCR c++ library. Include: contour recognition; vectorisation; matrix letter feature recognition; auto page segmentation and detect rotation; SS3 ASM core; XML base; web-based GUI; 99,6% printed Unicode text recognition; letter base up to 1200 letters.

Downloads: 0 This Week

Last Update: 2013-04-08
See Project
$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
10

iDocs

iDocs is a intellectual document work flow with text mining options project.

Downloads: 0 This Week

Last Update: 2014-04-08
See Project
11

Java Text Categorizing Library

The Java Text Categorizing Library (JTCL) is a pure java implementation of libTextCat which in turn is "a library that was primarily developed for language guessing, a task on which it is known to perform with near-perfect accuracy."

1 Review

Downloads: 0 This Week

Last Update: 2013-04-08
See Project
12

hypKNOWsys

hypKNOWsys aims at developing a Java-based workbench for knowledge discovery and knowledge management. Currently, hypKNOWsys has released two intermediate tools: DIAsDEM Workbench (text mining for semantic tagging) and WUMprep (Web mining pre-processing)

Downloads: 0 This Week

Last Update: 2013-04-15
See Project
13

Jerbil

Java Expert Rule Based Inference Language. Jerbil is an open source rule processing engine written in Java. Currently Jerbil supports a full set of processing functions with text-based and XML interfaces; a Java interface is planned.

Downloads: 0 This Week

Last Update: 2013-04-17
See Project
14

Flesh

Flesh is a Java application designed to analyze a document (plain text, rich text, Word documents, and PDFs) and display the difficulty associated with comprehending using the Flesch-Kincaid Grade Level and the Flesch Reading Ease Score.

2 Reviews

Downloads: 2 This Week

Last Update: 2013-04-03
See Project
15

JTextPro: A Java Text Processing Toolkit

JTextPro: A Java-based Text Processing tool that includes sentence boundary detection (using maximum entropy classifier), word tokenization (following Penn conventions), part-of-speech tagging (using CRFTagger), and phrase chunking (using CRFChunker).

Downloads: 0 This Week

Last Update: 2013-03-13
See Project
16

AutoSummary Semantic Analysis Engine

AutoSummary uses Natural Language Processing to generate a contextually-relevant synopsis of plain text. It uses statistical and rule-based methods for part-of-speech tagging, word sense disambiguation, sentence deconstruction and semantic analysis.

1 Review

Downloads: 0 This Week

Last Update: 2013-03-25
See Project
17

UCECS

The "Universal Content Evaluation and Categorisation Software" is a program for analysing a websites, or more generally, a texts content. The text is arranged in dozens of categories, permitting more efficient web searches and information processing.

Downloads: 0 This Week

Last Update: 2013-03-07
See Project
18

TLGView

a cross-platform application to decode, search, browse, view, print, and export TLG/PHI BetaCode texts. Project is currently being ported from wxWindows to Java. (For more info, see the project homepage at http://wxtlg.sourceforge.net)

1 Review

Downloads: 0 This Week

Last Update: 2013-04-11
See Project
19

Integradata

Integradata is a plugable, rules-based, declarative data validation system written in Java

Downloads: 0 This Week

Last Update: 2013-03-13
See Project

Previous
You're on page 1
Next

Related Searches

wordnet 2.0

gate

algorithms

text mining

war files

planar

hebrew ocr

machine learning for text categorization java

inference engine

flesh

Related Categories

Scientific/Engineering

Artificial Intelligence

Text Editors

Education

Internet

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise