Search Results for "text processing"

24 projects for "text processing" with 2 filters applied:

Filters applied: Information Analysis, BSD

1

GATE

Note that the source code and issue tracker have now moved to GitHub. Find us at https://github.com/GateNLP/. GATE (General Architecture for Text Engineering) is an architecture, framework and development environment for developing, evaluating and embedding Human Language Technology. See http://gate.ac.uk for full details.

8 Reviews

Downloads: 1 This Week

Last Update: 2026-04-07
See Project
2

Java Data Mining Package

The Java Data Mining Package (JDMP) is a library that provides methods for analyzing data with the help of machine learning algorithms (e.g. clustering, classification, graphical models, neural networks, Bayesian networks, text processing, optimization).

Downloads: 0 This Week

Last Update: 2015-08-19
See Project
3

Text Analyzer

Text analyzing software

An application developed in C using list and AVL tree data structures. It analyzes a text (.txt file) and outputs the following information:
1. the total occurrences of every word in the text
2. the exact line of every occurrence of every word
3. the exact position in the line of every occurrence of every word
4. the exact paragraph of every occurrence of every word
5. the exact sentence of every occurrence of every word
The output is also written in a...

Downloads: 0 This Week

Last Update: 2014-11-05
See Project
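
As a rough illustration of the first three outputs listed for Text Analyzer (occurrence counts, line numbers, and positions within the line), here is a minimal Python sketch. It is not the project's code: the project is written in C with list and AVL tree structures, whereas this sketch swaps in a plain dictionary. The function name `build_index` and the whitespace-based, lowercased tokenization are assumptions made for illustration.

```python
from collections import defaultdict


def build_index(text):
    """Map each word to a list of (line_number, position_in_line) pairs.

    Both numbers are 1-based. Words are split on whitespace and lowercased;
    this tokenization is an assumption, not the project's documented rule.
    """
    index = defaultdict(list)
    for line_no, line in enumerate(text.splitlines(), start=1):
        for pos, word in enumerate(line.split(), start=1):
            index[word.lower()].append((line_no, pos))
    return dict(index)


if __name__ == "__main__":
    sample = "the cat saw the dog\nthe dog ran"
    # Print each word, its total occurrence count, and where it occurs.
    for word, hits in sorted(build_index(sample).items()):
        print(word, len(hits), hits)
```

The length of each list gives the total occurrences of a word, and each pair gives the line and the word position on that line.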
4

BioNLP UIMA Component Repository

The BioNLP UIMA Component Repository provides UIMA wrappers for novel and well-known 3rd-party NLP tools used in biomedical text processing, such as tokenizers, parsers, named entity taggers, and tools for evaluation.

Downloads: 0 This Week

Last Update: 2014-07-09
See Project
5

FALCON - Text Search Java Project

JSON based text search Java Project

What is it? "Falcon Search" is a Java API and tool for searching inside documents. It was originally started to search the content of PDF files under the project "HAWK Search". Searching with this tool is query-based, not word-based as in most document search tools or document readers. It also handles jumbled word order within a query and spelling mistakes. Commonly used techniques in this project are Natural Language...

Downloads: 0 This Week

Last Update: 2014-04-18
See Project
6

HAWK - PDF Text Search Java Project

This project is no longer supported. Take a look at Falcon Search instead: https://sourceforge.net/projects/falcontextsearch/

Downloads: 0 This Week

Last Update: 2014-04-19
See Project
7

ASTL Automata Standard Template Library

ASTL Automata Standard Template Library (Vincent Le Maout - Dominique Revuz) is a set of generic and efficient C++ components for automata manipulation.

Downloads: 0 This Week

Last Update: 2013-04-26
See Project
8

Tabulator

A set of Unix command line tools for quick and convenient batch processing of tabular text files (a.k.a. tab-delimited, CSV, or flat file format) with a header line. Provides delimiter and compression detection and column reference by name.
* tblmap: per-line ("map") computation: derive columns through an expression, delete, reorder, filter rows.
* tblred: compute ("reduce") aggregations (e.g., sum, average) over groups defined by key columns.

Downloads: 0 This Week

Last Update: 2013-04-25
See Project
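
The "map" and "reduce" ideas behind tblmap and tblred can be sketched in Python. This does not reproduce Tabulator's command-line interface, which is not shown in this listing. The function names `map_rows` and `reduce_sum` and the sample columns are hypothetical, chosen only to illustrate a per-row derived column followed by a grouped sum over a tab-delimited file with a header line.

```python
import csv
import io
from collections import defaultdict


def map_rows(rows, new_column, expression):
    """Per-row ("map") step: derive a new column from each row."""
    for row in rows:
        out = dict(row)
        out[new_column] = expression(row)
        yield out


def reduce_sum(rows, key_column, value_column):
    """Grouped ("reduce") step: sum value_column for each key_column value."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[key_column]] += float(row[value_column])
    return dict(totals)


if __name__ == "__main__":
    # Hypothetical tab-delimited input with a header line.
    data = "city\tqty\tprice\nOslo\t2\t1.5\nRome\t1\t4.0\nOslo\t3\t2.0\n"
    reader = csv.DictReader(io.StringIO(data), delimiter="\t")
    with_total = map_rows(
        reader, "total", lambda r: float(r["qty"]) * float(r["price"])
    )
    print(reduce_sum(with_total, "city", "total"))
```

Columns are referenced by header name throughout, which mirrors the "column reference by name" feature described above.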
9

Apolda

Apolda is a plugin for the GATE framework (see http://sourceforge.net/projects/gate/) that annotates texts with labels of concepts from an arbitrary OWL ontology.

Downloads: 0 This Week

Last Update: 2013-04-26
See Project
10

TextMarker

TextMarker is now developed and hosted at Apache UIMA (http://uima.apache.org/textmarker.html). TextMarker is a UIMA-based tool for information extraction and more. The full featured editor of the rule language and the build process of UIMA descriptors are complemented with components for visualization, explanation, testing and rule learning.

1 Review

Downloads: 3 This Week

Last Update: 2013-04-29
See Project
11

YagpoOCRUnicode c++library

OCR C++ library. Includes: contour recognition; vectorisation; matrix letter feature recognition; automatic page segmentation and rotation detection; SS3 ASM core; XML base; web-based GUI; 99.6% printed Unicode text recognition; letter base up to 1200 letters.

Downloads: 0 This Week

Last Update: 2013-04-08
See Project
12

GigaWordCorpus

We use a large archive of newspaper stories (GigaWordCorpus) as input to a parallel MPI program, which produces a list of the top R terms of varying lengths M through N that are especially interesting. The program is written in C using MPI.

Downloads: 0 This Week

Last Update: 2013-03-26
See Project
13

n-squared text editor

n-squared is a lightweight, super-powered notepad application that stores notes in an embedded database for easy searching. It has a tabbed interface, syntax highlighting, encryption, and more!

Downloads: 0 This Week

Last Update: 2016-02-03
See Project
14

iDocs

iDocs is an intellectual document workflow project with text mining options.

Downloads: 0 This Week

Last Update: 2014-04-08
See Project
15

Java Text Categorizing Library

The Java Text Categorizing Library (JTCL) is a pure Java implementation of libTextCat, which in turn is "a library that was primarily developed for language guessing, a task on which it is known to perform with near-perfect accuracy."

1 Review

Downloads: 0 This Week

Last Update: 2013-04-08
See Project
16

hypKNOWsys

hypKNOWsys aims at developing a Java-based workbench for knowledge discovery and knowledge management. Currently, hypKNOWsys has released two intermediate tools: DIAsDEM Workbench (text mining for semantic tagging) and WUMprep (Web mining pre-processing)

Downloads: 0 This Week

Last Update: 2013-04-15
See Project
17

Jerbil

Java Expert Rule Based Inference Language. Jerbil is an open source rule processing engine written in Java. Currently Jerbil supports a full set of processing functions with text-based and XML interfaces; a Java interface is planned.

Downloads: 0 This Week

Last Update: 2013-04-17
See Project
18

Flesh

Flesh is a Java application designed to analyze a document (plain text, rich text, Word documents, and PDFs) and display how difficult it is to comprehend, using the Flesch-Kincaid Grade Level and the Flesch Reading Ease Score.

2 Reviews

Downloads: 2 This Week

Last Update: 2013-04-03
See Project
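
The two readability scores named in the Flesh description have standard published formulas, sketched below in Python. This is not Flesh's own code (Flesh is a Java application), and how Flesh counts words, sentences, and syllables is not stated in this listing, so the sketch assumes the caller already has those counts. The function names are hypothetical.

```python
def flesch_reading_ease(words, sentences, syllables):
    """Flesch Reading Ease: higher scores indicate easier text."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def flesch_kincaid_grade(words, sentences, syllables):
    """Flesch-Kincaid Grade Level: approximate US school grade."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


if __name__ == "__main__":
    # Hypothetical counts for a short document.
    words, sentences, syllables = 100, 5, 150
    print(round(flesch_reading_ease(words, sentences, syllables), 3))
    print(round(flesch_kincaid_grade(words, sentences, syllables), 2))
```

Both scores depend only on average sentence length (words per sentence) and average word length (syllables per word), so the quality of the result rests mostly on how well syllables and sentence boundaries are counted.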
19

JTextPro: A Java Text Processing Toolkit

JTextPro: A Java-based Text Processing tool that includes sentence boundary detection (using maximum entropy classifier), word tokenization (following Penn conventions), part-of-speech tagging (using CRFTagger), and phrase chunking (using CRFChunker).

Downloads: 0 This Week

Last Update: 2013-03-13
See Project
20

AutoSummary Semantic Analysis Engine

AutoSummary uses Natural Language Processing to generate a contextually-relevant synopsis of plain text. It uses statistical and rule-based methods for part-of-speech tagging, word sense disambiguation, sentence deconstruction and semantic analysis.

1 Review

Downloads: 0 This Week

Last Update: 2013-03-25
See Project
21

LineSort for Mac OS X

LineSort is a plug-in tool that adds advanced sorting capabilities to BBEdit, the Macintosh text editor from Bare Bones Software.

1 Review

Downloads: 0 This Week

Last Update: 2013-04-08
See Project
22

UCECS

The "Universal Content Evaluation and Categorisation Software" is a program for analysing the content of a website or, more generally, of a text. The text is arranged into dozens of categories, permitting more efficient web searches and information processing.

Downloads: 0 This Week

Last Update: 2013-03-07
See Project
23

TLGView

A cross-platform application to decode, search, browse, view, print, and export TLG/PHI BetaCode texts. The project is currently being ported from wxWindows to Java. (For more info, see the project homepage at http://wxtlg.sourceforge.net)

1 Review

Downloads: 0 This Week

Last Update: 2013-04-11
See Project
24

Integradata

Integradata is a pluggable, rules-based, declarative data validation system written in Java.

Downloads: 0 This Week

Last Update: 2013-03-13
See Project


