Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence Software
Search Results

Search Results for "text processing" - Page 12

x

Sort By:

Relevance

Clear All Filters

OS

Mac 305
Linux 300
Windows 297
More...
BSD 130
ChromeOS 123
Mobile Operating Systems 8
Desktop Operating Systems 6
Server Operating Systems 2
Game Consoles 1

Category

Artificial Intelligence 305
Scientific/Engineering 40
Software Development 28
Text Editors 26
Business 15
Multimedia 12
Education 5
System 5
Communications 4
Formats and Protocols 4
Internet 4
Games 3
Security 3
Desktop Environment 1
Printing 1
Religion and Philosophy 1
Social sciences 1

License

OSI-Approved Open Source 257
Creative Commons Attribution License 6
Other License 2
GNU Free Documentation License 1
More...
Public Domain 1

Translations

Programming Language

Status

Production/Stable 27
Beta 23
Alpha 13
Mature 3
More...
Planning 2
Pre-Alpha 2
Inactive 2

Showing 305 open source projects for "text processing"

View related business solutions

Artificial Intelligence Mac Clear Filters & Widen Search

Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
1

TextMarker

TextMarker is now developed and hosted at Apache UIMA (http://uima.apache.org/textmarker.html). TextMarker is a UIMA-based tool for information extraction and more. The full featured editor of the rule language and the build process of UIMA descriptors are complemented with components for visualization, explanation, testing and rule learning.

1 Review

Downloads: 3 This Week

Last Update: 2013-04-29
See Project
2

charface

...It suports automatica detection of next engines to be installed - cuneiform with its languages - tesseract with language database files - gocr Supports - adding custom engines - bach processing of images - text postprocessing

Downloads: 0 This Week

Last Update: 2015-06-25
See Project
3

Tools for Text Analytics

This project is a compilation of tools/libraries to help with tasks related to Text Analytics mainly in Java. These tools range from simple wrappers to sophisticated mining tasks that can improve the productivity of researchers and engineers.

Downloads: 0 This Week

Last Update: 2015-11-27
See Project
4

Sanchay

Sanchay is a collection of tools and APIs for language researchers. It has some implementations of NLP algorithms, some flexible APIs, several user friendly annotation interfaces and Sanchay Query Language for language resources.

Downloads: 0 This Week

Last Update: 2013-04-11
See Project
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.

Start Free
5

OpenDMAP

OpenDMAP (Open Source Direct Memory Access Parser) is a natural language processing (text mining) application: a semantic parser for information extraction.

Downloads: 0 This Week

Last Update: 2013-04-30
See Project
6

Arabic Phonetic Platform using VoiceXML

This project'll be the core engine of many voice based platforms,which can be implemented into your projects,websites...etc to provide an Arabic speech service, where your servers can interact with the clients through Arabic Speech Recognition.

Downloads: 0 This Week

Last Update: 2013-04-01
See Project
7

YagpoOCRUnicode c++library

OCR c++ library. Include: contour recognition; vectorisation; matrix letter feature recognition; auto page segmentation and detect rotation; SS3 ASM core; XML base; web-based GUI; 99,6% printed Unicode text recognition; letter base up to 1200 letters.

Downloads: 0 This Week

Last Update: 2013-04-08
See Project
8

Lingual Quanta

The Lingual Quanta is an organization created by software engineers that are interested in Natural Language Processing technologies focused in libraries useful for projects such as grammar checkers, text markups etc.

Downloads: 0 This Week

Last Update: 2013-04-19
See Project
9

Concrete Voice

Concrete Voice is a text to speech program. It can read the time, anounce weather, read text file, save text files to audio files, open any text file (supports all text encoding formats) and many more advance stuff!

Downloads: 0 This Week

Last Update: 2016-01-31
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
10

MutationFinder

MutationFinder is a biomedical natural language processing (NLP) system for extracting mentions of point mutations from free text. MutationFinder achieves high performance (99% precision, 81% recall on blind test data) as an information extraction system

Downloads: 0 This Week

Last Update: 2013-03-22
See Project
11

iDocs

iDocs is a intellectual document work flow with text mining options project.

Downloads: 0 This Week

Last Update: 2014-04-08
See Project
12

JAIMLpad

"Java Artificial Intelligence Markup Language PAD" is a tool that manages ProgramD AI (on local or remote machines) and AIML files with real-time previews and it provides a network support to test AI capabilities over many network protocols.

Downloads: 0 This Week

Last Update: 2014-08-03
See Project
13

Auvai Text to Speech

Auvai is a Java API and Java Swing based application for Text to Speech conversion of Unicode Tamil. Future direction of this API and application is to support Text to Speech conversion for all "Indic" languages.

Downloads: 0 This Week

Last Update: 2013-03-22
See Project
14

Jerbil

Java Expert Rule Based Inference Language. Jerbil is an open source rule processing engine written in Java. Currently Jerbil supports a full set of processing functions with text-based and XML interfaces; a Java interface is planned.

Downloads: 0 This Week

Last Update: 2013-04-17
See Project
15

Word Vector Tool

The Word Vector Tool is a simple but flexible Java library to create word vector representations of text documents. Word vectors can be used for various text processing tasks, as text classification, text clustering or information retrieval.

Downloads: 0 This Week

Last Update: 2013-04-08
See Project
16

JTextPro: A Java Text Processing Toolkit

JTextPro: A Java-based Text Processing tool that includes sentence boundary detection (using maximum entropy classifier), word tokenization (following Penn conventions), part-of-speech tagging (using CRFTagger), and phrase chunking (using CRFChunker).

Downloads: 0 This Week

Last Update: 2013-03-13
See Project
17

Young Researchers' Induction Foundation

Collection of Statistical Language Processing Tools and Modules for Information Retrieval, Document Classification, Vectorization, Pattern Matching, Knowledge/Text Mining related problems.

Downloads: 0 This Week

Last Update: 2013-04-17
See Project
18

eBookFormatter

Got any emails with obnoxious inline text? Long text stories with bad formatting? Files that an OCR didn't quite translate right? RTF format files and no easy way to read or modify them? Then eBookFormatter is for you!

Downloads: 0 This Week

Last Update: 2013-03-12
See Project
19

AutoSummary Semantic Analysis Engine

AutoSummary uses Natural Language Processing to generate a contextually-relevant synopsis of plain text. It uses statistical and rule-based methods for part-of-speech tagging, word sense disambiguation, sentence deconstruction and semantic analysis.

1 Review

Downloads: 0 This Week

Last Update: 2013-03-25
See Project
20

OpenOCR

OpenOCR will be a commercial quality ocr engine with tools for pre- and post-processing of images and resulting text.

Downloads: 0 This Week

Last Update: 2015-07-12
See Project
21

GTR Language Workbench

A Java application for statistical analysis and systematic manipulation of natural language texts.

Downloads: 0 This Week

Last Update: 2013-02-21
See Project
22

pySPACE

Signal Processing and Classification Environment in Python using YAML

pySPACE is a modular software for processing of large data streams that has been specifically designed to enable distributed execution and empirical evaluation of signal processing chains. Various signal processing algorithms (so called nodes) are available within the software, from finite impulse response filters over data-dependent spatial filters (e.g. CSP, xDAWN) to established classifiers (e.g.

Downloads: 0 This Week

Last Update: 2014-10-29
See Project
23

Kuto

When translating becomes a game ! Text to translate can be graphically selected. Several dictionnaries can be sorted according to the context. A large choice of matching strategies is available. The OCR engine is tunable.

Downloads: 0 This Week

Last Update: 2013-02-22
See Project
24

OBO Annotator

The OBO-Annotator is a semantic NLP tool that is designed to give its end-users a great deal of flexibility to combine any number of OBO ontologies from the OBO foundry regardless of their format and use them to annotate text-bases.

Downloads: 0 This Week

Last Update: 2014-10-08
See Project
25

mms-300m-1130-forced-aligner

CTC-based forced aligner for audio-text in 158 languages

...The alignment pipeline includes audio processing, emission generation, tokenization, and span detection, making it suitable for speech analysis, transcription syncing, and dataset creation. This model is especially useful for researchers and developers working with low-resource languages or building multilingual speech systems.

Downloads: 0 This Week

Last Update: 2025-07-02
See Project

Previous
8
9
10
11
You're on page 12
13
Next

Related Searches

cuneiform ocr

text mining

resource viewer

arabic voice

hebrew ocr

voice to text

jaimlpad

inference engine

wvtool

jtextpro

Related Categories

Artificial Intelligence

Scientific/Engineering

Software Development

Text Editors

Business

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise