Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "text recognition" - Page 4

x

Sort By:

Relevance

Clear All Filters

OS

Windows 96
Linux 95
Mac 84
More...
BSD 38
ChromeOS 37
Mobile Operating Systems 4
Server Operating Systems 1

Category

Artificial Intelligence 95
Software Development 7
Formats and Protocols 6
Multimedia 6
Education 4
Business 3
Communications 2
Internet 2
System 2
Text Editors 2
Database 1
Religion and Philosophy 1
Scientific/Engineering 1
Social sciences 1

License

OSI-Approved Open Source 100
Creative Commons Attribution License 1
GNU Free Documentation License 1

Translations

English 8
German 3
Chinese (Simplified) 1
Japanese 1
More...
Korean 1

Programming Language

Python 107
JavaScript 6
C++ 4
Java 3
BASIC 2
More...
C# 2
Ruby 2
Visual Basic 2
C 1
Common Lisp 1
Delphi/Kylix 1
Kotlin 1
Perl 1
PHP 1
TypeScript 1

Status

Production/Stable 8
Alpha 5
Beta 3

Showing 107 open source projects for "text recognition"

View related business solutions

Python Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
1

bitfarm-Archiv Document Management - DMS

bitfarm-Archiv is a powerful Document Management (DMS), Enterprise Content Management (ECM) and Knowledge Management System (KMS) with Workflow Components. Help us! As we live in the internet age, the best thing, you can help, is to write a short statement about your scenario and your use of the DMS, along with your experiences and put it on your own website or in a blog or forum. It would help us best, if you can also add a hyperlink to our site http://www.bitfarm-archiv.com. By this...

11 Reviews

Downloads: 15 This Week

Last Update: 3 days ago
See Project
2

Maia

MAIA (MyApp Intelligence Artificial) is designed to provide a foundation for building your own voice-controlled assistant with Python. It uses various libraries and modules for speech recognition, text-to-speech synthesis, and custom functionality.

Downloads: 1 This Week

Last Update: 2024-04-21
See Project
3

Autolabel

Label, clean and enrich text datasets with LLMs

Autolabel is a Python library to label, clean and enrich datasets with Large Language Models (LLMs). Autolabel data for NLP tasks such as classification, question-answering and named entity recognition, entity matching and more. Seamlessly use commercial and open-source LLMs from providers such as OpenAI, Anthropic, HuggingFace, Google and more.

Downloads: 4 This Week

Last Update: 2023-10-16
See Project
4

Image to Text

Convert an image to text to spot intelligible words.

The program will convert to text an image, such as a photo , with the purpose of analyzing it to spot intelligible words. Use the program with photos of clouds, sea, soil, vegetation or any other photo of natural or man-made semi-homogeneous configuration, to reveal the hidden universal-philosophical messages of the image. You can also use it on photos of people or art pieces to have a psychological insight of the person portrayed or of the image author. The resulting text will be a long...

Downloads: 0 This Week

Last Update: 2024-11-21
See Project
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime
General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.

Try Free
5

MMOCR

OpenMMLab Text Detection, Recognition and Understanding Toolbox

MMOCR is an open-source toolbox based on PyTorch and mmdetection for text detection, text recognition, and the corresponding downstream tasks including key information extraction. It is part of the OpenMMLab project. The toolbox supports not only text detection and text recognition, but also their downstream tasks such as key information extraction. The toolbox supports a wide variety of state-of-the-art models for text detection, text recognition and key information extraction. ...

Downloads: 0 This Week

Last Update: 2023-07-04
See Project
6

funNLP

Resources, corpora, and tools for Chinese natural language processing

...It aggregates datasets, lexicons, wordlists, sentiment dictionaries, knowledge graphs, and pretrained model references, serving as a one-stop resource hub for Chinese NLP practitioners. The repository is organized into categories such as sentiment analysis, text classification, named entity recognition, knowledge graphs, and various lexicons (e.g. sensitive words, emotion dictionaries, stopwords). It also includes links to academic papers, open-source model implementations, and practical utilities like word segmentation or text cleaning scripts. The project is highly community-oriented, frequently updated with contributions and new resources, and it’s widely used in both academic and applied NLP research. ...

Downloads: 0 This Week

Last Update: 2025-10-01
See Project
7

Promptify

se GPT or other prompt based models to get structured output

...Instead of manually crafting prompts for each task, Promptify introduces a unified architecture that combines prompt templates, language model interfaces, and processing pipelines into a single framework. This approach allows developers to perform tasks such as text classification, named entity recognition, question answering, and information extraction using consistent prompt templates. The library supports integration with multiple large language model providers, enabling users to experiment with various models without changing their overall workflow.

Downloads: 0 This Week

Last Update: 2026-03-15
See Project
8

wukong-robot

Chinese voice dialogue robot/smart speaker project

wukong-robot is a Chinese voice assistant / smart speaker project built to let makers and hackers design highly customizable voice-controlled devices. It combines wake-word detection, automatic speech recognition, natural language understanding, and text-to-speech into a single framework aimed at the Chinese-speaking ecosystem. The project is positioned as a simple, flexible, and elegant platform that can run on devices like Raspberry Pi and other Linux-based boards, making it suitable for DIY smart speakers and home-automation hubs. It supports multi-turn conversational capabilities powered by ChatGPT or other large language models, letting users have continuous dialogues rather than one-shot commands. ...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
9

VATSG

Video automatic transcribe and translated subtitle generator

...This is the subtitle generator(VATSG) which use [moviepy](https://github.com/Zulko/moviepy) to generate mp3 and then use [faster-whisper](https://github.com/guillaumekln/faster-whisper) to get text recognition and then use deepl-api to generate your target language subtitle file(srt format) If you are a general user who want to view any video file and mp3 file to your language, It will provide way. It's very easy to use because it has simple gui and very intuitive. So you can easily use it for any purpose. Now, you can choose to download either window installer setup type or uninstalled type. ...

Downloads: 6 This Week

Last Update: 2023-09-19
See Project
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
10

Img2Txt

Img2Txt - Extract Text From Images using AI

...Please Include the official Download Link What is Img2Txt? Img2Txt is a Python-based application packaged using PyInstaller that utilizes the power of pytesseract, an AI-powered optical character recognition (OCR) library, to extract text from images and convert it into plain text. The application features a simple and modern user-friendly interface created using customtkinter, allowing users to easily process images and obtain the text within them. Support me at : https://www.buymeacoffee.com/zsynctic it will motivate me and it will make me create more projects Support For any questions or issues, please open an issue on the Img2Txt GitHub repository. ...

1 Review

Downloads: 0 This Week

Last Update: 2023-08-15
See Project
11

Tensorflow Transformers

State of the art faster Transformer with Tensorflow 2.0

Imagine auto-regressive generation to be 90x faster. tf-transformers (Tensorflow Transformers) is designed to harness the full power of Tensorflow 2, designed specifically for Transformer based architecture. These models can be applied on text, for tasks like text classification, information extraction, question answering, summarization, translation, text generation, in over 100 languages. Images, for tasks like image classification, object detection, and segmentation. Audio, for tasks like speech recognition and audio classification. Faster AutoReggressive Decoding, TFlite support, creating TFRecords is simple. ...

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
12

Texthero

Text preprocessing, representation and visualization from zero to hero

Texthero is a python package to work with text data efficiently. It empowers NLP developers with a tool to quickly understand any text-based dataset and it provides a solid pipeline to clean and represent text data, from zero to hero.

Downloads: 0 This Week

Last Update: 2024-08-07
See Project
13

Kashgari

Kashgari is a production-level NLP Transfer learning framework

Kashgari is a simple and powerful NLP Transfer learning framework, build a state-of-art model in 5 minutes for named entity recognition (NER), part-of-speech tagging (PoS), and text classification tasks.

Downloads: 0 This Week

Last Update: 2024-08-09
See Project
14

PaddlePaddle models

Pre-trained and Reproduced Deep Learning Models

Pre-trained and Reproduced Deep Learning Models ("Flying Paddle" official model library, including a variety of academic frontier and industrial scene verification of deep learning models) Flying Paddle's industrial-level model library includes a large number of mainstream models that have been polished by industrial practice for a long time and models that have won championships in international competitions; it provides many scenarios for semantic understanding, image classification, target detection, image segmentation, text recognition, speech synthesis, etc. An end-to-end development kit that meets the needs of enterprises for low-cost development and rapid integration. The model library of Flying Paddle is an industrial-level model library tailored around the actual R&D process of domestic enterprises, serving enterprises in many fields such as energy, finance, industry, and agriculture.

Downloads: 0 This Week

Last Update: 2022-08-01
See Project
15

fastNLP

fastNLP: A Modularized and Extensible NLP Framework

...Various convenient NLP tools, such as Embedding loading (including ELMo and BERT), intermediate data cache, etc.. Provide a variety of neural network components and recurrence models (covering tasks such as Chinese word segmentation, named entity recognition, syntactic analysis, text classification, text matching, metaphor resolution, summarization, etc.). Trainer provides a variety of built-in Callback functions to facilitate experiment recording, exception capture, etc. Automatic download of some datasets and pre-trained models.

Downloads: 0 This Week

Last Update: 2022-08-05
See Project
16

Delta ML

Deep learning based natural language and speech processing platform

...Use configuration files to easily tune parameters and network structures. What you see in training is what you get in serving: all data processing and features extraction are integrated into a model graph. Text classification, named entity recognition, question and answering, text summarization, etc. Uniform I/O interfaces and no changes for new models.

Downloads: 0 This Week

Last Update: 2022-08-15
See Project
17

Tensor2Tensor

Library of deep learning models and datasets

Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and...

Downloads: 0 This Week

Last Update: 2021-05-24
See Project
18

Tensorflow and deep learning

A crash course in six episodes for software developers

...The repository covers core neural network concepts such as weights, biases, activation functions, and gradient descent, as well as more advanced techniques like convolutional networks, recurrent networks, and reinforcement learning. It includes multiple hands-on projects, such as handwritten digit recognition, airplane detection in images, and text generation using recurrent neural networks, which demonstrate how different architectures solve real-world problems.

Downloads: 0 This Week

Last Update: 2026-03-17
See Project
19

GoodByeCatpcha

Solver ReCaptcha v2 Free

An async Python library to automate solving ReCAPTCHA v2 by images/audio using Mozilla's DeepSpeech, PocketSphinx, Microsoft Azure’s, Google Speech and Amazon's Transcribe Speech-to-Text API. Also image recognition to detect the object suggested in the captcha. Built with Pyppeteer for Chrome automation framework and similarities to Puppeteer, PyDub for easily converting MP3 files into WAV, aiohttp for async minimalistic web-server, and Python’s built-in AsyncIO for convenience.

Downloads: 4 This Week

Last Update: 2020-06-24
See Project
20

NLP Best Practices

Natural Language Processing Best Practices & Examples

In recent years, natural language processing (NLP) has seen quick growth in quality and usability, and this has helped to drive business adoption of artificial intelligence (AI) solutions. In the last few years, researchers have been applying newer deep learning methods to NLP. Data scientists started moving from traditional methods to state-of-the-art (SOTA) deep neural network (DNN) algorithms which use language models pretrained on large text corpora. This repository contains examples and...

Downloads: 0 This Week

Last Update: 2022-08-01
See Project
21

Budou

Budou is an auto organizer tool for beautiful line breaking in CJK

...Budou can be used via command line, in Python scripts, or integrated into web applications, and it provides advanced options such as caching and entity recognition for improved segmentation accuracy.

Downloads: 0 This Week

Last Update: 2025-10-11
See Project
22

easy12306

Automatic recognition of 12306 verification code

Automatic recognition of 12306 verification code using machine learning algorithm. Identify never-before-seen pictures.

Downloads: 0 This Week

Last Update: 2022-08-05
See Project
23

NeuroNER

Named-entity recognition using neural networks

Named-entity recognition (NER) aims at identifying entities of interest in the text, such as location, organization and temporal expression. Identified entities can be used in various downstream applications such as patient note de-identification and information extraction systems. They can also be used as features for machine learning systems for other natural language processing tasks.

Downloads: 0 This Week

Last Update: 2022-08-12
See Project
24

OpenSeq2Seq

Toolkit for efficient experimentation with Speech Recognition

OpenSeq2Seq is a TensorFlow-based toolkit for efficient experimentation with sequence-to-sequence models across speech and NLP tasks. Its core goal is to give researchers a flexible, modular framework for building and training encoder–decoder architectures while fully leveraging distributed and mixed-precision training. The toolkit includes ready-made models for neural machine translation, automatic speech recognition, speech synthesis, language modeling, and additional NLP tasks such as...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
25

uncaptcha

Defeating Google's audio reCaptcha with 85% accuracy

uncaptcha is an open-source proof-of-concept system designed to demonstrate vulnerabilities in Google’s audio reCAPTCHA challenges by automatically solving them using speech recognition techniques. The project uses browser automation to navigate to CAPTCHA challenges, extract audio files, and process them through multiple speech-to-text services. By combining outputs from several transcription engines, the system increases the likelihood of correctly identifying the spoken digits or phrases required to solve the challenge. ...

Downloads: 0 This Week

Last Update: 2026-03-17
See Project

Previous
1
2
3
You're on page 4
5
Next

Related Searches

dms

image to text

text recognition matlab project

mp3 to srt

image2text

arabic speech recognition

auto captcha solver

photo to text

photo

gematria

Related Categories

Artificial Intelligence

Software Development

Formats and Protocols

Multimedia

Education

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise