Showing 33 open source projects for "text recognition"

  • 1
    Video-subtitle-extractor

    A GUI tool for extracting hard-coded subtitles (hardsubs) from videos

    Extracts hard-coded subtitles (hardsubs) from videos and generates srt files. A deep learning-based video subtitle extraction framework, including subtitle region detection and subtitle content extraction. Text recognition runs locally with built-in OCR, so there is no need to apply for a third-party API or to call online OCR services such as Baidu and Ali. ...
    Downloads: 71 This Week
  • 2
    SpeechRecognition

    Speech recognition module for Python

    Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using...
    Downloads: 9 This Week
  • 3
    annyang!

    Speech recognition for your site

    ...You can easily add a GUI for the user to interact with Speech Recognition using Speech KITT. Speech KITT is fully customizable and comes with many different themes, and instructions on how to create your own designs.
    Downloads: 1 This Week
  • 4
    Saber

    The cross-platform open-source app built for handwriting

    Saber is the notes app built for handwriting. It's designed to be as simple and intuitive as possible, while still delivering unique features that you'll actually use. Additionally, Saber is available across all your devices, large and small, and syncs between them seamlessly. Only you can access your notes. You can sync your notes across devices knowing that they are encrypted and stored securely, and not even the server can read them. Notably, it can invert your notes when you're in dark...
    Downloads: 21 This Week
  • 5
    SCAIL

    Towards Studio-Grade Character Animation via In-Context Learning of 3D

    ...While specific documentation about SCAIL’s exact goals and implementation is limited from the repository context alone, the project appears to be part of a collection of machine learning and AI research tools that facilitate scalable model development, evaluation, or application workflows. Given its listing alongside other ZAI projects like speech recognition and text-to-speech systems, SCAIL likely emphasizes scalable, composable AI learning frameworks that support researchers and practitioners in experimenting with learning algorithms, datasets, and model components. The repository structure suggests a focus on flexibility and extensibility, with potential integration into other ZAI tooling for training or analysis.
    Downloads: 0 This Week
  • 6

    Image To Text tools

    ITTT is a free tool designed to scan and extract text from images.

    Image To Text Tools is a 100% free, user-friendly tool designed to scan images and extract the text they contain into editable text formats. Whether you need to extract text from scanned documents, photographs, or other image files, Image To Text Tools provides accurate and reliable Optical Character Recognition (OCR) capabilities to meet your needs.
    Downloads: 14 This Week
  • 7
    JSpeech

    Java library designed to integrate Speech-to-Text

    jSpeech is a Java library designed to integrate Speech-to-Text (STT) capabilities, command control, and diarization (speaker identification) into applications in a simple, modular, and decoupled way.
    Downloads: 4 This Week
  • 8
    Provides optical character recognition (OCR) solutions for the Vietnamese language.
    Downloads: 196 This Week
  • 9
    Kindle Mate(KMate)

    Kindle clippings and Kindle Vocabulary Builder manager

    KMate is the ultimate reading companion for Kindle users — and the all-new, cross-platform successor to Kindle Mate, the classic Kindle notes manager trusted by readers worldwide for over a decade. It is the only Kindle assistant that unifies cross-device import, cloud sync, vocabulary & dictionary management, flexible export, reading analytics, and AI-powered definitions — all in one app. ## KMate 3 for Windows latest (Store...
    Downloads: 25 This Week
  • 10
    The Hear

    The Hear program is made for journalists.

    To transcribe audio, the app uses the built-in speech recognition features of macOS. Turn your audio and video files into text. You can change the font size by pressing the Command and +/- keys; the chosen size is kept for later sessions. A "Hear" folder containing the text files is created on the desktop. The program is a universal binary (arm64/x86_64). You can ask questions here: https://sourceforge.net/p/the-hear/discussion
    Downloads: 0 This Week
  • 11
    VATSG

    Automatic video transcription and translated subtitle generator

    ...This is the subtitle generator (VATSG), which uses [moviepy](https://github.com/Zulko/moviepy) to generate an mp3, then [faster-whisper](https://github.com/guillaumekln/faster-whisper) for text recognition, and then the DeepL API to generate a subtitle file (srt format) in your target language. If you are a general user who wants to follow any video or mp3 file in your own language, it provides a way to do so. It is easy to use thanks to its simple, intuitive GUI, so you can use it for any purpose. You can now choose to download either the Windows installer or the version that does not require installation. ...
    Downloads: 6 This Week
  • 12
    Convert-Screenshot-To-Text
    "Note that by default, three languages are selected. If you only need to recognize English, please only select English." -No installation required. It's ready to use as soon as you open it.- I have made a major upgrade to CSTT this time, including support for all Tesseract-supported languages, improved OCR accuracy, added multiple recognition modes, added keyboard shortcuts for canvas movement and zooming, and enabled users to adjust OCR settings. If you like it, please support...
    Downloads: 10 This Week
  • 13
    Frontend Park

    Frontend projects and experiments

    frontend-park is a collection of engaging frontend projects and experiments, showcasing various web technologies. It includes examples ranging from pose and face recognition using TensorFlow.js to WebRTC-based video calls and Three.js 3D animations. This repository serves as a playground for exploring and learning modern frontend development techniques.
    Downloads: 0 This Week
  • 14
    gImageReader

    A graphical frontend to tesseract-ocr

    ...Features include:
    - Import PDF documents and images from disk, scanning devices, clipboard and screenshots
    - Process multiple images and documents in one go
    - Manual or automatic recognition area definition
    - Recognize to plain text or to hOCR documents
    - Recognized text displayed directly next to the image
    - Post-process the recognized text, including spellchecking
    - Generate PDF documents from hOCR documents
    **Note**: This page is only a mirror for the downloads. Development is happening on GitHub at https://github.com/manisandro/gImageReader, and release binaries are also posted there.
    Downloads: 89 This Week
  • 15
    chatbot_chung
    chatbot chung is a simple entertainment chatbot based on a keyword-probability algorithm, with 3D talking OpenGL avatars, written in FreeBASIC. It can import AIML data (simple question/answer, question/random answers, or single star/multi srai) saved from the open source "AIML_chung" application. An online HTML5/JavaScript version with automatic detection of 44 languages is available on the website (source included in the zip file). A SORT gentext text generation algorithm option has been added (desktop version).
    Downloads: 4 This Week
  • 16
    Speech Recognition in English & Polish

    Speech recognition software for English & Polish languages

    Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More...
    Downloads: 4 This Week
  • 17
    AudioCuesheetEditor

    Write audio cuesheets quickly and efficiently

    Tired of writing your audio cuesheets in a text editor? Here comes the solution! AudioCuesheetEditor helps you edit your audio cuesheet files. With AudioCuesheetEditor you can import text files, edit the imported data, and then export a working audio cuesheet. There are also other export possibilities, such as CSV or XML. Feel free to try it ;).
    Downloads: 16 This Week
  • 18
    ILA - teachable voice assistant

    ILA is a fully customizable and teachable voice assistant for Java

    ILA stands for (kind of) intelligent, learning assistant and is a speech recognition system, aka voice assistant, very similar to Siri, Google Now and Cortana. ILA is fully customizable, and you can teach her/him/it new things yourself, such as executing system commands, opening web pages, programs and apps, or just some basic conversation :-) ILA runs on Java and is thus compatible with Windows, Mac and Linux. It is designed to integrate with your home environment, for example to build your own free and open Amazon Echo replacement ;-) Right now the key components of ILA are the open source speech recognizer CMU Sphinx-4, Google (Speech Recognition/Text-To-Speech) and MaryTTS (Text-To-Speech). ...
    Downloads: 1 This Week
  • 19
    H.B.S.N

    Speech Recognition System

    H.B.S.N is a simple speech recognition program written in Java. It is a package of many sub-applications: Audio Player, Video Player, Email Client, Weather Application, Mp3 Tag Editor, Picture Viewer, Home Automation Application, Alarm / Timer, Folder Locker, Message Encrypt Application, and Income & Expenses Logging Application. Beyond that, voice commands can be used to open and close applications installed on the computer, open web sites, open folders on the HDD, control the built-in audio and video player, control the home automation system, read mail, read selected text, speak the time and date, and speak the weather report. Each of these tasks has a default system command, and the defaults can be replaced with custom commands.
    Downloads: 0 This Week
  • 20
    FM2TXT

    Listens to radio via RTL-SDR, recognizes the audio, and writes a text log file

    Log the speech from your favorite FM station to a text file using an rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation: https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer the GitHub source over SF: https://github.com/randaller/fm2txt For those who want to recognize from a sound card rather than from rtl-sdr (this allows transcribing NFM etc.): https://github.com/randaller/souncard2txt
    Downloads: 0 This Week
  • 21
    JAVT - Just Another Voice Transformer

    Just Another Speech Recognition and Text to Speech software.

    JAVT, or Just Another Voice Transformer (formerly called Just Another Video Transcriber), is speech recognition software that also supports text-to-speech and simple media conversion. JAVT lets you convert video files to WAV audio files using ffmpeg, and then transcribe the audio file to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and have JAVT read it aloud through text-to-speech conversion.
    Downloads: 1 This Week
  • 22
    OCR Web based

    Web-based OCR for the Firefox browser & PC

    ...id=com.ulm.ocr ========= Add-on for Opera: http://bit.ly/1F0E0wP ========= Release 1.0.1 For safety reasons, I disabled the possibility to import an image from url. Finally, I wish to inform you that you can write or draw directly on the canvas to get the subsequent character recognition and text extraction
    Downloads: 0 This Week
  • 23

    cbrTekStraktor

    An application to automatically extract text from comic books.

    ...The text extraction is achieved by a combination of statistical and graphical processing operations. It is based on the following 3 major algorithms: binarization of color images (Niblack and other methods), connected components, and K-means clustering. Tesseract is used to perform Optical Character Recognition on the extracted text.
    Downloads: 12 This Week
  • 24
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 0 This Week
  • 25
    Vision2u

    Free image processing software

    Vision2u is free image processing software for personal use and research. Basic image processing tasks can be carried out through simple operation of the software. Any webcam owner can perform simple measuring, counting, or monitoring tasks without high capital outlay.
    Downloads: 1 This Week