Showing 34 open source projects for "text"

View related business solutions
  • Outgrown Windows Task Scheduler? Icon
    Outgrown Windows Task Scheduler?

    Free diagnostic identifies where your workflow is breaking down—with instant analysis of your scheduling environment.

    Windows Task Scheduler wasn't built for complex, cross-platform automation. Get a free diagnostic that shows exactly where things are failing and provides remediation recommendations. Interactive HTML report delivered in minutes.
    Download Free Tool
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 1
    Whisper

    Whisper

    Robust Speech Recognition via Large-Scale Weak Supervision

    OpenAI Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented...
    Downloads: 55 This Week
    Last Update:
    See Project
  • 2
    NVIDIA NeMo

    NVIDIA NeMo

    Toolkit for conversational AI

    NVIDIA NeMo, part of the NVIDIA AI platform, is a toolkit for building new state-of-the-art conversational AI models. NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. Conversational AI architectures are typically large and require a lot of data and compute for training. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    Scribe

    Scribe

    Free, open-source, and offline speech-to-text & voice control app.

    > Scribe is a free and open-source desktop assistant that brings powerful speech-to-text and voice control capabilities directly to your PC. It allows you to dictate text into any application, create custom voice commands, launch programs, and automate your workflow with text replacements. > Designed with privacy as a top priority, Scribe works completely offline. Your voice data never leaves your computer. Powered by the Vosk engine, it supports multiple languages and provides high-quality recognition without an internet connection...
    Downloads: 78 This Week
    Last Update:
    See Project
  • 4
    AzioSpeech Recognition and Translation

    AzioSpeech Recognition and Translation

    AzioSpeech Recognition and Translation

    Starting from version 1.2.1.0, the project has been renamed to AzioSpeech Recognition and Translation and is officially published in the Microsoft Store at: https://apps.microsoft.com/detail/9PFV5DG73198 A desktop application built with Avalonia UI that provides real-time speech recognition and translation using Azure Speech Services. Convert spoken words into text and translate them into multiple languages with professional-grade accuracy. Important Setup Requirements Before using this application, you MUST have: 1. Azure Account Setup Active Azure Subscription - Create a free account at portal.azure.com Azure Speech Service Resource - You must create your own Speech Service within your Azure subscription Valid API Key & Region - Obtain these credentials from your Azure Speech Service resource 2. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Atera all-in-one platform IT management software with AI agents Icon
    Atera all-in-one platform IT management software with AI agents

    Ideal for internal IT departments or managed service providers (MSPs)

    Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
    Learn More
  • 5
    VideoSrt

    VideoSrt

    Windows-GUI

    ...Open source software tool that can recognize video speech and automatically generate subtitle SRT files. It is suitable for business scenarios that quickly and batch generate Chinese/English subtitles and text files for media (video/audio). Recognize video/audio speech to generate subtitle files (support Chinese-English translation, bilingual subtitles) Extract speech text from video/audio. Batch translation, filter processing/encoding SRT subtitle files. Using the Alibaba Cloud speech recognition interface, the accuracy is high, and the standard Mandarin/English recognition rate is over 95%. ...
    Downloads: 35 This Week
    Last Update:
    See Project
  • 6

    EasyGradeXL

    Uses speech recognition to enter grades in an Excel workbook

    This application simplifies the tedious task of entering grades in a Excel workbook by using the Google text-to-speech API. This API currently supports 137 languages and a number of dialects. The application keeps a log of the grades, in the order that they are entered and provides a readback function to easily check if the grades were entered correctly. This application was developed using Microsoft Excel Version 2108. It currently only runs under Microsoft Windows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Tensor2Tensor

    Tensor2Tensor

    Library of deep learning models and datasets

    Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    The SARK Voice Pad

    The SARK Voice Pad

    A text editor with narration and speech recognition

    A fully featured advanced text editor
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Speech Recognition in English & Polish

    Speech Recognition in English & Polish

    Speech recognition software for English & Polish languages

    Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Create and run cloud-based virtual machines. Icon
    Create and run cloud-based virtual machines.

    Secure and customizable compute service that lets you create and run virtual machines.

    Computing infrastructure in predefined or custom machine sizes to accelerate your cloud transformation. General purpose (E2, N1, N2, N2D) machines provide a good balance of price and performance. Compute optimized (C2) machines offer high-end vCPU performance for compute-intensive workloads. Memory optimized (M2) machines offer the highest memory and are great for in-memory databases. Accelerator optimized (A2) machines are based on the A100 GPU, for very demanding applications.
    Try for free
  • 10
    CMU Sphinx

    CMU Sphinx

    Speech Recognition Toolkit

    Thank you for visiting! ----> Maintenance and improvement work has MOVED to https://cmusphinx.github.io/ Please go there for the most recent software and documentation. <---- CMUSphinx is a speaker-independent large vocabulary continuous speech recognizer released under BSD style license. It is also a collection of open source tools and resources that allows researchers and developers to build speech recognition systems.
    Leader badge
    Downloads: 396 This Week
    Last Update:
    See Project
  • 11
    ILA - teachable voice assistant

    ILA - teachable voice assistant

    ILA is a fully customizable and teachable voice assistant for Java

    ...It is designed to integrate with your home enviroment and for example build up your own, free and open Amazon Echo replacement ;-) Right now the key components of ILA are the open source speech recognition CMU Sphinx-4, Google (Speech Recognition/Text-To-Speech) and MaryTTS (Text-To-Speech). The goal is to make ILA completely free of Google by improving all aspects of the open source systems. Since version 3.3 users can also write own add-ons to extend ILA. ILA's successor is the SEPIA Framework: https://sepia-framework.github.io/ Hope you enjoy ILA - Florian
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    annyang!

    annyang!

    Speech recognition for your site

    annyang is a tiny javascript library that lets your visitors control your site with voice commands. annyang supports multiple languages, has no dependencies, weighs just 2kb and is free to use. annyang understands commands with named variables, splats, and optional words. Use named variables for one word arguments in your command. Use splats to capture multi-word text at the end of your command (greedy). Use optional words or phrases to define a part of the command as optional. annyang plays nicely with all browsers, progressively enhancing browsers that support SpeechRecognition, while leaving users with older browsers unaffected. Grab the latest version of annyang.min.js, drop it in your html, and start adding commands. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    H.B.S.N

    H.B.S.N

    Speech Recognition System

    ...This software is a package of many sub applications.And those are as listed below , Audio Player Video Player Email Client Weather Application Mp3 Tag Editor Picture Viewer Home Automation Application Alarm / Timer Folder Locker Message Encrypt Application Income & Expenses Logging Application Apart from that we can do many thing from this software by using voice commands , such as , Open & close applications which are installed in the computer Open web sites Open folders which are in the HDD Control built-in audio & video player Control the home automation system Reading mails Reading selected text Speaking clock ( Time & Date) Speaking weather report There are system commands for the tasks which this application does.And we can replace the system default commands with custom commands.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    MDictate

    MDictate

    Speech to text using python, pocketsphinx, ready to deploy

    Automated speech recognition software is extremely cumbersome. This project's aim is to incrementally improve the quality of an open-source and ready to deploy speech to text recognition system. Runs on Windows using the mdictate.exe, but the core workings are found in the mdictate.py script which should work on Windows/Linux/OS X. In version 1.0, we use pocketsphinx' default setup with a basic graphic interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    FM2TXT

    FM2TXT

    RtlSdr listen to radio, recognize audio, and writes text file log

    Just log your favorite FM station speech to a text file using rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation. https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer GitHub source, not SF: https://github.com/randaller/fm2txt For those, who want to recognize from soundcard, not from rtl-sdr (this allows to transcribe NFM etc): https://github.com/randaller/souncard2txt
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    JAVT - Just Another Voice Transformer

    JAVT - Just Another Voice Transformer

    Just Another Speech Recognition and Text to Speech software.

    JAVT or Just Another Voice Transformer (formerly, it is called Just Another Video Transcriber) is a Speech Recognition software that also support text to Speech and simple media conversion. JAVT allows you to convert from video files to audio wav file using ffmpeg, and then transcribe the audio file to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and allow JAVT to read it out for you through text to speech conversion.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    Speechalyzer

    Speechalyzer

    Process large speech data wrt transcription, labeling and annotation

    ...It is implemented as a client server based framework in Java and interfaces software for speech recognition, synthesis, speech classification and quality evaluation. The application is mainly the processing of training data for speech recognition and classification models and performing benchmarking tests on speech-to-text, text-to-speech and speech classification software systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Wilson Personal Assistant

    Wilson Personal Assistant

    Personal Assistant using speech recognition and speech synthesis

    A personal assistant utilizing MS SAPI 5.4 and Program# AIMLbot (AIML Set 1.0) https://sourceforge.net/projects/aimlbot/ The goal for this project is a full personal assistant that understands speech and can respond naturally. To set this apart from other personal assistants, this PA has a "personality" via AIML. Rather than responding to canned commands, it will process the sentence spoken to it, and decide if it is actionable or a conversation. 1st stage is to finalize the speech...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    S.T.A.N Voice Recognition Software

    a software like J.A.R.V.I.S!!

    Hello everyone! I am presenting a software named S.T.A.N, a voice recognition software which recognizes your voice and performs actions like from opening to facebook to renaming, copying a file, creating a folder and many more! This software has over 50+ commands to perform by just listening to your voice! so just download it and check it out. Your comments and feedback is valuable to us. And if you like, please like our facebook page and mail us!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Responding Partner

    Responding Partner

    Control your PC computer with voice commands

    ...It also enables voice commands and controls to your computer for small task like open media files, open and close program, shutdown and restart computer,open website, type in editor, text to speech,etc. You can extend the ability by installing new plugin which available at files tab. We will continuous to update new plugin and animation character. Engine inside: - Speech Recognition - Text to Speech Requirements - Microphone
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    A speech synthesis and recognition library that is cross-platform, accessible from Java and C++, and has a very small API. Uses CMU Sphinx4 and FreeTTS internally.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 22

    voicerc

    Voice recognition utility

    This voice recognition utility that uses Google Speech API and returns recognized text into standard output.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Transcription Aid

    Transcription Aid

    Transcription Aid helps you type text from recordings.

    This software is to help type in text from speech recordings. It has several functions proven to help this type of work. However it is fully manual (aside from auto-completion), so no speech recognition if you are looking for that, but it is a great tool to do the job.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Speak Up

    Speak Up

    Text to Speech software

    'Speak Up' is a TEXT TO SPEECH software.That's not all it has speech recognition and much more.Try it for yourself and send us a feedback on our guestbook. 'Speak Up' 2,0 is packed with new features and upgrades.Check it out for yourself.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Interactive4J
    Project aim to provide simple easy APIs for Java developers to use interactive abilities in their Java Applications like speech recognition, handwriting recognition, use of web cam , sound record/play, decision trees , text to speech and many others.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next