Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence
Speech Recognition Software
Search Results

Search Results for "using"

x

Sort By:

Relevance

Clear All Filters

OS

Windows 29
Linux 21
Mac 17
More...
BSD 3
ChromeOS 1
Mobile Operating Systems 1

Category

Artificial Intelligence 37
Multimedia 10
Scientific/Engineering 4
Business 2
Communications 1
Database 1
Education 1
Internet 1
Security 1
Software Development 1
System 1
Terminals 1

License

OSI-Approved Open Source 21
Public Domain 2
GNU Free Documentation License 1
Other License 1

Translations

English 5
German 1
Indonesian 1

Programming Language

Python 9
C++ 3
C# 3
Java 2
More...
JavaScript 2
Visual Basic 2
BASIC 1
C 1
Delphi/Kylix 1
Go 1
MATLAB 1
Pascal 1
Simulink 1
Visual Basic .NET 1

Status

Beta 5
Pre-Alpha 4
Alpha 3
Production/Stable 3

Showing 37 open source projects for "using"

View related business solutions

Speech Recognition Clear Filters & Widen Search

Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)

Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.

Learn More
AI-generated apps that pass security review
Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.

Try Retool free
1

Buster

Captcha solver extension for humans

Save time by asking Buster to solve captchas for you. Buster is a Firefox extension which helps you to solve difficult captchas by completing reCAPTCHA audio challenges using speech recognition. Challenges are solved by clicking on the extension button at the bottom of the reCAPTCHA widget. It is not guaranteed that challenges are always solved, the limitations of the technology need to be considered. The continued development of Buster is made possible thanks to the support of awesome backers. If you'd like to join them, please consider contributing with Patreon, PayPal or Bitcoin. ...

Downloads: 40 This Week

Last Update: 2024-06-04
See Project
2

SpeechRecognition

Speech recognition module for Python

...Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using pip install SpeechRecognition. The first software requirement is Python 2.6, 2.7, or Python 3.3+. This is required to use the library. PyAudio is required if and only if you want to use microphone input (Microphone). PyAudio version 0.2.11+ is required, as earlier versions have known memory management bugs when recording from microphones in certain situations. ...

Downloads: 12 This Week

Last Update: 2025-12-31
See Project
3

Omnilingual ASR

Omnilingual ASR Open-Source Multilingual SpeechRecognition

Omnilingual-ASR is a research codebase exploring automatic speech recognition that generalizes across a very large number of languages using shared modeling and training recipes. It focuses on leveraging self-supervised audio pretraining and scalable fine-tuning so low-resource languages can benefit from high-resource data. The project provides data preparation pipelines, training scripts, decoding utilities, and evaluation tools so researchers can reproduce results and extend to new language sets. ...

Downloads: 1 This Week

Last Update: 2025-12-12
See Project
4

AzioSpeech Recognition and Translation

AzioSpeech Recognition and Translation

Starting from version 1.2.1.0, the project has been renamed to AzioSpeech Recognition and Translation and is officially published in the Microsoft Store at: https://apps.microsoft.com/detail/9PFV5DG73198 A desktop application built with Avalonia UI that provides real-time speech recognition and translation using Azure Speech Services. Convert spoken words into text and translate them into multiple languages with professional-grade accuracy. Important Setup Requirements Before using this application, you MUST have: 1. Azure Account Setup Active Azure Subscription - Create a free account at portal.azure.com Azure Speech Service Resource - You must create your own Speech Service within your Azure subscription Valid API Key & Region - Obtain these credentials from your Azure Speech Service resource 2. ...

Downloads: 1 This Week

Last Update: 2025-11-27
See Project
Outgrown Windows Task Scheduler?
Free diagnostic identifies where your workflow is breaking down—with instant analysis of your scheduling environment.

Windows Task Scheduler wasn't built for complex, cross-platform automation. Get a free diagnostic that shows exactly where things are failing and provides remediation recommendations. Interactive HTML report delivered in minutes.

Download Free Tool
5

Mice MX OS speech to text Voice Control

Mice speech to text with MX Cinnamon OS ISO

Note about this image This image contains a system based on Linux MX, which was created to improve accessibility within the Linux environment. The distribution uses the Cinnamon desktop interface, which is configured to be operated using voice commands and outputs. The user interface and the control of your own devices and home automation systems can be customized and extended. The voice control program MiceStTM.py was developed to enable easy adaptation to other languages. However, only German settings are currently implemented. category: System commands comment: Screen grid trigger: Display screen (Ras....

Downloads: 0 This Week

Last Update: 2025-05-14
See Project
6

Maia

MAIA (MyApp Intelligence Artificial) is designed to provide a foundation for building your own voice-controlled assistant with Python. It uses various libraries and modules for speech recognition, text-to-speech synthesis, and custom functionality.

Downloads: 0 This Week

Last Update: 2024-04-21
See Project
7

ASRT Speech Recognition

A Deep-Learning-Based Chinese Speech Recognition System

ASRT is an end-to-end deep-learning Chinese ASR system built with TensorFlow/Keras, using convolution + CTC and a Max-Entropy HMM language model. It provides a REST/gRPC server backend and client SDKs in multiple languages (Python, Java, Go, Windows). Notably lightweight, it performs well without needing GPU acceleration and runs across platforms, targeting developers and researchers building Chinese voice interfaces.

Downloads: 1 This Week

Last Update: 2025-07-03
See Project
8

VideoSrt

Windows-GUI

...Recognize video/audio speech to generate subtitle files (support Chinese-English translation, bilingual subtitles) Extract speech text from video/audio. Batch translation, filter processing/encoding SRT subtitle files. Using the Alibaba Cloud speech recognition interface, the accuracy is high, and the standard Mandarin/English recognition rate is over 95%. Video recognition does not need to upload the original video, which is convenient, fast and time-saving.

Downloads: 49 This Week

Last Update: 2023-01-13
See Project
9

EasyGradeXL

Uses speech recognition to enter grades in an Excel workbook

This application simplifies the tedious task of entering grades in a Excel workbook by using the Google text-to-speech API. This API currently supports 137 languages and a number of dialects. The application keeps a log of the grades, in the order that they are entered and provides a readback function to easily check if the grades were entered correctly. This application was developed using Microsoft Excel Version 2108.

Downloads: 0 This Week

Last Update: 2021-12-21
See Project
Grafana: The open and composable observability platform
Faster answers, predictable costs, and no lock-in built by the team helping to make observability accessible to anyone.

Grafana is the open source analytics & monitoring solution for every database.

Learn More
10

wav2letter++

Facebook AI research's automatic speech recognition toolkit

...This is needed because KenLM doesn't support a make install step.wav2letter++ expects audio and transcription data to be prepared in a specific format so that they can be read from the pipelines. Each dataset (test/valid/train) needs to be in a separate file with one sample per line. A sample is specified using 4 columns separated by space (or tabs).

Downloads: 0 This Week

Last Update: 2022-05-27
See Project
11

ASR for Medical Reporting

Automatic speech recognition system for medical reporting in spanish.

This is a functional prototype of automatic speech recognition system for medical reporting in Spanish using CMU Sphinx4 ASR toolkit. This ASR use pre-trained acoustic model and context dependent language model in nuclear medicine diagnostics.

Downloads: 0 This Week

Last Update: 2020-07-15
See Project
12

KALDI IVR ASTERISK SPEECH

Working template to create an Asterisk IVR system using kaldi

Working template to create an Asterisk IVR system using kaldi for speech recognition. IVR based speech recognition.

Downloads: 0 This Week

Last Update: 2018-12-15
See Project
13

annyang!

Speech recognition for your site

...Grab the latest version of annyang.min.js, drop it in your html, and start adding commands. You can easily add a GUI for the user to interact with Speech Recognition using Speech KITT. Speech KITT is fully customizable and comes with many different themes, and instructions on how to create your own designs.

Downloads: 0 This Week

Last Update: 2021-09-13
See Project
14

H.B.S.N

Speech Recognition System

H.B.S.N is a simple speech recognition software which programmed using Java. This software is a package of many sub applications.And those are as listed below , Audio Player Video Player Email Client Weather Application Mp3 Tag Editor Picture Viewer Home Automation Application Alarm / Timer Folder Locker Message Encrypt Application Income & Expenses Logging Application Apart from that we can do many thing from this software by using voice commands , such as , Open & close applications which are installed in the computer Open web sites Open folders which are in the HDD Control built-in audio & video player Control the home automation system Reading mails Reading selected text Speaking clock ( Time & Date) Speaking weather report There are system commands for the tasks which this application does.And we can replace the system default commands with custom commands.

1 Review

Downloads: 0 This Week

Last Update: 2018-06-30
See Project
15

MDictate

Speech to text using python, pocketsphinx, ready to deploy

Automated speech recognition software is extremely cumbersome. This project's aim is to incrementally improve the quality of an open-source and ready to deploy speech to text recognition system. Runs on Windows using the mdictate.exe, but the core workings are found in the mdictate.py script which should work on Windows/Linux/OS X. In version 1.0, we use pocketsphinx' default setup with a basic graphic interface.

Downloads: 0 This Week

Last Update: 2018-03-15
See Project
16

FM2TXT

RtlSdr listen to radio, recognize audio, and writes text file log

Just log your favorite FM station speech to a text file using rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation. https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer GitHub source, not SF: https://github.com/randaller/fm2txt For those, who want to recognize from soundcard, not from rtl-sdr (this allows to transcribe NFM etc): https://github.com/randaller/souncard2txt

Downloads: 2 This Week

Last Update: 2017-12-17
See Project
17

Lip Reading

Cross Audio-Visual Recognition using 3D Architectures

...We proposed the utilization of a coupled 3D Convolutional Neural Network (CNN) architecture that can map both modalities into a representation space to evaluate the correspondence of audio-visual streams using the learned multimodal features.

Downloads: 1 This Week

Last Update: 2022-08-11
See Project
18

JAVT - Just Another Voice Transformer

Just Another Speech Recognition and Text to Speech software.

JAVT or Just Another Voice Transformer (formerly, it is called Just Another Video Transcriber) is a Speech Recognition software that also support text to Speech and simple media conversion. JAVT allows you to convert from video files to audio wav file using ffmpeg, and then transcribe the audio file to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and allow JAVT to read it out for you through text to speech conversion.

Downloads: 4 This Week

Last Update: 2020-08-19
See Project
19

Speech Recognition Using MFCC-DTW

Downloads: 0 This Week

Last Update: 2017-01-26
See Project
20

Wilson Personal Assistant

Personal Assistant using speech recognition and speech synthesis

...Rather than responding to canned commands, it will process the sentence spoken to it, and decide if it is actionable or a conversation. 1st stage is to finalize the speech recognition and bot personality. 2nd stage to incorporate a knowledge base using NELL , Wolfram Alpha, and Google API. This will allow the bot to answer any question. 3rd stage is the personal assistant. Calendar , email , finance, organization management. Media control, device & file management. The project will be kept modular. AIMLbot source: https://sourceforge.net/projects/aimlbot/

Downloads: 0 This Week

Last Update: 2016-11-07
See Project
21

CSLU_KALDI

speach recognision using kaldi

adjusting KALDI speech recognition to new corpus.

Downloads: 0 This Week

Last Update: 2015-05-03
See Project
22

Specimen Photography for Canon Powershot

SpecimenPhoto controls a Canon Powershot camera for specimen archival photography. Each photograph is assigned a case number, labeled and stored. Identification is manual or "hands free" using separately available barcode and speech recognition modules.

Downloads: 0 This Week

Last Update: 2015-04-08
See Project
23

Speech Recognition System

Speech Recognition System - Matlab source code

Speech recognition technology is used more and more for telephone applications like travel booking and information, financial account information, customer service call routing, and directory assistance. Using constrained grammar recognition, such applications can achieve remarkably high accuracy. Research and development in speech recognition technology has continued to grow as the cost for implementing such voice-activated systems has dropped and the usefulness and efficacy of these systems has improved. For example, recognition systems optimized for telephone applications can often supply information about the confidence of a particular recognition, and if the confidence is low, it can trigger the application to prompt callers to confirm or repeat their request. ...

Downloads: 1 This Week

Last Update: 2015-03-18
See Project
24

HMM Speech Recognition in Matlab

A speech recognition system using Matlab/Simulink/Stateflow.

This project provide hidden Markov model speech recognition system by using Matlab/Simulink/Stateflow.

4 Reviews

Downloads: 0 This Week

Last Update: 2016-07-25
See Project
25

Odin ASR

Automatic Speech Recognition

Automatic Speech Recognition using: -HTK toolkit -TIDIGIT (Connected Digits Texas Instruments) -HMM method -VTLN method

Downloads: 0 This Week

Last Update: 2013-05-30
See Project

Previous
You're on page 1
2
Next

Related Searches

captcha

text to speech

speech

mi 10

cinnamon desktop

video to srt

easygradexl

arabic audio transcription

speech recognition using matlab

ivr for asterisk

Related Categories

Artificial Intelligence

Multimedia

Scientific/Engineering

Business

Communications

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise

×

Thanks for helping keep SourceForge clean.

X

You seem to have CSS turned off. Please don't fill out this field.

You seem to have CSS turned off. Please don't fill out this field.

Briefly describe the problem (required):

Upload screenshot of ad (required):

Select a file, or drag & drop file here.

✔

✘

Screenshot instructions:

Click URL instructions:
Right-click on the ad, choose "Copy Link", then paste here →
(This may not be possible with some types of ads)

More information about our ad policies

Ad destination/click URL: