Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence
Speech Recognition Software
Search Results

Search Results for "dvd-audio"

x

Sort By:

Relevance

Clear All Filters

OS

Linux 44
Windows 38
Mac 28
More...
BSD 23
ChromeOS 14
Desktop Operating Systems 4
Mobile Operating Systems 4

Category

Artificial Intelligence 44
Multimedia 34
Scientific/Engineering 10
Software Development 8
Internet 3
Business 2
Communications 2
Education 2
Security 2
Database 1
Mobile 1

License

OSI-Approved Open Source 34
Other License 1
Public Domain 1

Translations

English 10
Arabic 2
Spanish 2
Dutch 1
More...
German 1
Polish 1

Programming Language

Java 14
C++ 9
C 8
Python 7
More...
Perl 4
JavaScript 3
C# 2
MATLAB 2
PHP 2
Simulink 2
Assembly 1
Free Pascal 1
Go 1
Lazarus 1
Pascal 1
Tcl 1
Unix Shell 1

Status

Pre-Alpha 10
Production/Stable 9
Beta 8
Planning 4
More...
Alpha 2
Inactive 1

Showing 44 open source projects for "dvd-audio"

View related business solutions

Speech Recognition Linux Clear Filters & Widen Search

Your monitoring isn't a stack. It's a pile. Fix that.
Errors, performance, logs, uptime. One install, one invoice, one UI.

Replace Datadog, New Relic, and Sentry without adding three more dashboards.

Free 30 days.
$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
1

SpeechRecognition

Speech recognition module for Python

Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using pip install SpeechRecognition. ...

Downloads: 6 This Week

Last Update: 3 days ago
See Project
2

Buster

Captcha solver extension for humans

Save time by asking Buster to solve captchas for you. Buster is a Firefox extension which helps you to solve difficult captchas by completing reCAPTCHA audio challenges using speech recognition. Challenges are solved by clicking on the extension button at the bottom of the reCAPTCHA widget. It is not guaranteed that challenges are always solved, the limitations of the technology need to be considered. The continued development of Buster is made possible thanks to the support of awesome backers. ...

Downloads: 37 This Week

Last Update: 2 days ago
See Project
3

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

OpenAI Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection.

Downloads: 81 This Week

Last Update: 2025-06-26
See Project
4

annyang!

Speech recognition for your site

annyang is a tiny javascript library that lets your visitors control your site with voice commands. annyang supports multiple languages, has no dependencies, weighs just 2kb and is free to use. annyang understands commands with named variables, splats, and optional words. Use named variables for one word arguments in your command. Use splats to capture multi-word text at the end of your command (greedy). Use optional words or phrases to define a part of the command as optional. annyang plays...

Downloads: 1 This Week

Last Update: 2026-03-11
See Project
Ship Agents Faster
Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free
5

Voxal voice changer

Transform your voice in real-time voxal voice changer

Voxal Voice Changer is a program that allows you to modify your voice by applying various effects (e.g. pitch change, echo, etc.) in real-time. Effects can be added in any sequence and in any combination, allowing you to distort your voice beyond recognition. Take your audio to the next level! Our powerful Voice Changer software lets you morph your voice in real-time with stunning AI-powered quality. Whether you're looking to have fun, protect your privacy, or create engaging content, we have the perfect voice for you. Audio can be captured from various sources, pre-listening is available, and the most popular audio formats are supported.

1 Review

Downloads: 33 This Week

Last Update: 2025-11-16
See Project
6

Vosk Desktop

Desktop software for controlling the Vosk Speech Recognition Toolkit

Downloads: 0 This Week

Last Update: 2023-08-10
See Project
7

VideoSrt

Windows-GUI

...Open source software tool that can recognize video speech and automatically generate subtitle SRT files. It is suitable for business scenarios that quickly and batch generate Chinese/English subtitles and text files for media (video/audio). Recognize video/audio speech to generate subtitle files (support Chinese-English translation, bilingual subtitles) Extract speech text from video/audio. Batch translation, filter processing/encoding SRT subtitle files. Using the Alibaba Cloud speech recognition interface, the accuracy is high, and the standard Mandarin/English recognition rate is over 95%. ...

Downloads: 17 This Week

Last Update: 2023-01-13
See Project
8

wav2letter++

Facebook AI research's automatic speech recognition toolkit

...After installing, run export KENLM_ROOT_DIR=... so that wav2letter++ can find it. This is needed because KenLM doesn't support a make install step.wav2letter++ expects audio and transcription data to be prepared in a specific format so that they can be read from the pipelines. Each dataset (test/valid/train) needs to be in a separate file with one sample per line. A sample is specified using 4 columns separated by space (or tabs).

Downloads: 0 This Week

Last Update: 2022-05-27
See Project
9

Tensor2Tensor

Library of deep learning models and datasets

Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and...

Downloads: 0 This Week

Last Update: 2021-05-24
See Project
Stop vibe-debugging.
Plug Claude into your app's actual errors.

AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.

Free 30 days.
10

Speech Recognition in English & Polish

Speech recognition software for English & Polish languages

...SkryBot Administracyjny - for civil and government administration. 3. SkryBot Medycyna Rodzinna - for physicians Professional version of SkryBot (commercial) offers you: 1. Audio conversion and cutting sound files into smaller ones. 2. Searching for words or phrases in sound files (recognized by SkryBot). 3. Editing sound files and automatic cutting off long silence parts in audio file.

2 Reviews

Downloads: 0 This Week

Last Update: 2020-03-15
See Project
11

CMU Sphinx

Speech Recognition Toolkit

Thank you for visiting! ----> Maintenance and improvement work has MOVED to https://cmusphinx.github.io/ Please go there for the most recent software and documentation. <---- CMUSphinx is a speaker-independent large vocabulary continuous speech recognizer released under BSD style license. It is also a collection of open source tools and resources that allows researchers and developers to build speech recognition systems.

58 Reviews

Downloads: 235 This Week

Last Update: 2024-01-11
See Project
12

NASH OS

Nash Operating System for Modern Ecommerce

The all-built-in-one, automatic, ready-to-go out-of-box, easy-to-use state-of-the-art, and really awesome NASH OS! Over 25,000+ flexible features and controls and all scalable!! The most powerful solution ever built to instantly deliver new heights of online ecommerce enterprise to you.

Downloads: 2 This Week

Last Update: 2019-03-24
See Project
13

ILA - teachable voice assistant

ILA is a fully customizable and teachable voice assistant for Java

ILA stands for (kind of) intelligent, learning assistant and is a speech recognition system aka voice assistant very similar to Siri, Google Now and Cortana. ILA is fully customizable and you can teach her/him/it new things by yourself like executing system commands, opening web pages, programs and apps or just some basic conversation :-) ILA runs on Java und thus is compatible to Windows, Mac and Linux. It is designed to integrate with your home enviroment and for example build up your own,...

4 Reviews

Downloads: 0 This Week

Last Update: 2018-07-23
See Project
14

JuliusModels

Open source speech models for Julius in English and other languages.

Open source speech models for Julius speech decoder. Its aim is to give access a wider community of speech recognition enthusiasts to quality models, which they can use in their own projects on different OS platforms (Unix, Windows, etc...) All of the models are based on HTK modelling software and data sets available freely on the Internet.

Downloads: 3 This Week

Last Update: 2018-05-11
See Project
15

FM2TXT

RtlSdr listen to radio, recognize audio, and writes text file log

Just log your favorite FM station speech to a text file using rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation. https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer GitHub source, not SF: https://github.com/randaller/fm2txt For those, who want to recognize from soundcard, not from rtl-sdr (this allows to transcribe NFM etc): https://github.com/randaller/souncard2txt

Downloads: 1 This Week

Last Update: 2017-12-17
See Project
16

Lip Reading

Cross Audio-Visual Recognition using 3D Architectures

The input pipeline must be prepared by the users. This code is aimed to provide the implementation for Coupled 3D Convolutional Neural Networks for audio-visual matching. Lip-reading can be a specific application for this work. Audio-visual recognition (AVR) has been considered as a solution for speech recognition tasks when the audio is corrupted, as well as a visual recognition method used for speaker verification in multi-speaker scenarios. The approach of AVR systems is to leverage the extracted information from one modality to improve the recognition ability of the other modality by complementing the missing information. ...

Downloads: 4 This Week

Last Update: 2022-08-11
See Project
17

Distant Speech Recognition

Beamforming and Speech Recognition Toolkit

BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and...

Downloads: 2 This Week

Last Update: 2019-08-21
See Project
18

High-order HMM in Matlab

Implementation of duration high-order hidden Markov model in Matlab.

Implementation of duration high-order hidden Markov model (DHO-HMM) in Matlab with application in speech recognition.

2 Reviews

Downloads: 0 This Week

Last Update: 2015-02-15
See Project
19

jaivox

Speech recognition application builder and library

Java library and tools to create open source speech recognition applications. Generates dialogs for conversational interfaces. Works with a popular open source speech recognition library.

Downloads: 0 This Week

Last Update: 2015-03-26
See Project
20

InproTK

An Incremental Spoken Dialogue Processing Toolkit

InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/

Downloads: 0 This Week

Last Update: 2015-06-16
See Project
21

Voce

A speech synthesis and recognition library that is cross-platform, accessible from Java and C++, and has a very small API. Uses CMU Sphinx4 and FreeTTS internally.

3 Reviews

Downloads: 0 This Week

Last Update: 2013-10-03
See Project
22

HMM Speech Recognition in Java

HMM Speech Recognition in Java

HMM Speech Recognition in Java

Downloads: 0 This Week

Last Update: 2013-09-21
See Project
23

HMM Speech Recognition in Matlab

A speech recognition system using Matlab/Simulink/Stateflow.

This project provide hidden Markov model speech recognition system by using Matlab/Simulink/Stateflow.

4 Reviews

Downloads: 0 This Week

Last Update: 2016-07-25
See Project
24

High-order HMM in Java

A duration high-order hidden Markov model (DHO-HMM) in Java.

This project provides an implementation of duration high-order hidden Markov model (DHO-HMM) in Java. It is compactible with JDK 5 & 6. It was used in the author's research on speech recognition of Mandarin digits. There are some Chinese words in this project and I am afraid that I don't have enough time to translate to English recently.

Downloads: 0 This Week

Last Update: 2013-09-16
See Project
25

audioLock

...It is a simple base program that is very flexible in terms of expansion, with further potential implementation as a means to password protect files, authenticate other applications, or act as a password manager. The AudioLock voice password application is designed to run on locally on an android device and to allow users to setup an account with a master and audio password, edit a preexisting account, utilize voice and master password to lock and unlock chosen device.

Downloads: 0 This Week

Last Update: 2012-12-10
See Project

Previous
You're on page 1
2
Next

Related Searches

voice changer

whisper-windows-x64.exe

whisper

pyaudio-0.2.11-cp314-cp314-win_amd64.whl

cmusphinx-zh-cn-5.2.tar.gz

pyaudio-0.2.14-cp314-cp314-win_amd64.whl

pyaudio-0.2.14-cp312-cp312-win_amd64.whl

whisper-bin-x64.zip

phone flash softwares

pyaudio

Related Categories

Artificial Intelligence

Multimedia

Scientific/Engineering

Software Development

Internet

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise