A plugin for pidgin that interfaces with the popular program festival. It allows for instant messages to be spoken by festival so you can hear it through your speakers.
A collection of scripts and programs to automatically annotate video/audio for subtitles. Basically relies on a MARSYAS (Music Analysis, Retrieval and Synthesis for Audio Signals) plug-in for detecting human voice in polyphonic recordings.
A collection of tools for generating audio and visual (PNG/HTML/WAVE) for use in web sites including CAPTCHA challenges and PNG image creation tools with Javascript mouse tracking support.
ASR-Builder provides an easy-to-use interface to the HTK toolkit, that allows users to build ASR systems. ASR-Builder provides a platform that performs house-keeping tasks when using HTK and also provides default training/testing/recognition scripts.
EasyMail is a open source Emailprogram especially for cognitive disabled persons. Its completly written in C# and supports features like multiuser support, voicemails and a wizzard for protecting cognitiv overload.
----------Description in progress-----
A text to speech converter which will be able to read any document(Presently it is reading text and .doc files).The main aim of the project is to make reading an interesting task and assist BLIND people.
Speech based User Interface Components Library for Java is a project to create Java controls and applications that can be used not only by literate people but also by non-literates. Speech and visual element with minimal text is used to create components
BladeWareVXML is a portable VoiceXML 2.1 interpreter that is an enhanced version (performance, usability and integration) of OpenVXI. A commercial version, with documentation, sample code, and support options, is available from the Commetrex Website.
The SingIt Lyric Displayer is an XMMS plugin which displays formatted lyrics, including id3v2xx lyrics. It consists of the displayer and an integrated editor which allows one to easily insert time stamps, edit the text, and export & strip HTML.
The F Modular Synthesizer is a tool for creating all kinds of sounds, white and low-band noises and midi-like tunes with modulated frequencies, volumes, balances and many more features.
Implementation of Media Resource Control Protocol Client (MRCP). Supports ASR and TTS functionality. Design pattern implementation. Documentation, sample application and library source code.
eSpeak text-to-speech module for Asterisk. This provides the "espeak" dialplan application, which allows you to use the eSpeak TTS Engine as a speech synthesizer in Asterisk.
eXtace is a 3D audio visualization tool (or eye candy depending on how you look at it). eXtace requires ESD (Esound) for its sound input source. It performs a FFT (fast fourier transform) on audio and displays it via various graphical modes.
Matsig is an object-oriented signal class library (Toolbox in MATLAB lingo) for MATLAB 6.5 and later. It implements a signal class, simplifying operations and manipulations common in audio signal processing and speech processing.
Flite text-to-speech module for Asterisk. This provides the "Flite" dialplan application, which allows you to use the Flite TTS Engine as a speech synthesizer in Asterisk.
The AccessMusic project provides a set of tools for creating accessible music which are freely available to download. These tools allow you to convert music scores from traditional western music notation to formats for the Blind and visually impaired.
A patent-free audio codec designed especially for voice (unlike Vorbis which targets general audio) signals and providing good narrowband and wideband quality. This project is complementary to the Ogg Vorbis codec.
The SpeakRight Framework is a speech application framework written in Java. SpeakRight applications are fast to create and work on any (VoiceXML) speech platform. Applications are written in Java with full debug and unit testing available.