Text to Speech engine for English and many other languages. Compact size with clear but artificial pronunciation. Available as a command-line program with many options, a shared library for Linux, and a Windows SAPI5 version.
A simple noise gate app intended for use with VOIPs like Skype.
Ever wanted to cut out background noise when talking with others on Skype? Now it's possible! NoiseGator is a light-weight noise gate application that routes audio through an audio input to an audio output. In real-time the audio level is analysed and if the average level is higher than the threshold the audio bypasses as normal. However, if the average level goes below the threshold, the gate closes and the audio is cut. When used with a virtual audio cable it can act as a noise gate for a either a sound input(microphone) or sound output(speakers). Can also be used to gate noise from your own mic or play your microphone through your speakers. REQUIREMENTS: - Java 7 or higher for Windows. - Java 6 or higher for Mac. Java 7 recommended. - A virtual audio cable is required for use with VOIPs: For Windows users I recommend the VB-Cable driver (http://vb-audio.pagesperso-orange.fr/Cable/index.htm). Mac users can use SoundFlower.
Low-latency, high quality voice chat for gamers
Mumble is an open source, low-latency, high quality voice chat software primarily intended for use while gaming. It includes game linking, so voice from other players comes from the direction of their characters, and has echo cancellation so the sound from your loudspeakers won't be audible to other players.
SPTK is a suite of speech signal processing tools for UNIX environments, e.g., LPC analysis, PARCOR analysis, LSP analysis, PARCOR synthesis filter, LSP synthesis filter, vector quantization techniques, and other extended versions of them.
eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.
a tool for segmenting, labeling and transcribing speech
FreeTTS is a speech synthesis engine written entirely in the Java(tm) programming language. FreeTTS was written by the Sun Microsystems Laboratories Speech Team and is based on CMU's Flite engine. FreeTTS also includes a partial JSAPI 1.0
Subtitle translator from one natural language to other.
Translating subtitles in format SubRip from one natural language to other. It is based on Google Translate without API and therefore without payment. Translator have automatic and manual spell checkers.
speech recognition software for Polish language
Software for speech recognition in Polish language. Large vocabulary continuous speech recognition (LVCSR) and Commands. SkryBot recognises speech in Polish language and changes it into text by using: 1.microphone, 2.sound files (turned off in demo version). SkryBot recognises dictated speech and converts it into text. This means, that if you speak to the microphone or you use earlier recorded sound file, SkryBot will change it into text. SkryBot offers you: 1. audio conversion and cutting sound files into smaller ones, 2. searching for words or phrases in sound files (recognised by SkryBot), 3. editing sound files and automatic cutting off long silence parts in the recording, 4. improving accuracy of recognition. Versions of SkryBot: 1. SkryBot Prawo - for courts, lawyers, police, 2. SkryBot Administracyjny - for civil and government administration, 3. SkryBot Medycyna Rodzinna - for doctors, hospitals. https://sourceforge.net/p/skrybotdomowy/wiki/Home
MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
The project provides a ready-to-use interface for the julius CSR engine for a handicapped child which is not able to use the keyboard well. It integrates into X11 and Windows. Find out how you can help: http://simon-listens.org/index.php?support
A Biblia Falada é um software para leitura e estudo da Biblia Sagrada. Muito simples de usar e totalmente acessível para deficientes visuais, traz, além do novo sistema de leitura, os textos completos da edição Revista e Atualizada.
A speech synthesis and recognition library that is cross-platform, accessible from Java and C++, and has a very small API. Uses CMU Sphinx4 and FreeTTS internally.
Clavier virtuel et synthétiseur vocal pour les personnes ne pouvant plus parler et ayant du mal à utiliser leurs mains. Virtual keyboard and speech synthetiser for people with reduced mobility and unability to speak. In French and english.
Virtual Hypnotist is a software application that aims to provide a virtual interactive hypnosis session framework, for many uses. It is a rewrite of the Hypnotizer 2000 software. See the readme.txt file for legal info.
A modular, extensible Hebrew text-to-speech engine tuned for Standard Israeli Hebrew, and associated tools.
A speech recognition system using Matlab/Simulink/Stateflow.
This project provide hidden Markov model speech recognition system by using Matlab/Simulink/Stateflow.
odt2daisy is an OpenOffice.org Writer extension, enabling to export in DAISY XML, Full DAISY (xml+audio) and Audiobook format. DAISY is an NISO Z39.86 standard for blind, visual impaired, print-disabled, and learning-disabled people.
Sermon Recorder is a program for recording sermons or anything else.
Sermon Recorder is a program for recording sermons, speeches or anything else. It has some special features, such as automatic filename creation with parameters, almost "dummy-resistant" and localized user-interface (currently English and German), commandline-call after recording stop and much more... The recorded data is directly stream-encoded and written into the defined files, so the risk in case of a PC crash is minimized. The file name can be entered or changed throughout the whole recording. The recorded files are then renamed after recording stopped. Ideas for new features and help concerning translation to more languages is welcome...
ILA is a fully customizable and teachable voice assistant for Java
ILA stands for (kind of) intelligent, learning assistant and is a speech recognition system aka voice assistant very similar to Siri, Google Now and Cortana. ILA is fully customizable and you can teach her/him/it new things by yourself like executing system commands, opening web pages, programs and apps or just some basic conversation :-) ILA runs on Java und thus is compatible to Windows, Mac and Linux. It is designed to integrate with your home enviroment and for example build up your own, free and open Amazon Echo replacement ;-) Right now the key components of ILA are the open source speech recognition CMU Sphinx-4, Google (Speech Recognition/Text-To-Speech) and MaryTTS (Text-To-Speech). The goal is to make ILA completely free of Google by improving all aspects of the open source systems. Since version 3.3 users can also write own add-ons to extend ILA. Hope you enjoy ILA - Florian
Audiobook Cutter is an easy-to-use tool which splits large speech MP3 files into smaller ones without re-encoding. The split points are determined by silent parts. The main purpose is to make audiobooks or podcasts more manageable in a user-friendly way.
Dhvani is Text-to-Speech System for Indic Languages. Current C- GNU/Linux implementation supports Hindi, Kannada, Marathi, Malayalam, Gujarati, Bengali, Telugu, Panjabi, Tamil and Oriya.
A plugin for Teamspeak3. This plugin allows you to autofollow a user.
A plugin for Teamspeak3. This plugin allows you to follow a user while he switches through channels. For the love menu just right click any name in the server view.
mp3 library, advanced ID3V1 and ID3V2 tagger, player. Organize a large mp3 library, over 40,000 songs. Speech synthesis and tag backup utilities. Scripts to maintain and organize song files.
EMU is a collection of software tools for the creation, manipulation and analysis of speech databases. At the core of EMU is a database search engine which allows queries based on the sequential and hierarchical structure of the annotations.