Performs actions when a detected volume threshold is crossed. Examples: launch music on a clap; launch speech recording when you start speaking; launch a guard webcam when a significant sound is detected; increase or decrease headphones volume when ambient noise pass
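As a rough illustration of the idea only (not this project's actual code), a volume-threshold trigger can be sketched in Python as below. The names `rms` and `watch_volume`, the frame format (lists of samples in the range -1.0 to 1.0), and the choice to fire once per upward crossing are all assumptions made for this sketch.

```python
import math
from typing import Callable, Iterable, Sequence


def rms(frame: Sequence[float]) -> float:
    """Root-mean-square level of one frame of samples (assumed in [-1.0, 1.0])."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def watch_volume(
    frames: Iterable[Sequence[float]],
    threshold: float,
    on_trigger: Callable[[float], None],
) -> int:
    """Call on_trigger(level) each time the level rises to or above threshold.

    Edge-triggered (an assumption of this sketch): a sustained loud passage
    fires once, and the level must fall below the threshold before the
    action can fire again. Returns the number of times the action fired.
    """
    fired = 0
    above = False
    for frame in frames:
        level = rms(frame)
        if level >= threshold and not above:
            on_trigger(level)
            fired += 1
        above = level >= threshold
    return fired
```

In practice the frames would come from a microphone capture library, and `on_trigger` would start the music player, recorder, or webcam.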
A collection of scripts and programs for automatically annotating video/audio for subtitles. It relies on a MARSYAS (Music Analysis, Retrieval and Synthesis for Audio Signals) plug-in to detect human voice in polyphonic recordings.
A collection of tools for generating audio and visual content (PNG/HTML/WAVE) for use in websites, including CAPTCHA challenges and PNG image creation tools with JavaScript mouse-tracking support.
ASR-Builder provides an easy-to-use interface to the HTK toolkit that allows users to build ASR systems. It also provides a platform that performs housekeeping tasks when using HTK, along with default training/testing/recognition scripts.
AGTK is a suite of software components for building tools for annotating linguistic signals,
time-series data which documents any kind of linguistic behavior (e.g. audio, video).
The internal data structures are based on annotation graphs.
A simple program that speaks text aloud. You can type the text or select a file.
Fala is just a frontend to Festival. It's designed for GNOME, but if you have GTK, Python and Festival you will be able to run it.
Software to fit whole-sentence language models using the principle of maximum entropy. Intended for developers of speech recognizers, text prediction interfaces, OCR, and machine translation software.
DJBorg turns your MP3 playlist into a personalized radio station, adding randomly-generated DJ banter between tracks. Song information (based on ID3 tags), news, weather, and headlines are announced via a text-to-speech engine.
The PyGE (Python Gutenberg E-text) project is a suite of
GUI desktop utilities written in Python to promote and
facilitate awareness and enjoyment of works of literature
that are available from the archives of Project Gutenberg.
SoccerPhone provides live soccer scores by phone. The only league currently supported is US Major League Soccer. Support for Soccernet is under development. SoccerPhone is written in VoiceXML, Python, and JavaScript.
Sayz Me is a text-to-speech application for Windows. Text can be typed in or read from clipboard. Words are highlighted when spoken. Select voice, adjust reading speed, voice pitch, font and color. Simple and easy to use.
This project is intended for users who want to get more out of the voice modem they may have. Why another modem project? When looking for good-quality software for voice communication through the modem, I could find only Win32-based programs. Now there is one for Linux :)