jSpeech is a Java library designed to integrate Speech-to-Text (STT) capabilities, command control, and diarization (speaker identification) into applications in a simple, modular, and decoupled way.
PROJECT DEVELOPMENT MOVED TO GITHUB!
EmoFilt enables the free-for-non-commercial-use speech synthesis engine MBROLA to sound emotional by manipulating the phonetic description. It does so by modifying melody and rhythm of the speech, matching a target emotion. It is available for 34 languag
A cross-platform wrapper for common text-to-speech engines in Python
Steel is a cross-platform package for using common text-to-speech (speech synthesis) engines in Python.
Steel currently supports the following TTS software:
- Microsoft Speech API 5 (SAPI5)
- eSpeak
- NS Speech Synthesis
- FreeTTS
Documentation: http://sourceforge.net/p/steeltts/wiki/
Bug Tracker: http://sourceforge.net/p/steeltts/tickets/
If you are interested in contributing to the Steel TTS codebase, or would like to make a feature-request, please contact the lead developer, Jasper Danielson, at jrd4@rice.edu.
VEDICS (Voice Enabled Desktop Interaction and Control System) is an assistive software which lets the user to interact with the OS using voice commands. Using this software the user can access any element found on the user's screen.
Clavier virtuel et synthétiseur vocal pour les personnes ne pouvant plus parler et ayant du mal à utiliser leurs mains. Virtual keyboard and speech synthetiser for people with reduced mobility and unability to speak. In French and english.
This is an application that takes the input of ABNF code and then converts it to GRXML. Both standards adhere to the W3 standard of grammars for speech recognition.
Matsig is an object-oriented signal class library (Toolbox in MATLAB lingo) for MATLAB 6.5 and later. It implements a signal class, simplifying operations and manipulations common in audio signal processing and speech processing.
AGTK is a suite of software components for building tools for annotating linguistic signals,
time-series data which documents any kind of linguistic behavior (e.g. audio, video).
The internal data structures are based on annotation graphs.
Project c2h - cetacean to human - building Seadragon, a tool for the scientific research of the acoustic communication of cetaceans, supporting the creation, emission, and recognition of underwater whistles. The blog: http://leafyseadragon.blogspot.com/
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.
Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.