Open-source framework for intelligent speech interaction
A text-to-speech, speech-to-text and speech-to-speech library
Audio server, programming language, and IDE for sound synthesis
Large Audio Language Model built for natural interactions
Multi-modal large language model designed for audio understanding
The open-source voice synthesis studio powered by Qwen3-TTS
Sonic Pi is your free code-based music creation and performance tool
Software synthesizer based on the SoundFont 2 specifications
Tokenizer-Free TTS for Multilingual Speech Generation
Collaborative programmable music
A multi-system chiptune tracker compatible with DefleMask modules
Free open source speech synthesizer for Russian and other languages
Controllable & emotion-expressive zero-shot TTS
Functional programming language for signal processing
Open Source Speech Language Model
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Framework for building real-time voice and multimodal AI agents
Transforming Multimodal Content into Captivating Multilingual Audio
A Systematic Framework for Interactive World Modeling
Translate the video from one language to another and embed dubbing
Offline Text To Speech synthesis for python
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Capable of understanding text, audio, vision, video
Swift audio synthesis, processing, & analysis platform
Stable diffusion for real-time music generation (web app)