Open-source framework for intelligent speech interaction
A text-to-speech, speech-to-text and speech-to-speech library
Audio server, programming language, and IDE for sound synthesis
Large Audio Language Model built for natural interactions
Multi-modal large language model designed for audio understanding
The open-source voice synthesis studio powered by Qwen3-TTS
Sonic Pi is your free code-based music creation and performance tool
Software synthesizer based on the SoundFont 2 specifications
Tokenizer-Free TTS for Multilingual Speech Generation
A multi-system chiptune tracker compatible with DefleMask modules
Collaborative programmable music
Controllable & emotion-expressive zero-shot TTS
Functional programming language for signal processing
Free open source speech synthesizer for Russian and other languages
Open Source Speech Language Model
Framework for building real-time voice and multimodal AI agents
Open speech-to-speech models and pipelines by Hugging Face
Transforming Multimodal Content into Captivating Multilingual Audio
A Systematic Framework for Interactive World Modeling
Offline text-to-speech synthesis for Python
Translate a video from one language to another and add dubbing
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Capable of understanding text, audio, vision, and video
Stable Diffusion for real-time music generation (web app)
Swift audio synthesis, processing, & analysis platform