Open-source framework for intelligent speech interaction
A text-to-speech, speech-to-text and speech-to-speech library
Audio server, programming language, and IDE for sound synthesis
Large Audio Language Model built for natural interactions
The open-source voice synthesis studio powered by Qwen3-TTS
Multi-modal large language model designed for audio understanding
Sonic Pi is your free code-based music creation and performance tool
Software synthesizer based on the SoundFont 2 specifications
Tokenizer-Free TTS for Multilingual Speech Generation
A multi-system chiptune tracker compatible with DefleMask modules
Collaborative programmable music
Functional programming language for signal processing
Controllable & emotion-expressive zero-shot TTS
Free, open-source speech synthesizer for Russian and other languages
Open Source Speech Language Model
Open speech-to-speech models and pipelines by Hugging Face
Framework for building real-time voice and multimodal AI agents
Transforming Multimodal Content into Captivating Multilingual Audio
A Systematic Framework for Interactive World Modeling
Offline text-to-speech synthesis for Python
Capable of understanding text, audio, vision, video
Translate a video from one language to another and embed the dubbing
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Stable diffusion for real-time music generation (web app)
Swift audio synthesis, processing, & analysis platform