Open-source framework for intelligent speech interaction
A text-to-speech, speech-to-text and speech-to-speech library
Audio server, programming language, and IDE for sound synthesis
Large Audio Language Model built for natural interactions
The open-source voice synthesis studio powered by Qwen3-TTS
Multi-modal large language model designed for audio understanding
Software synthesizer based on the SoundFont 2 specifications
Sonic Pi is your free code-based music creation and performance tool
Tokenizer-Free TTS for Multilingual Speech Generation
Collaborative programmable music
A multi-system chiptune tracker compatible with DefleMask modules
Free open source speech synthesizer for Russian and other languages
Functional programming language for signal processing
Controllable & emotion-expressive zero-shot TTS
Open speech-to-speech models and pipelines by Hugging Face
Transforming Multimodal Content into Captivating Multilingual Audio
Framework for building real-time voice and multimodal AI agents
A Systematic Framework for Interactive World Modeling
Translate a video from one language to another and add dubbing
Capable of understanding text, audio, vision, and video
Open Source Speech Language Model
Offline Text To Speech synthesis for Python
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Swift audio synthesis, processing, & analysis platform
Industrial-level controllable zero-shot text-to-speech system