The official Python SDK for the ElevenLabs API
Speech-AI-Forge is a project developed around TTS generation model
A fast TTS architecture with conditional flow matching
Build Vision Agents quickly with any model or video provider
Towards Human-Level Text-to-Speech through Style Diffusion
Singing Voice Synthesis via Shallow Diffusion Mechanism
Pre-trained and Reproduced Deep Learning Models
Process large speech data wrt transcription, labeling and annotation