Towards Human-Sounding Speech
TTS with Kokoro and ONNX Runtime
Virtual AI anchor that combines state-of-the-art technology
The official Python SDK for the ElevenLabs API
Multi-lingual large voice generation model, providing inference
Controllable & emotion-expressive zero-shot TTS
Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model
A text-to-speech, speech-to-text and speech-to-speech library
Toolkit for conversational AI
Foundational model for human-like, expressive TTS
Scalable generative AI framework built for researchers and developers
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Offline inference engine for art, real-time voice conversations
Speech-AI-Forge is a project developed around TTS generation models
An open-source text-to-speech system built by inverting Whisper
Controllable and fast text-to-speech for over 7000 languages
Bailing is a voice dialogue robot similar to GPT-4o
Framework for building neural networks
MARS5 speech model (TTS) from CAMB.AI
A simple native web interface that uses ChatTTS to synthesize text
Reading book source
StreamSpeech is a seamless model for offline speech recognition
Toolkit for audio, music, and speech generation
Management of Yandex Station and other smart home devices
VITS2 backbone with multilingual-bert