Bailing is a voice dialogue robot similar to GPT-4o
Interface for OuteTTS models
A lightweight text-to-speech model with zero-shot voice cloning
StreamSpeech is a seamless model for offline speech recognition
Build Vision Agents quickly with any model or video provider
Reading book source
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
VITS2 backbone with multilingual-bert
Multi-Voice and Prompt-Controlled TTS Engine
Unofficial Parallel WaveGAN
Best practice TTS based on BERT and VITS
Chinese voice dialogue robot/smart speaker project
A webui for different audio related Neural Networks
WaveRNN Vocoder + TTS
Bangla text to speech synthesis in python
The open-source virtual assistant for Ubuntu based Linux distributions
Toolkit for efficient experimentation with Speech Recognition