Toolkit for conversational AI
End-to-end speech processing toolkit
A generative speech model for daily dialogue
A high-quality rapid TTS voice cloning model
Python library and CLI tool to interface with Google Translate
SoTA open-source TTS
Offline inference engine for art, real-time voice conversations
Virtual AI anchor that combines state-of-the-art technology
Scalable generative AI framework built for researchers and developers
Foundational model for human-like, expressive TTS
Towards Human-Sounding Speech
An Open Source text-to-speech system built by inverting Whisper
Toolkit for audio, music, and speech generation
VITS2 backbone with multilingual-bert
Multi-Voice and Prompt-Controlled TTS Engine
WaveRNN Vocoder + TTS
General Speech Restoration
Implementation of a Transformer based neural network
Toolkit for efficient experimentation with Speech Recognition
TensorFlow Implementation of DC-TTS: yet another text-to-speech model