Code for openai.fm, a demo for the OpenAI Speech API
SOTA Open Source TTS
State-of-the-art TTS model under 25MB
Industrial-level controllable zero-shot text-to-speech system
Towards Human-Sounding Speech
Miso TTS is an 8 billion, highly emotive text-to-speech model
Controllable and fast Text-to-Speech for over 7000 languages
Toolkit for conversational AI
Framework for building neural networks
StreamSpeech is a seamless model for offline speech recognition
SOTA discrete acoustic codec models with 40/75 tokens per second
Virtual AI anchor that combines state-of-the-art technology
Towards Human-Level Text-to-Speech through Style Diffusion
Toolkit for audio, music, and speech generation
Unofficial Parallel WaveGAN
Real-Time State-of-the-art Speech Synthesis for Tensorflow 2