Code for openai.fm, a demo for the OpenAI Speech API
SOTA Open Source TTS
State-of-the-art TTS model under 25MB
Industrial-level controllable zero-shot text-to-speech system
Miso TTS is an 8 billion, highly emotive text-to-speech model
Towards Human-Sounding Speech
Controllable and fast Text-to-Speech for over 7000 languages
Toolkit for conversational AI
Framework for building neural networks
StreamSpeech is a seamless model for offline speech recognition
SOTA discrete acoustic codec models with 40/75 tokens per second
Virtual AI anchor that combines state-of-the-art technology
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
Towards Human-Level Text-to-Speech through Style Diffusion
Toolkit for audio, music, and speech generation
Unofficial Parallel WaveGAN
Real-Time State-of-the-art Speech Synthesis for Tensorflow 2
Transcription Aid helps you type text from recordings.