SOTA Open Source TTS
An Open Source text-to-speech system built by inverting Whisper
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Offline text-to-speech synthesis for Python
Repo of Qwen2-Audio chat & pretrained large audio language model
NLP Cloud serves high performance pre-trained or custom models for NER
Multi-lingual large voice generation model, providing inference
Offline inference engine for art, real-time voice conversations
Framework for building neural networks
A simple, high-quality voice conversion tool focused on ease of use
A TTS model capable of generating ultra-realistic dialogue
Instant voice cloning by MIT and MyShell. Audio foundation model
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
A web UI for different audio-related neural networks
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
WaveRNN Vocoder + TTS
Main repository of Project Alice; contains the main unit source code
General Speech Restoration
Conditional Variational Autoencoder with Adversarial Learning
Kashgari is a production-level NLP transfer learning framework
An implementation of Tacotron 2 that supports multilingual experiments
Library of deep learning models and datasets