MOSS-TTS-Nano is an open-source multilingual tiny speech generation
A nearly-live implementation of OpenAI's Whisper
Speech-AI-Forge is a project developed around TTS generation model
FAIR Sequence Modeling Toolkit 2
MARS5 speech model (TTS) from CAMB.AI
A text-to-speech, speech-to-text and speech-to-speech library
Towards Human-Sounding Speech
VITS2 backbone with multilingual-bert
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Implementation of a Transformer based neural network
TensorFlow Implementation of DC-TTS: yet another text-to-speech model