Offline inference engine for art, real-time voice conversations
Python library and CLI tool to interface with Google Translate
State-of-the-art discrete acoustic codec models with 40 or 75 tokens per second
Unofficial Parallel WaveGAN
Process large speech datasets for transcription, labeling, and annotation