Robust Speech Recognition via Large-Scale Weak Supervision
Industrial-level controllable zero-shot text-to-speech system
End-to-end speech processing toolkit
A Conversational Speech Generation Model
Audio codecs extracted from Android Open Source Project
Singing Voice Synthesis via Shallow Diffusion Mechanism
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Toolkit for efficient experimentation with Speech Recognition