Robust Speech Recognition via Large-Scale Weak Supervision
Open-source industrial-grade ASR models
Industrial-level controllable zero-shot text-to-speech system
End-to-end speech processing toolkit
A Conversational Speech Generation Model
Data manipulation and transformation for audio signal processing
Singing Voice Synthesis via Shallow Diffusion Mechanism
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Facebook AI Research's automatic speech recognition toolkit
Toolkit for efficient experimentation with Speech Recognition
Open-source speech models for Julius in English and other languages
Beamforming and Speech Recognition Toolkit