A SOTA open-source image editing model
Accurate × Fast × Comprehensive
Open-source industrial-grade ASR models
Industrial-level controllable zero-shot text-to-speech system
Pretrained time-series foundation model developed by Google Research
A Conversational Speech Generation Model
Open-source pre-training implementation of Google's LaMDA in PyTorch
Code release for "Masked-attention Mask Transformer
PyTorch implementation of MAE
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)