Using OpenAI's Whisper to automatically generate YouTube subtitles
A version of the AI art creation software based on Disco Diffusion
Implementation of NWT, audio-to-video generation, in PyTorch
A data augmentations library for audio, image, text, and video
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Dia-1.6B generates lifelike English dialogue and vocal expressions