Python example app from the OpenAI API quickstart tutorial
Let's make video diffusion practical
Language modeling in a sentence representation space
Official DeiT repository
A Conversational Speech Generation Model
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Learning to Act by Watching Unlabeled Online Videos
PyTorch implementation of MAE
Code for reproducing key results in the paper