Implementation of Phenaki Video, which uses Mask GIT
Implementation of AudioLM audio generation model in Pytorch
C++ Implementation of PyTorch Tutorials for Everyone
Simplest working implementation of Stylegan2
Implementation of Video Diffusion Models
Toolkit for conversational AI
Fast image augmentation library and an easy-to-use wrapper
Implementation of Make-A-Video, new SOTA text to video generator
State-of-the-art diffusion models for image and audio generation
Multilingual sentence & image embeddings with BERT
Tensor search for humans
Implementation of 'lightweight' GAN, proposed in ICLR 2021
Implementation of Recurrent Interface Network (RIN)
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
Implementation of MusicLM music generation model in Pytorch
textgen, Text Generation models
CLIP + FFT/DWT/RGB = text to image/video
Run 100B+ language models at home, BitTorrent-style
Implementation / replication of DALL-E, OpenAI's Text to Image
Audio generation using diffusion models, in PyTorch
MMGeneration is a powerful toolkit for generative models
Implementation of NÜWA, attention network for text to video synthesis
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Locally run an Instruction-Tuned Chat-Style LLM
Audio generation using diffusion models