Implementation of Phenaki Video, which uses Mask GIT
Implementation of AudioLM audio generation model in Pytorch
C++ Implementation of PyTorch Tutorials for Everyone
Simplest working implementation of Stylegan2
Implementation of Video Diffusion Models
Toolkit for conversational AI
Fast image augmentation library and an easy-to-use wrapper
Implementation of Make-A-Video, new SOTA text to video generator
State-of-the-art diffusion models for image and audio generation
Multilingual sentence & image embeddings with BERT
Tensor search for humans
Implementation of 'lightweight' GAN, proposed in ICLR 2021
Implementation of Recurrent Interface Network (RIN)
Implementation of MusicLM music generation model in Pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
textgen, Text Generation models
CLIP + FFT/DWT/RGB = text to image/video
Run 100B+ language models at home, BitTorrent-style
Implementation / replication of DALL-E, OpenAI's Text to Image
Audio generation using diffusion models, in PyTorch
Implementation of NÜWA, attention network for text to video synthesis
MMGeneration is a powerful toolkit for generative models
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Locally run an Instruction-Tuned Chat-Style LLM
Audio generation using diffusion models