Official PyTorch Implementation of "Scalable Diffusion Models"
High-Resolution Image Synthesis with Latent Diffusion Models
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
Suite with Real-ESRGAN, BSRGAN , IRCNN, GFPGAN & RIFE. v4.3
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open-Source Financial Large Language Models!
Powerful open source image generation model
Open-source, high-performance Mixture-of-Experts large language model
A Conversational Speech Generation Model
A method to increase the speed and lower the memory footprint
LLaMA: Open and Efficient Foundation Language Models
An Open Bilingual Chat LLM | Open Source Bilingual Conversation LLM
Open Multilingual Multimodal Chat LMs
Implementation of model parallel autoregressive transformers on GPUs
Code release for ConvNeXt V2 model
A minimal PyTorch re-implementation of the OpenAI GPT
A collection of high-quality models for the MuJoCo physics engine
Reference implementation of the Transformer architecture optimized
Learning to Act by Watching Unlabeled Online Videos
Open-source pre-training implementation of Google's LaMDA in PyTorch
Code release for "Masked-attention Mask Transformer
PyTorch implementation of MAE
GLIDE: a diffusion-based text-conditional image synthesis model