LLM-based Reinforcement Learning audio edit model
Chat & pretrained large vision language model
Chat & pretrained large audio language model proposed by Alibaba Cloud
High-Resolution Image Synthesis with Latent Diffusion Models
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
A Conversational Speech Generation Model
Open Multilingual Multimodal Chat LMs
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Official code for Style Aligned Image Generation via Shared Attention
Dataset of GPT-2 outputs for research in detection, biases, and more
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Let us control diffusion models
Official repo for consistency models
Fine-tuning ChatGLM-6B with PEFT
Official PyTorch Implementation of "Scalable Diffusion Models"
Repo for external large-scale work
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
A collection of high-quality models for the MuJoCo physics engine
Open-source pre-training implementation of Google's LaMDA in PyTorch
Code release for "Masked-attention Mask Transformer
PyTorch implementation of MAE
An implementation of model parallel GPT-2 and GPT-3-style models
The official pytorch implementation of our paper
Large-scale autoregressive pixel model for image generation by OpenAI