A SOTA open-source image editing model
LLM-based Reinforcement Learning audio edit model
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open-source, high-performance Mixture-of-Experts large language model
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Powerful open source image generation model
Open Multilingual Multimodal Chat LMs
An Open Bilingual Chat LLM | Open Source Bilingual Conversation LLM
Fine-tuning ChatGLM-6B with PEFT
Official PyTorch Implementation of "Scalable Diffusion Models"
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Code release for ConvNeXt V2 model
A minimal PyTorch re-implementation of the OpenAI GPT
Learning to Act by Watching Unlabeled Online Videos
Code release for "Masked-attention Mask Transformer
GLIDE: a diffusion-based text-conditional image synthesis model
Large-scale autoregressive pixel model for image generation by OpenAI
A library for Multilingual Unsupervised or Supervised word Embeddings
Code for the paper "Improved Techniques for Training GANs"
Code for reproducing key results in the paper
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201
JetBrains’ 4B parameter code model for completions
OpenAI’s compact 20B open model for fast, agentic, and local use
Vision-language-action model for robot control via images and text