LLM-based reinforcement learning audio editing model
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
A Powerful Native Multimodal Model for Image Generation
The ChatGPT Retrieval Plugin lets you easily find personal documents
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open-source, high-performance Mixture-of-Experts large language model
Release for Improved Denoising Diffusion Probabilistic Models
Open-source large language model by Alibaba
Open Multilingual Multimodal Chat LMs
Powerful open-source image generation model
DeepSeek LLM: Let there be answers
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Official code for Style Aligned Image Generation via Shared Attention
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Fine-tuning ChatGLM-6B with PEFT
Official PyTorch Implementation of "Scalable Diffusion Models"
llama.go is like llama.cpp in pure Golang
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Code release for ConvNeXt V2 model
A minimal PyTorch re-implementation of the OpenAI GPT
Learning to Act by Watching Unlabeled Online Videos
Code release for "Masked-attention Mask Transformer
GLIDE: a diffusion-based text-conditional image synthesis model
Large-scale autoregressive pixel model for image generation by OpenAI