State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Language modeling in a sentence representation space
The ChatGPT Retrieval Plugin lets you easily find personal documents
Code for the paper Hybrid Spectrogram and Waveform Source Separation
High-Resolution Image Synthesis with Latent Diffusion Models
A Conversational Speech Generation Model
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open Multilingual Multimodal Chat LMs
Fine-tuning ChatGLM-6B with PEFT
A method to increase the speed and lower the memory footprint
Code release for "Masked-attention Mask Transformer
PyTorch implementation of MAE
Large-scale autoregressive pixel model for image generation by OpenAI
Environment generation code for the paper "Emergent Tool Use"
A mix of GAN implementations including progressive growing
Generate embeddings from large-scale graph-structured data
Code for the paper "Improved Techniques for Training GANs"
JetBrains’ 4B parameter code model for completions
OpenAI’s compact 20B open model for fast, agentic, and local use