Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Implementation of "MobileCLIP", CVPR 2024
Official implementation of DreamCraft3D
The ChatGPT Retrieval Plugin lets you easily find personal documents
The official PyTorch implementation of Google's Gemma models
Inference script for Oasis 500M
ICLR 2024 Spotlight: curation/training code, metadata, distribution
A Customizable Image-to-Video Model based on HunyuanVideo
Code for Mesh R-CNN, ICCV 2019
Implementation of the Surya Foundation Model for Heliophysics
A SOTA open-source image editing model
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
LLM-based reinforcement learning audio editing model
Official code for Style Aligned Image Generation via Shared Attention
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open-source, high-performance Mixture-of-Experts large language model
Powerful open-source image generation model
Open Multilingual Multimodal Chat LMs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Fine-tuning ChatGLM-6B with PEFT
Official PyTorch Implementation of "Scalable Diffusion Models"
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Code release for ConvNeXt V2 model
A minimal PyTorch re-implementation of the OpenAI GPT
Reference implementation of the Transformer architecture optimized