Repo of Qwen2-Audio chat & pretrained large audio language model
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Pretrained time-series foundation model developed by Google Research
Long-form streaming TTS system for multi-speaker dialogue generation
The official PyTorch implementation of Google's Gemma models
Inference script for Oasis 500M
Collection of Gemma 3 variants that are trained for performance
Implementation of "MobileCLIP" CVPR 2024
ICLR2024 Spotlight: curation/training code, metadata, distribution
Official implementation of DreamCraft3D
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
The ChatGPT Retrieval Plugin lets you easily find personal documents
Implementation of the Surya Foundation Model for Heliophysics
Example Discord bot written in Python that uses the completions API
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open-source, high-performance Mixture-of-Experts large language model
Powerful open source image generation model
Official code for Style Aligned Image Generation via Shared Attention
Open Multilingual Multimodal Chat LMs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Fine-tuning ChatGLM-6B with PEFT
Official PyTorch Implementation of "Scalable Diffusion Models"