CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Reference PyTorch implementation and models for DINOv3
ChatGLM-6B: An Open Bilingual Dialogue Language Model
The official repo of Qwen chat & pretrained large language model
Generate Any 3D Scene in Seconds
Pushing the Limits of Mathematical Reasoning in Open Language Models
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Qwen-Image is a powerful image generation foundation model
A Customizable Image-to-Video Model based on HunyuanVideo
Qwen3-Coder is the code version of Qwen3
High-Resolution Image Synthesis with Latent Diffusion Models
The official PyTorch implementation of Google's Gemma models
Foundation Models for Time Series
Hackable and optimized Transformers building blocks
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Multimodal Diffusion with Representation Alignment
Industrial-level controllable zero-shot text-to-speech system
Large Multimodal Models for Video Understanding and Editing
Revolutionizing Database Interactions with Private LLM Technology
Uncommon Objects in 3D dataset
Towards Real-World Vision-Language Understanding
CLIP, Predict the most relevant text snippet given an image
Diversity-driven optimization and large-model reasoning ability
Release for Improved Denoising Diffusion Probabilistic Models
Capable of understanding text, audio, vision, video