Powerful AI language model (MoE) optimized for efficiency and performance
Official inference repo for FLUX.2 models
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Chat & pretrained large audio language model proposed by Alibaba Cloud
Open-source multi-speaker long-form text-to-speech model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Models for object and human mesh reconstruction
Official repository for LTX-Video
Diffusion Transformer with Fine-Grained Chinese Understanding
ChatGPT interface with better UI
A state-of-the-art open visual language model
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Qwen-Image is a powerful image generation foundation model
Long-form streaming TTS system for multi-speaker dialogue generation
Qwen2.5-VL is the multimodal large language model series
Advanced language and coding AI model
Reference PyTorch implementation and models for DINOv3
General-purpose image editing model that delivers high-fidelity
New family of code large language models (LLMs)
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Chat & pretrained large vision language model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
A Customizable Image-to-Video Model based on HunyuanVideo
The official repo of Qwen chat & pretrained large language model