Powerful AI language model (MoE) optimized for efficiency/performance
Chat & pretrained large audio language model proposed by Alibaba Cloud
Advanced language and coding AI model
Official inference repo for FLUX.2 models
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Models for object and human mesh reconstruction
A state-of-the-art open visual language model
Diffusion Transformer with Fine-Grained Chinese Understanding
Open-source multi-speaker long-form text-to-speech model
Qwen-Image is a powerful image generation foundation model
ChatGPT interface with better UI
Official repository for LTX-Video
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Controllable & emotion-expressive zero-shot TTS
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Qwen2.5-VL is the multimodal large language model series
RGBD video generation model conditioned on camera input
OCR expert VLM powered by Hunyuan's native multimodal architecture
Pokee Deep Research Model Open Source Repo
New family of code large language models (LLMs)
Reference PyTorch implementation and models for DINOv3
Qwen3-omni is a natively end-to-end, omni-modal LLM
The official repo of Qwen chat & pretrained large language model