Powerful Mixture-of-Experts (MoE) language model optimized for efficiency and performance
An experimental version of the DeepSeek model
Chat & pretrained large audio language model proposed by Alibaba Cloud
Official inference repo for FLUX.2 models
General-purpose image editing model that delivers high-fidelity
Open-source multi-speaker long-form text-to-speech model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Long-form streaming TTS system for multi-speaker dialogue generation
A state-of-the-art open visual language model
ChatGPT interface with better UI
Advanced language and coding AI model
Diffusion Transformer with Fine-Grained Chinese Understanding
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
A Customizable Image-to-Video Model based on HunyuanVideo
Official repository for LTX-Video
Models for object and human mesh reconstruction
Qwen-Image is a powerful image generation foundation model
New family of code large language models (LLMs)
FAIR Sequence Modeling Toolkit 2
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Qwen2.5-VL is the multimodal large language model series
Qwen3-Omni is a natively end-to-end, omni-modal LLM
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
The official repo of Qwen chat & pretrained large language model