Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Repo of Qwen2-Audio chat & pretrained large audio language model
An easy 1-click way to create beautiful artwork on your PC using AI
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Qwen2.5-VL is a multimodal large language model series
Code for running inference with the SAM 3D Body model (3DB)
Detect faces in an image
Unified Multimodal Understanding and Generation Models
Block Diffusion for Ultra-Fast Speculative Decoding
Qwen-Image is a powerful image generation foundation model
Open-source large language model family from Tencent Hunyuan
ChatGLM-6B: An Open Bilingual Dialogue Language Model
A Powerful Native Multimodal Model for Image Generation
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Math-specific large language models from the Qwen2 series
Implementation of the Surya Foundation Model for Heliophysics
Capable of understanding text, audio, vision, and video
Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Blazeface is a lightweight model that detects faces in images
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Powerful open source image generation model
Open source large language model by Alibaba
A Conversational Speech Generation Model
Implementation of model parallel autoregressive transformers on GPUs
Lightweight multimodal translation model for 55 languages