A SOTA open-source image editing model
Chinese and English multimodal conversational language model
Repo of Qwen2-Audio chat & pretrained large audio language model
Tongyi Deep Research, the Leading Open-source Deep Research Agent
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
High-Fidelity and Controllable Generation of Textured 3D Assets
Multi-modal large language model designed for audio understanding
Open-source framework for intelligent speech interaction
RGBD video generation model conditioned on camera input
Open-weight, large-scale hybrid-attention reasoning model
Large-language-model & vision-language-model based on Linear Attention
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Capable of understanding text, audio, vision, video
Easy Docker setup for Stable Diffusion with user-friendly UI
A state-of-the-art open visual language model
High-Resolution Image Synthesis with Latent Diffusion Models
Towards Real-World Vision-Language Understanding
The ChatGPT Retrieval Plugin lets you easily find personal documents
Open-source, high-performance Mixture-of-Experts large language model
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Pushing the Limits of Mathematical Reasoning in Open Language Models
Powerful open source image generation model
Chat & pretrained large vision language model
A Conversational Speech Generation Model