GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Multi-modal large language model designed for audio understanding
Open-source framework for intelligent speech interaction
Qwen3-Omni is a natively end-to-end, omni-modal LLM
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
LLM-based audio editing model trained with reinforcement learning
Capable of understanding text, audio, vision, video
A state-of-the-art open visual language model
Towards Real-World Vision-Language Understanding
The ChatGPT Retrieval Plugin lets you easily find personal documents
Pushing the Limits of Mathematical Reasoning in Open Language Models
High-Resolution Image Synthesis with Latent Diffusion Models
Chat & pretrained large vision language model
Chat & pretrained large audio language model proposed by Alibaba Cloud
Open-source, high-performance Mixture-of-Experts large language model
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Powerful open source image generation model
A Conversational Speech Generation Model
Open Multilingual Multimodal Chat LMs
Release for Improved Denoising Diffusion Probabilistic Models
Official DeiT repository
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project