Repo of Qwen2-Audio chat & pretrained large audio language model
Tongyi Deep Research, the Leading Open-source Deep Research Agent
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Open-weight, large-scale hybrid-attention reasoning model
Large-language-model & vision-language-model based on Linear Attention
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Capable of understanding text, audio, vision, video
Genome modeling and design across all domains of life
Easy Docker setup for Stable Diffusion with user-friendly UI
ChatGPT interface with better UI
A state-of-the-art open visual language model
High-Resolution Image Synthesis with Latent Diffusion Models
Towards Real-World Vision-Language Understanding
The ChatGPT Retrieval Plugin lets you easily find personal documents
AI-powered tool to quickly remove watermarks from images flawlessly
AI Suite for upscaling, interpolating & restoring images/videos
Open-source, high-performance Mixture-of-Experts large language model
Pushing the Limits of Mathematical Reasoning in Open Language Models
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
StudioOllamaUI is a local, portable interface for Ollama
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Chat & pretrained large audio language model proposed by Alibaba Cloud
Powerful open source image generation model
Chat & pretrained large vision language model