Foundation model for image generation
Designed for text embedding and ranking tasks
Open-weight, large-scale hybrid-attention reasoning model
Global weather forecasting model using graph neural networks and JAX
Diversity-driven optimization and large-model reasoning ability
CogView4, CogView3-Plus and CogView3(ECCV 2024)
A series of math-specific large language models of our Qwen2 series
The official repo of Qwen chat & pretrained large language model
Multi-modal large language model designed for audio understanding
code for Mesh R-CNN, ICCV 2019
Qwen2.5-VL is the multimodal large language model series
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Genome modeling and design across all domains of life
Diffusion Transformer with Fine-Grained Chinese Understanding
OCR expert VLM powered by Hunyuan's native multimodal architecture
Achieving 3+ generation speedup on reasoning tasks
GPT4V-level open-source multi-modal model based on Llama3-8B
Open-source large language model family from Tencent Hunyuan
A Multi-Modal World Model for Reconstructing, Generating, Simulation
AlphaFold 3 inference pipeline
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Netease Youdao's open-source embedding and reranker models
Open-source framework for intelligent speech interaction
PyTorch code and models for the DINOv2 self-supervised learning