Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
4M: Massively Multimodal Masked Modeling
Open-weight, large-scale hybrid-attention reasoning model
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Tooling for the Common Objects In 3D dataset
Uncommon Objects in 3D dataset
AlphaFold 3 inference pipeline
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
A Family of Open Foundation Models for Code Intelligence
New family of code large language models (LLMs)
FlashMLA: Efficient Multi-head Latent Attention Kernels
Towards Real-World Vision-Language Understanding
Release for Improved Denoising Diffusion Probabilistic Models
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
LLaMA: Open and Efficient Foundation Language Models
Per-Pixel Classification is Not All You Need for Semantic Segmentation