Hackable and optimized Transformers building blocks
PyTorch code and models for the DINOv2 self-supervised learning
GLM-4-Voice | End-to-End Chinese-English Conversational Model
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Math-specific large language models from our Qwen2 series
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Pretrained time-series foundation model developed by Google Research
NetEase Youdao's open-source embedding and reranker models
An Efficient Agentic Model for Computer Use
Audio foundation model excelling in audio understanding
Tiny vision language model
A 0.1B Omni model trained from scratch
26M function-calling model that runs on incredibly small devices
Open Source Speech Language Model
Long-form streaming TTS system for multi-speaker dialogue generation
Qwen3-ASR is an open-source series of ASR models
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Block Diffusion for Ultra-Fast Speculative Decoding
Multimodal embedding and reranking models built on Qwen3-VL
Z80-μLM is a 2-bit quantized language model
Collection of Gemma 3 variants that are trained for performance
High-resolution models for human tasks
Tool for exploring and debugging transformer model behaviors
CLIP: predict the most relevant text snippet given an image