Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Tooling for the Common Objects In 3D dataset
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
GLM-4-Voice | End-to-End Chinese-English Conversational Model
GPT4V-level open-source multi-modal model based on Llama3-8B
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Open-weight, large-scale hybrid-attention reasoning model
Qwen-Image is a powerful image generation foundation model
The Clay Foundation Model - An open source AI model and interface
Netease Youdao's open-source embedding and reranker models
An Efficient Agentic Model for Computer Use
Audio foundation model excelling in audio understanding
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
The official PyTorch implementation of Google's Gemma models
A 0.1B Omni model trained from scratch
26M function-calling model that runs on incredibly small devices
Open Source Speech Language Model
Qwen3-ASR is an open-source series of ASR models
A Pragmatic VLA Foundation Model
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Block Diffusion for Ultra-Fast Speculative Decoding
Collection of Gemma 3 variants that are trained for performance
Implementation of "MobileCLIP" CVPR 2024
VMZ: Model Zoo for Video Modeling