A state-of-the-art open visual language model
One-click local MCP server installation in desktop apps
Multimodal embedding and reranking models built on Qwen3-VL
AI-powered tool for quickly and cleanly removing watermarks from images
Environment generation code for the paper "Emergent Tool Use"
Multimodal Transformer for document image understanding and layout
Frontier-scale 675B multimodal base model for custom AI training
Versatile 8B multimodal base LLM, a flexible foundation for custom AI