Qwen2.5-VL is the multimodal large language model series
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Free, local, open-source AI app builder
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
Qwen3-omni is a natively end-to-end, omni-modal LLM
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
An open-source convolutional neural networks platform for research
Large-scale xAI model for local inference with SGLang, Grok-2.5
Reasoning-powered OCR VLM for converting complex documents to Markdown