Agentic, Reasoning, and Coding (ARC) foundation models
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Z80-μLM is a 2-bit quantized language model
A state-of-the-art open visual language model
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Large language model developed and released by NVIDIA
Large-scale xAI model for local inference with SGLang, Grok-2.5
Llama 3.2–1B: Multilingual, instruction-tuned model for mobile AI
Instruction-tuned 1.2B LLM for multilingual text generation by Meta
Compact 3B-param multimodal model for efficient on-device reasoning