Qwen2.5-VL is the multimodal large language model series
Analyze computation-communication overlap in V3/R1
The ChatGPT Retrieval Plugin lets you easily find personal documents
Access to Anthropic's safety-first language model APIs
Open source large language model by Alibaba
Hermes 4 FP8: hybrid reasoning Llama-3.1-405B model by Nous Research
Qwen2.5-VL-3B-Instruct: Multimodal model for chat, vision & video
Compact 8B multimodal instruct model optimized for edge deployment
Efficient 8B multimodal model tuned for advanced reasoning tasks.
High-precision 14B multimodal model built for advanced reasoning tasks
Ultra-efficient 3B multimodal instruct model built for edge deployment
Efficient 14B multimodal instruct model with edge deployment and FP8
Compact 3B-param multimodal model for efficient on-device reasoning
QwQ-32B is a reasoning-focused language model for complex tasks
Multimodal 7B model for image, video, and text understanding tasks
Powerful 14B LLM with strong instruction and long-text handling
Speculative-decoding accelerator for the 675B Mistral Large 3
Quantized 675B multimodal instruct model optimized for NVFP4
Frontier-scale 675B multimodal instruct MoE model for enterprise AIMis