DeepSeek LLM: Let there be answers
Strong, Economical, and Efficient Mixture-of-Experts Language Model
An AI-powered security review GitHub Action using Claude
Models for object and human mesh reconstruction
Qwen3-Coder is the code version of Qwen3
Qwen2.5-VL is the multimodal large language model series
Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Analyze computation-communication overlap in V3/R1
The official PyTorch implementation of Google's Gemma models
Towards Real-World Vision-Language Understanding
Uncommon Objects in 3D dataset
The ChatGPT Retrieval Plugin lets you easily find personal documents
800,000 step-level correctness labels on LLM solutions to MATH problems
Large-scale xAI model for local inference with SGLang, Grok-2.5