New family of code large language models (LLMs)
Controllable & emotion-expressive zero-shot TTS
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Pokee Deep Research Model Open Source Repo
Tooling for the Common Objects In 3D dataset
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Language modeling in a sentence representation space
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Advancing Formal Mathematical Reasoning via Reinforcement Learning
Clean and efficient FP8 GEMM kernels with fine-grained scaling
FlashMLA: Efficient Multi-head Latent Attention Kernels
Renderer for the harmony response format to be used with gpt-oss
Implementation of the Surya Foundation Model for Heliophysics
Safety reasoning models built upon gpt-oss
Diversity-driven optimization and large-model reasoning ability
A state-of-the-art open visual language model
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Large Multimodal Models for Video Understanding and Editing
MiniMax-M2, a model built for Max coding & agentic workflows
Open-weight, large-scale hybrid-attention reasoning model
Large-language-model & vision-language-model based on Linear Attention
Capable of understanding text, audio, vision, video
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Towards Real-World Vision-Language Understanding