FlashMLA: Efficient Multi-head Latent Attention Kernels
An experimental version of DeepSeek model
A Powerful Native Multimodal Model for Image Generation
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Port of Facebook's LLaMA model in C/C++
Advanced language and coding AI model
Image generation model with single-stream diffusion transformer
Agentic, Reasoning, and Coding (ARC) foundation models
Code for running inference and finetuning with SAM 3 model
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Open-source, high-performance AI model with advanced reasoning
Powerful AI language model (MoE) optimized for efficiency/performance
Official inference repo for FLUX.2 models
Qwen3 is the large language model series developed by Qwen team
Wan2.1: Open and Advanced Large-Scale Video Generative Model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
RGBD video generation model conditioned on camera input
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
Qwen3-Coder is the code version of Qwen3
Python inference and LoRA trainer package for the LTX-2 audio–video
Models for object and human mesh reconstruction
Reference PyTorch implementation and models for DINOv3
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Qwen2.5-VL is the multimodal large language model series