FlashMLA: Efficient Multi-head Latent Attention Kernels
An experimental version of DeepSeek model
Clean and efficient FP8 GEMM kernels with fine-grained scaling
A Powerful Native Multimodal Model for Image Generation
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Advanced language and coding AI model
Image generation model with single-stream diffusion transformer
Code for running inference and finetuning with SAM 3 model
Agentic, Reasoning, and Coding (ARC) foundation models
Port of Facebook's LLaMA model in C/C++
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Qwen3 is the large language model series developed by Qwen team
Powerful AI language model (MoE) optimized for efficiency/performance
Open-source, high-performance AI model with advanced reasoning
Official inference repo for FLUX.2 models
Wan2.1: Open and Advanced Large-Scale Video Generative Model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
RGBD video generation model conditioned on camera input
Models for object and human mesh reconstruction
From Images to High-Fidelity 3D Assets
Qwen3-Coder is the code version of Qwen3
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
Reference PyTorch implementation and models for DINOv3
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Lets make video diffusion practical