Fast Stable Diffusion on CPU and AI PC
Agentic, Reasoning, and Coding (ARC) foundation models
Strong, Economical, and Efficient Mixture-of-Experts Language Model
Visual Causal Flow
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Code for running inference and finetuning with SAM 3 model
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
State-of-the-art TTS model under 25MB
Easy Docker setup for Stable Diffusion with user-friendly UI
A Customizable Image-to-Video Model based on HunyuanVideo
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Advanced language and coding AI model
Diffusion model (SD, Flux, Wan, Qwen Image, Z-Image, ...) inference
An experimental version of the DeepSeek model
Capable of understanding text, audio, vision, video
MiniMax M2.1, a SOTA model for real-world dev & agents
Text and image to video generation: CogVideoX and CogVideo
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
An Efficient Agentic Model for Computer Use
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
The official PyTorch implementation of Google's Gemma models
Block Diffusion for Ultra-Fast Speculative Decoding
Large-language-model & vision-language-model based on Linear Attention