Production-tested AI infrastructure tools
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Image generation model with single-stream diffusion transformer
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Advanced language and coding AI model
Powerful AI language model (MoE) optimized for efficiency/performance
Port of Facebook's LLaMA model in C/C++
Agentic, Reasoning, and Coding (ARC) foundation models
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Open-source, high-performance AI model with advanced reasoning
Official inference repo for FLUX.2 models
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
HY-Motion model for 3D character animation generation
An experimental version of the DeepSeek model
Python inference and LoRA trainer package for the LTX-2 audio–video
Python bindings for llama.cpp
Let's make video diffusion practical
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Industrial-level controllable zero-shot text-to-speech system
Reference PyTorch implementation and models for DINOv3
Qwen2.5-VL is the multimodal large language model series
State-of-the-art TTS model under 25MB
Qwen-Image is a powerful image generation foundation model