Image generation model with single-stream diffusion transformer
Fast-stable-diffusion + DreamBooth
From Vibe Coding to Agentic Engineering
State-of-the-art TTS model under 25MB
Python inference and LoRA trainer package for the LTX-2 audio–video
ChatGLM-6B: An Open Bilingual Dialogue Language Model
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
A Customizable Image-to-Video Model based on HunyuanVideo
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Reference PyTorch implementation and models for DINOv3
Flux 2 image generation model pure C inference
Sharp Monocular Metric Depth in Less Than a Second
FlashMLA: Efficient Multi-head Latent Attention Kernels
Text and image to video generation: CogVideoX and CogVideo
Official inference repo for FLUX.2 models
PyTorch implementation of JiT
Advancing Open-source World Models
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Easy Docker setup for Stable Diffusion with user-friendly UI
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Hackable and optimized Transformers building blocks