Port of Facebook's LLaMA model in C/C++
AlphaFold 3 inference pipeline
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
High-Resolution Image Synthesis with Latent Diffusion Models
Strong, Economical, and Efficient Mixture-of-Experts Language Model
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Code for running inference and finetuning with SAM 3 model
A Customizable Image-to-Video Model based on HunyuanVideo
Achieving 3+ generation speedup on reasoning tasks
Official inference repo for FLUX.1 models
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Ling is a MoE LLM provided and open-sourced by InclusionAI
Qwen3 is the large language model series developed by Qwen team
Official Python inference and LoRA trainer package
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Phi-3.5 for Mac: Locally-run Vision and Language Models
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation
OCR expert VLM powered by Hunyuan's native multimodal architecture
Inference script for Oasis 500M
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Fast stable diffusion on CPU and AI PC
A Powerful Native Multimodal Model for Image Generation
RGBD video generation model conditioned on camera input
Agentic, Reasoning, and Coding (ARC) foundation models