Port of Facebook's LLaMA model in C/C++
Powerful AI language model (MoE) optimized for efficiency/performance
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Diffusion model (SD, Flux, Wan, Qwen Image, Z-Image, ...) inference
Awesome multilingual OCR toolkits based on PaddlePaddle
Clean and efficient FP8 GEMM kernels with fine-grained scaling
C#/.NET binding of llama.cpp, including LLaMA/GPT model inference
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Flux 2 image generation model pure C inference
AlphaFold 3 inference pipeline
Personalize Any Characters with a Scalable Diffusion Transformer
FAIR Sequence Modeling Toolkit 2
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Towards self-verifiable mathematical reasoning
Fast, Sharp & Reliable Agentic Intelligence
VMZ: Model Zoo for Video Modeling
Open-source large language model family from Tencent Hunyuan
Generate Any 3D Scene in Seconds
Hackable and optimized Transformers building blocks
Safety reasoning models built upon gpt-oss
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Foundational Models for State-of-the-Art Speech and Text Translation
FlashMLA: Efficient Multi-head Latent Attention Kernels
Real-time behaviour synthesis with MuJoCo, using Predictive Control