Model export recipes, Python primitives, and Swift runtime utilities
26m function call model that runs on incredibly small devices
Official repository for LTX-Video
Port of Facebook's LLaMA model in C/C++
Awesome multilingual OCR toolkits based on PaddlePaddle
Native and Compact Structured Latents for 3D Generation
Implementation of "MobileCLIP" CVPR 2024
Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
Python inference and LoRA trainer package for the LTX-2 audio–video
Flux 2 image generation model pure C inference
Tiny vision language model
AlphaFold 3 inference pipeline
The most powerful local music generation model
Z80-μLM is a 2-bit quantized language model
Python SDK for Claude Agent
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
PyTorch code and models for the DINOv2 self-supervised learning
Stable Diffusion with Core ML on Apple Silicon
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Unified Multimodal Understanding and Generation Models
Text and image to video generation: CogVideoX and CogVideo
Multimodal embedding and reranking models built on Qwen3-VL
Instructions on how to use the Realtime API on Microcontrollers
Fast, Sharp & Reliable Agentic Intelligence