Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference
Implementation of "MobileCLIP" CVPR 2024
State-of-the-art TTS model under 25MB
Model export recipes, Python primitives, and Swift runtime utilities
Phi-3.5 for Mac: Locally-run Vision and Language Models
26m function call model that runs on incredibly small devices
Hunyuan Translation Model Version 1.5
Ultra-Efficient LLMs on End Device
Locally run an Instruction-Tuned Chat-Style LLM
Efficient MoE reasoning model for coding and math workloads
Lightweight 24B agentic coding model with vision and long context
Jan-v1-edge: efficient 1.7B reasoning model optimized for edge devices
Llama 3.2–1B: Multilingual, instruction-tuned model for mobile AI
Instruction-tuned 1.2B LLM for multilingual text generation by Meta
Compact 3B-param multimodal model for efficient on-device reasoning