Port of Facebook's LLaMA model in C/C++
Run models like Kimi-K2.5, GLM-5, DeepSeek, gpt-oss, Gemma, Qwen etc.
Port of OpenAI's Whisper model in C/C++
Open-source vector similarity search for Postgres
Run OpenClaw on a $5 chip
AI video generator optimized for low VRAM and older GPUs use
The Operator Splitting QP Solver
Flux 2 image generation model pure C inference
TEN, a voice agent framework to create conversational AI.
C++ and Python Examples
Next-gen AI+IoT framework for T2/T3/T5AI/ESP32/and more
ESP32 desk dashboard that shows Claude Code usage
Your personal AI assistant at all-in 888KiB
Open-source framework for conversational voice AI agents
kaldi-asr/kaldi is the official location of the Kaldi project
FAIR Sequence Modeling Toolkit 2
Android inline hook library which supports thumb, arm32 and arm64
DeepSeek 4 Flash local inference engine for Metal
A fast image processing library with low memory needs
Run a 1-billion parameter LLM on a $10 board with 256MB RAM
Foundational Models for State-of-the-Art Speech and Text Translation
Provides CTP stock options and Zhongtai Securities XTP
C++ inference library for multiple SVC/TTS
llama and other large language models on iOS and MacOS offline
Low-latency AI inference engine optimized for mobile devices