DeepSeek 4 Flash local inference engine for Metal
Port of OpenAI's Whisper model in C/C++
Inference Llama 2 in one file of pure C
Port of Facebook's LLaMA model in C/C++
High-performance code intelligence MCP server
Run models like Kimi-K2.5, GLM-5, DeepSeek, gpt-oss, Gemma, Qwen, etc.
ESP32 desk dashboard that shows Claude Code usage
C++ and Python Examples
Pure C inference for the Flux 2 image generation model
Android inline hook library that supports Thumb, ARM32, and ARM64
The Operator Splitting QP Solver
Open-source vector similarity search for Postgres
Open-source framework for conversational voice AI agents
AI video generator optimized for low-VRAM and older GPUs
ByteHook is an Android PLT hook library
kaldi-asr/kaldi is the official location of the Kaldi project
Provides CTP stock options and Zhongtai Securities XTP
C++ inference library for multiple SVC/TTS
Run OpenClaw on a $5 chip
Next-gen AI+IoT framework for T2/T3/T5AI/ESP32 and more
Run a 1-billion-parameter LLM on a $10 board with 256MB RAM
FAIR Sequence Modeling Toolkit 2
TEN, a voice agent framework for creating conversational AI
A fast image processing library with low memory needs
Your personal AI assistant at all-in 888KiB