Run Local LLMs on Any Device. Open-source
MLX: An array framework for Apple silicon
Fast LLM speculative inference server for consumer hardware
Powerful Android AI agent with tools, automation, and Linux shell
LiteRT-LM is Google's production-ready inference framework
Agentic browser; privacy-first alternative to ChatGPT Atlas
High-speed Large Language Model Serving for Local Deployment
LLM inference in C/C++
Offline OCR (image text recognition) command-line program for Windows
AlphaFold 3 inference pipeline
Speech Note Linux app. Note-taking, reading and translating
Production-ready toolkit to run AI locally
Fast Multimodal LLM on Mobile Devices
Fast, Sharp & Reliable Agentic Intelligence
TT-NN operator library, and TT-Metalium low-level kernel programming
A scalable inference server for models optimized with OpenVINO
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
ONNX-TensorRT: TensorRT backend for ONNX
Declarative way to run AI models in React Native on device
Low-latency AI inference engine optimized for mobile devices
Cross-platform, customizable ML solutions
Local AI file organization with categorization and rename suggestions
Visual Automation IDE — automate anything you see on screen
A cross-platform video structuring (video analysis) framework
Windows application to search multiple PDFs and chat with them