Next-gen AI+IoT framework for T2, T3, T5AI, ESP32, and more
Your fully private, open-source, on-device AI assistant
Run Local LLMs on Any Device. Open-source
Production-ready toolkit to run AI locally
Fast, flexible LLM inference
Chat with private and local large language models
AI data analysis and visualization on CSV, Postgres, MySQL, Snowflake
MobileLLM: Optimizing Sub-billion Parameter Language Models
Low-latency REST API for serving text embeddings
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
Phi-3.5 for Mac: Locally-run Vision and Language Models
LLM-enabled investment tracker that consolidates market performance
Fully private LLM chatbot that runs entirely in a browser
PyTorch library of curated Transformer models and their components
Locally run an Instruction-Tuned Chat-Style LLM
Training and serving large-scale neural networks
Efficient MoE reasoning model for coding and math workloads