AI assistant that supports knowledge bases, model APIs
Fast, local-first web content extraction for LLMs
LLM inference in C/C++
Quick illustration of how one can easily read books together with LLMs
Query anything (GitHub, Notion, +40 more) with SQL and let LLMs
ChatGLM3 series: Open Bilingual Chat LLMs
High-speed Large Language Model Serving for Local Deployment
Fast and efficient unstructured data extraction
Fast, flexible LLM inference
AI search engine - self-host with local or cloud LLMs
Run AI models locally on your machine with Node.js bindings for llama
Masks sensitive data and secrets before they reach AI
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
Local-first semantic code search engine
Web app for interacting with any LangGraph agent (PY & TS) via a chat
TokenSpeed is a speed-of-light LLM inference engine
Fully private LLM chatbot that runs entirely in a browser
Auto-GPT on the browser
This website is a free, open-source guide on prompt engineering