Get up and running with Llama 2 and other large language models
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Drag & drop UI to build your customized LLM flow
Port of Facebook's LLaMA model in C/C++
A high-throughput and memory-efficient inference and serving engine
Freedom GPT Electron app
LLM Frontend for Power Users
Locally run an Instruction-Tuned Chat-Style LLM
Application that simplifies the installation of AI-related projects
Self-hosted, community-driven, local OpenAI-compatible API
OpenDAN is an open source Personal AI OS
Distribute and run LLMs with a single file
The all-in-one Desktop & Docker AI application with full RAG and AI
An RWKV management and startup tool, fully automated, only 8 MB
Framework and no-code GUI for fine-tuning LLMs
Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge
Integrate cutting-edge LLM technology quickly and easily into your app
Chat with LLMs like Vicuna entirely in your browser with WebGPU
One API for plugins and datasets, one interface for prompt engineering
Toolkit for conversational AI
This website is a free, open-source guide on prompt engineering
C#/.NET binding of llama.cpp, including LLaMA/GPT model inference
Text generator is a handy plugin for Obsidian
Implementation of model parallel autoregressive transformers on GPUs
Low-code app builder for RAG and multi-agent AI applications