ChatGLM-6B: An Open Bilingual Dialogue Language Model
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
Open-source evaluation toolkit of large multi-modality models (LMMs)
AirLLM 70B inference with single 4GB GPU
Completely free, private, UI based Tech Documentation MCP server
High-speed Large Language Model Serving for Local Deployment
Open-source tool to visualise your RAG
Run Mixtral-8x7B models in Colab or consumer desktops
Open-source, high-performance Mixture-of-Experts large language model
Building Mixture-of-Experts from LLaMA with Continual Pre-training
Easy-to-use headless React Hooks to run LLMs in the browser with WebGP
llama.go is like llama.cpp in pure Golang