Port of Facebook's LLaMA model in C/C++
An LLM Compiler for Parallel Function Calling
DepGraph: Towards Any Structural Pruning
An RWKV management and startup tool, fully automated, only 8 MB
OpenAI API client for Kotlin with multiplatform capabilities
Specify a GitHub or local repo, GitHub pull request
Your fully private, open-source, on-device AI assistant
Fetch source code for npm packages
Local-first semantic code search engine
Distributed LLM and Stable Diffusion inference
Inference Llama 2 in one file of pure C
Fully private LLM chatbot that runs entirely in the browser
Run LLMs locally on Cloud Workstations