Tools for merging pretrained large language models
A high-performance inference engine for AI models
Fast, flexible LLM inference
VS Code extension for LLM-assisted code/text completion
Calculate tokens/s and GPU memory requirements for any LLM
An ecosystem of Rust libraries for working with large language models