Port of Facebook's LLaMA model in C/C++
CLI proxy that reduces LLM token consumption
A lightweight vLLM implementation built from scratch
Run Local LLMs on Any Device. Open-source
A lightweight framework for building LLM-based agents
Phi-3.5 for Mac: Locally-run Vision and Language Models
New set of lightweight state-of-the-art, open foundation models
Quick illustration of how one can easily read books together with LLMs
the terminal client for Ollama
Personal AI Notebooks. Organize files & webpages and generate notes
lightweight package to simplify LLM API calls
Clippy, now with some AI
Real-time NVIDIA GPU dashboard
An elegant AI chat client. Full-featured, lightweight
Implementation for MatMul-free LM
Apple Intelligence from the command line
A Ruby Implementation of the Model Context Protocol
Vim plugin for LLM-assisted code/text completion
Cybersecurity AI (CAI), the framework for AI Security
CodeGeeX2: A More Powerful Multilingual Code Generation Model
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
On the Structural Pruning of Large Language Models
AI-powered markdown editor - leverage LLMs with your documents
Fast Multimodal LLM on Mobile Devices
Document (PDF, Word, PPTX ...) extraction and parse API