Port of Facebook's LLaMA model in C/C++
CLI proxy that reduces LLM token consumption
Run Local LLMs on Any Device. Open-source
A lightweight vLLM implementation built from scratch
A lightweight framework for building LLM-based agents
Phi-3.5 for Mac: Locally-run Vision and Language Models
New set of lightweight state-of-the-art, open foundation models
Quick illustration of how one can easily read books together with LLMs
the terminal client for Ollama
lightweight package to simplify LLM API calls
Clippy, now with some AI
Personal AI Notebooks. Organize files & webpages and generate notes
Real-time NVIDIA GPU dashboard
An elegant AI chat client. Full-featured, lightweight
Implementation for MatMul-free LM
Apple Intelligence from the command line
A Ruby Implementation of the Model Context Protocol
Vim plugin for LLM-assisted code/text completion
Cybersecurity AI (CAI), the framework for AI Security
CodeGeeX2: A More Powerful Multilingual Code Generation Model
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
On the Structural Pruning of Large Language Models
AI-powered markdown editor - leverage LLMs with your documents
Fast Multimodal LLM on Mobile Devices
Document (PDF, Word, PPTX ...) extraction and parse API