Run Local LLMs on Any Device. Open-source
A lightweight vLLM implementation built from scratch
A lightweight framework for building LLM-based agents
Phi-3.5 for Mac: Locally-run Vision and Language Models
The terminal client for Ollama
Lightweight package to simplify LLM API calls
Quick illustration of how one can easily read books together with LLMs
Implementation for MatMul-free LM
Cybersecurity AI (CAI), the framework for AI Security
CodeGeeX2: A More Powerful Multilingual Code Generation Model
On the Structural Pruning of Large Language Models
Document (PDF, Word, PPTX ...) extraction and parsing API
Skywork-R1V is an advanced multimodal AI model series
Run PyTorch LLMs locally on servers, desktop and mobile
LightLLM is a Python-based LLM (Large Language Model) inference
Specify a GitHub or local repo, GitHub pull request
GLM-4 series: Open Multilingual Multimodal Chat LMs
Performance-optimized AI inference on your GPUs
Collect, organize, use, and share, all in OmniBox
MobileLLM: Optimizing Sub-billion Parameter Language Models
ChatGLM3 series: Open Bilingual Chat LLMs
Run LLM prompts from your shell
Adding guardrails to large language models
Inference Llama 2 in one file of pure C
SimpleMem: Efficient Lifelong Memory for LLM Agents