Run models like Kimi-K2.5, GLM-5, DeepSeek, gpt-oss, Gemma, Qwen etc.
Run Local LLMs on Any Device. Open-source
Concatenate a directory full of files into a single prompt
Project aimed at extracting, exporting, and analyzing chat records
A Simple and Universal Swarm Intelligence Engine
The all-in-one Desktop & Docker AI application with full RAG and AI
Clippy, now with some AI
Claude Code opened to any LLM
AirLLM 70B inference with single 4GB GPU
Simple, Pythonic building blocks to evaluate LLM applications
Implementations for various Generative AI Agent techniques
LLM training in simple, raw C/CUDA
The SOTA Open-Source Browser Agent
Fully automatic censorship removal for language models
Convert any URL to an LLM-friendly input with a simple prefix
Quick illustration of how one can easily read books together with LLMs
Bringing large-language models and chat to web browsers
AI Coding agent for the terminal
MiniMax M2.1, a SOTA model for real-world dev & agents.
Mac app for Ollama
950 line, minimal, extensible LLM inference engine built from scratch
Claude + Obsidian knowledge companion
Build a modern LLM from scratch. Every line commented
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Gemma open-weight LLM library, from Google DeepMind