SimpleMem: Efficient Lifelong Memory for LLM Agents
ChatGLM-6B: An Open Bilingual Dialogue Language Model
MemoryOS is designed to provide a memory operating system
AirLLM: 70B inference with a single 4GB GPU
Accessible large language models via k-bit quantization for PyTorch
A high-throughput and memory-efficient inference and serving engine
⚡ Building applications with LLMs through composability ⚡
Neural Network architecture based on ideas of the original LSTM
A Simple and Universal Swarm Intelligence Engine
Redundancy-aware KV Cache Compression for Reasoning Models
Claude + Obsidian knowledge companion
Unified KV Cache Compression Methods for Auto-Regressive Models
CodeGeeX2: A More Powerful Multilingual Code Generation Model
A Telegram bot for Large Language Models
State-of-the-art Parameter-Efficient Fine-Tuning
AI-powered penetration testing assistant using a local LLM on Linux
Maimaibot, a (more focused) multi-platform intelligent agent
A frontier, first-principles handbook
LLM training in simple, raw C/CUDA
Low-code framework for building custom LLMs, neural networks
How to optimize some algorithms in CUDA
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Open-source large language model family from Tencent Hunyuan
Visual intelligence for your home
Tools for merging pretrained large language models