A Ruby Implementation of the Model Context Protocol
Port of Facebook's LLaMA model in C/C++
A high-throughput and memory-efficient inference and serving engine
Drag & drop UI to build your customized LLM flow
Fast and efficient unstructured data extraction
An RWKV management and startup tool, fully automated, only 8MB
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Fast, flexible LLM inference
AI-powered markdown editor - leverage LLMs with your documents
Open-Source Analytics Infrastructure
Zep: A long-term memory store for LLM / Chatbot applications
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
A lightweight vLLM implementation built from scratch
INT4/INT5/INT8 and FP16 inference on CPU for the RWKV language model
Integrating LLMs into structured NLP pipelines
Fast Multimodal LLM on Mobile Devices
From Paper to Presentation in One Click
Vim plugin for LLM-assisted code/text completion
Analyzing Hacker News discussions from a decade ago in hindsight
A New Axis of Sparsity for Large Language Models
Scalable data preprocessing and curation toolkit for LLMs
Fast, local-first web content extraction for LLMs
Build multimodal language agents for fast prototyping and production
LightLLM is a Python-based LLM (Large Language Model) inference
Low-latency REST API for serving text-embeddings