A Ruby Implementation of the Model Context Protocol
Port of Facebook's LLaMA model in C/C++
A RWKV management and startup tool, full automation, only 8MB
Fast and efficient unstructured data extraction
Drag & drop UI to build your customized LLM flow
TokenSpeed is a speed-of-light LLM inference engine
Fast, flexible LLM inference
A high-throughput and memory-efficient inference and serving engine
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
AI-powered markdown editor - leverage LLMs with your documents
Open-Source Analytics Infrastructure
From Paper to Presentation in One Click
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Zep: A long-term memory store for LLM / Chatbot applications
Fast, local-first web content extraction for LLMs
A lightweight vLLM implementation built from scratch
Integrating LLMs into structured NLP pipelines
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Vim plugin for LLM-assisted code/text completion
Analyzing Hacker News discussions from a decade ago in hindsight
A New Axis of Sparsity for Large Language Models
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
Scalable data pre processing and curation toolkit for LLMs
Fast Multimodal LLM on Mobile Devices
Build multimodal language agents for fast prototype and production