Make your agents learn from experience
Power CLI and Workflow manager for LLMs (core package)
A Simple and Universal Swarm Intelligence Engine
TokenSpeed is a speed-of-light LLM inference engine
Universal LLM Deployment Engine with ML Compilation
SQL-Driven RAG Engine
A tension reasoning engine over 131 S-class problems
A Next-Generation Training Engine Built for Ultra-Large MoE Models
AI-Powered Data Processing: Use LOTUS to process all of your datasets
A high-throughput and memory-efficient inference and serving engine
950 line, minimal, extensible LLM inference engine built from scratch
A lightweight vLLM implementation built from scratch
local-first semantic code search engine
High-performance inference framework for large language models
A modular graph-based Retrieval-Augmented Generation (RAG) system
Tensor search for humans
Quick illustration of how one can easily read books together with LLMs
Claude + Obsidian knowledge companion
Parallax is a distributed model serving framework
Build multimodal language agents for fast prototype and production
LightLLM is a Python-based LLM (Large Language Model) inference
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
Request recommended movies, TV shows and anime to Jellyseer/Overseer
GitLab automatic code review tool based on large models
Inference Llama 2 in one file of pure C