Evaluation and Tracking for LLM Experiments
Python library and shell utilities to monitor filesystem events
A collection of notebooks/recipes showcasing ways of using Claude
The ultimate RAG for your monorepo
Fast State-of-the-Art Static Embeddings
Minimal Python framework for scalable AI inference servers fast
High-performance inference server for text embeddings models API layer
Mastering Applied AI, One Concept at a Time
Ready-to-run cloud templates for RAG
Document Index for Vectorless, Reasoning-based RAG
AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories
RAG-Anything: All-in-One RAG Framework
Making RAG Simpler with Small and Open-Sourced Language Models
SimpleMem: Efficient Lifelong Memory for LLM Agents
A New Axis of Sparsity for Large Language Models
Knowledge Graph Generation from Any Text
Kimi Code CLI is your next CLI agent
Build production-ready AI agents in both Python and Typescript
Low-latency AI inference engine optimized for mobile devices
Ship AI Agents to Google Cloud in minutes, not months
AI-powered document analysis and tagging for Paperless-ngx
Local RAG engine for private multimodal knowledge search on devices
A collection of scientific methods, processes, algorithms
Learning to Reason with Search for LLMs via Reinforcement Learning
Traditional Mandarin LLMs for Taiwan