Jlama is a modern LLM inference engine for Java
E2B Desktop Sandbox for LLMs. E2B Sandbox
Chat with any codebase in under two minutes | Fully local
local-first semantic code search engine
AWS-native chatbot using Bedrock
E2M converts various file types (doc, docx, epub, html, htm, url
Your Personal Research Multi-Tool
Unified KV Cache Compression Methods for Auto-Regressive Models
An open-source, code-first Java toolkit
Learning to Reason with Search for LLMs via Reinforcement Learning
TT-NN operator library, and TT-Metalium low level kernel programming
Take control of your AI agents
Traditional Mandarin LLMs for Taiwan
csghub-server is the backend server for CSGHub
Fast Multimodal LLM on Mobile Devices
Korvus is a search SDK that unifies the entire RAG pipeline
Benchmark LLMs by fighting in Street Fighter 3
Local CLI Copilot, powered by Ollama
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
Open-source LLM load balancer and serving platform for hosting LLMs
Recipes to train reward model for RLHF
AI data analysis and visualization on CSV, Postgres, MySQL, Snowflake
Here comes a selection of technology stacks and tool repositories
Constrained Value Alignment via Safe Reinforcement Learning
Official Repo for ICML 2024 paper