Fast, flexible LLM inference
A powerful tool for creating datasets for LLM fine-tuning
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Tools like web browser, computer access and code runner for LLMs
The platform for LLM evaluations and AI agent testing
The open source post-building layer for agents
Open-source LLM load balancer and serving platform for hosting LLMs
A high-performance ML model serving framework, offers dynamic batching
State of the art LLM and coding model
Leaderboard Comparing LLM Performance at Producing Hallucinations
8.5K high quality grade school math problems