Collaborative & Open-Source Quality Assurance for all AI models
Fast, flexible LLM inference
Arcade Tool Development Kit (TDK), Worker, Evals, and CLI
NestJS Helper + AI Chatbot Development
A multi-platform desktop application to evaluate and compare LLM
Evaluation suite designed to assess the performance of LLMs
AI agent harness for AI coding agents
General proxy performance testing tool based on Clash using Telegram
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
A powerful tool for creating datasets for LLM fine-tuning
Behavior tree AI for Godot Engine
Lightweight framework for evaluating large language model performance
TextWorld is a sandbox learning environment for the training
Autonomous harness engineering
Tools like web browser, computer access and code runner for LLMs
Build high-quality LLM apps
The open source post-building layer for agents
Serving system for machine learning models
The platform for LLM evaluations and AI agent testing
A framework that facilitates all stages of LLM development
Open-source LLM load balancer and serving platform for hosting LLMs
MLX: An array framework for Apple silicon
Open source AI trading OS for autonomous multi-model trading systems
A high-performance ML model serving framework, offers dynamic batching
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning