Python version of the Playwright testing and automation library
Python-based continuous integration testing framework
Malicious traffic detection system
Install and run Python applications in isolated environments
Collaborative & Open-Source Quality Assurance for all AI models
Arcade Tool Development Kit (TDK), Worker, Evals, and CLI
Terminal-based CPU stress and monitoring utility
High-performance reconnaissance and vulnerability scanning tool
Evaluation suite designed to assess the performance of LLMs
AI agent harness for AI coding agents
Utilize all available CPU cores for accepting new client connections
Multi-Joint dynamics with Contact. A general purpose physics simulator
High-performance fake data generator for Python
General proxy performance testing tool based on Clash using Telegram
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
TextWorld is a sandbox learning environment for the training
Lightweight framework for evaluating large language model performance
Autonomous harness engineering
How to improve NGINX performance, security, and other important things
Tools like web browser, computer access and code runner for LLMs
Build high-quality LLM apps
The open source post-building layer for agents
A framework that facilitates all stages of LLM development
Google Toolbox for Mac
Public CI, Docker images for popular JAX libraries