The toolkit to test, validate, and evaluate your models and surface
Python version of the Playwright testing and automation library
Collaborative & Open-Source Quality Assurance for all AI models
SWE-agent takes a GitHub issue and tries to automatically fix it
Optimize your code automatically with AI
Arcade Tool Development Kit (TDK), Worker, Evals, and CLI
YOLOv5 is the world's most loved vision AI
General proxy performance testing tool based on Clash using Telegram
Python-based continuous integration testing framework
Malicious traffic detection system
Tools like web browser, computer access and code runner for LLMs
Open-source industrial-grade ASR models
A Python toolbox for performing gradient-free optimization
A coin that can be mined with almost everything
High-Performance Symbolic Regression in Python and Julia
An implementation of a quantum simulator that you can run locally
Unified Multimodal Understanding and Generation Models
Open-source data observability for analytics engineers
Open-weight, large-scale hybrid-attention reasoning model
Pythonic tool for running machine-learning/high performance workflows
Full stack, modern web application generator
Your open-source LLM evaluation toolkit
Code for Cicero, an AI agent that plays the game of Diplomacy
Super Tiny Icons are miniscule SVG versions of your favourite website
Python package that generates fake data for you