Test-Time Reinforcement Learning
Supercharge Your LLM Application Evaluations
AI tool that generates tests to improve code coverage quickly
CodiumAI Cover-Agent: An AI-Powered Tool for Automated Test Generation
The easiest way to use deep metric learning in your application
YOLOv5 is the world's most loved vision AI
Collaborative & Open-Source Quality Assurance for all AI models
AI Agent Evaluator & Red Team Platform
Implementation of TurboQuant (ICLR 2026)
General proxy performance testing tool based on Clash using Telegram
A powerful tool for automated LLM fuzzing
PaddlePaddle End-to-End Development Toolkit
Visual tool for building, testing, and deploying AI agent workflows
SWE-agent takes a GitHub issue and tries to automatically fix it
Free, open-source crypto trading bot
Evaluate and monitor ML models from validation to production
A Python library that makes AMR parsing, generation and visualization
Convert TensorFlow, Keras, TensorFlow.js and TFLite models to ONNX
Test Suites for validating ML models & data
Arcade Tool Development Kit (TDK), Worker, Evals, and CLI
Tools such as a web browser, computer access, and a code runner for LLMs
ComfyUI wrapper nodes for WanVideo and related models
Fast and Universal 3D reconstruction model for versatile tasks
MTEB: Massive Text Embedding Benchmark
Practice implementing softmax, attention, GPT-2 and more