GPU benchmark testing graphics performance with realistic 3D scenes.
Drill is an HTTP load testing application written in Rust
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
benchmark tooling that loves you
A benchmarking framework for the Julia language
Fully autonomous AI hacker to find actual exploits in your web apps
A.S.E (AICGSecEval) is a repository-level AI-generated code security
Integrates the JMH benchmarking framework with Gradle
Leaderboard Comparing LLM Performance at Producing Hallucinations
The Abstraction and Reasoning Corpus
GPU stress test OpenGL and Vulkan graphics benchmark Windows/Linux
Precision CPU stress testing and benchmarking
Fast, flexible LLM inference
Free stress test tool for your PC
Advanced OpenGL and Vulkan graphics card stress testing utility
A unified, comprehensive and efficient recommendation library
State of the art LLM and coding model
Stress-Test your Processor
Benchmark CPU, GPU, memory, and storage
Java Disk Benchmark Utility
powerMAX is a CPU and GPU burn-in test
Chinese safety prompts for evaluating and improving the safety of LLMs
A tiny KV storage based on skiplist written in C++ language
Automatic n-dimensional data clustering tool powered by five advanced
A benchmark suite for reliable testing of IP networks.