A GPU overclock & undervolt tool for various Snapdragon chips
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Reference implementations of MLPerf™ training benchmarks
Strong, Economical, and Efficient Mixture-of-Experts Language Model
A high-performance HTTP benchmarking tool
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
A simple generic set type for the Go language
Fast, disk-space-efficient package manager
Cluster computing framework for processing large-scale geospatial data
The Abstraction and Reasoning Corpus
General plug-and-play inference library for Recursive Language Models
Collection of reference environments, offline reinforcement learning
bsuite is a collection of carefully-designed experiments
Python-based research interface for blackbox
Benchmark tooling that loves you
Fully autonomous AI hacker to find actual exploits in your web apps
Benchmark LLMs by fighting in Street Fighter 3
Import public NYC taxi and for-hire vehicle (Uber, Lyft)
The first large-scale public benchmark dataset for image harmonization
A reinforcement learning package for Julia
Fast JSON encoder/decoder compatible with encoding/json for Go
Provider-agnostic, open-source evaluation infrastructure
Collections of robotics environments
Performance monitoring and benchmarking suite
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models