Code for the paper "Evaluating Large Language Models Trained on Code"
CLI proxy that reduces LLM token consumption
An agentless approach to automatically solve software development
Benchmark LLMs by fighting in Street Fighter 3
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
A security scanner for custom LLM applications
Test and evaluate LLMs and model configurations
LLM powered fuzzing via OSS-Fuzz