Code for the paper "Evaluating Large Language Models Trained on Code"
CLI proxy that reduces LLM token consumption
An agentless approach to automatically solve software development
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Benchmark LLMs by fighting in Street Fighter 3
A security scanner for custom LLM applications
Test and evaluate LLMs and model configurations
LLM powered fuzzing via OSS-Fuzz