Search Results for "tests"
Sort By:
Code for the paper "Evaluating Large Language Models Trained on Code"
An agentless approach to automatically solve software development
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Benchmark LLMs by fighting in Street Fighter 3
A security scanner for custom LLM applications
LLM powered fuzzing via OSS-Fuzz