Open-source AI hackers to find and fix your app’s vulnerabilities
A powerful tool for automated LLM fuzzing
AI Agent Evaluator & Red Team Platform
SDG is a specialized framework
270+ Claude Code plugins with 739 agent skills
Advanced LLM-powered brute-force tool combining AI intelligence
The platform for LLM evaluations and AI agent testing
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
A security scanner for custom LLM applications
All-in-one AI companion! Desktop girlfriend + virtual streamer
An open-source, code-first Java toolkit
The open source post-building layer for agents
Evaluate your LLM's response with Prometheus and GPT-4
AWS-native chatbot using Bedrock
Leaderboard Comparing LLM Performance at Producing Hallucinations
Semi-Structured Agentic Framework. Workflows build themselves
Open-source LLM load balancer and serving platform for hosting LLMs
A.S.E (AICGSecEval) is a repository-level AI-generated code security
Chinese Llama-3 LLMs developed from Meta Llama 3
Retrieval Augmented Generation (RAG) framework
Chinese safety prompts for evaluating and improving the safety of LLMs