Open-source AI hackers to find and fix your app’s vulnerabilities
A powerful tool for automated LLM fuzzing
Advanced LLM-powered brute-force tool combining AI intelligence
AI Agent Evaluator & Red Team Platform
270+ Claude Code plugins with 739 agent skills
SDG is a specialized framework
Simple, Pythonic building blocks to evaluate LLM applications
A security scanner for custom LLM applications
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Tools like web browser, computer access and code runner for LLMs
A high-performance ML model serving framework that offers dynamic batching
E2B Desktop Sandbox for LLMs. E2B Sandbox
Evaluate your LLM's response with Prometheus and GPT4
One-stop solution for creating your digital avatar from chat history
The open source post-building layer for agents
AI-powered penetration testing assistant using a local LLM on Linux
A.S.E (AICGSecEval) is a repository-level AI-generated code security
Semi-Structured Agentic Framework. Workflows build themselves
Leaderboard Comparing LLM Performance at Producing Hallucinations
Chinese Llama-3 LLMs developed from Meta Llama 3
Retrieval Augmented Generation (RAG) framework
The unofficial Python package that returns the response of Google Bard
8.5K high-quality grade-school math problems
An implementation of model parallel GPT-2 and GPT-3-style models