Simple, Pythonic building blocks to evaluate LLM applications
Open-source AI hackers to find and fix your app’s vulnerabilities
A powerful tool for automated LLM fuzzing
AI Agent Evaluator & Red Team Platform
A powerful tool for creating datasets for LLM fine-tuning
SDG is a specialized framework
270+ Claude Code plugins with 739 agent skills
Advanced LLM-powered brute-force tool combining AI intelligence
The platform for LLM evaluations and AI agent testing
Fast, flexible LLM inference
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Tools like web browser, computer access and code runner for LLMs
A security scanner for custom LLM applications
Low-code app builder for RAG and multi-agent AI applications
Personal AI Notebooks. Organize files & webpages and generate notes
All-in-one AI companion! Desktop girlfriend + virtual streamer
An open-source, code-first Java toolkit
The open source post-building layer for agents
One-stop solution for creating your digital avatar from chat history
AI-powered penetration testing assistant using local LLM on linux
Debug, evaluate, and monitor your LLMapps, RAG systems, and agentic AI
Evaluate your LLM's response with Prometheus and GPT4
AWS-native chatbot using Bedrock
AI-powered bridge connecting LLMs and advanced AI agents
A high-performance ML model serving framework, offers dynamic batching