26m function call model that runs on incredibly small devices
A powerful tool for automated LLM fuzzing
Collaborative & Open-Source Quality Assurance for all AI models
Test Suites for validating ML models & data
A tool to use the Ai2 Open Coding Agents Soft-Verified Agents
Advanced LLM-powered brute-force tool combining AI intelligence
Based on AI Agent + MCP toolchain + penetration Skill orchestration
Arcade Tool Development Kit (TDK), Worker, Evals, and CLI
Fully automatic censorship removal for language models
Tool for exploring and debugging transformer model behaviors
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
SDG is a specialized framework
The most powerful Android RPA agent framework
The most powerful and modular diffusion model GUI, api and backend
Advanced language and coding AI model
Agentic, Reasoning, and Coding (ARC) foundation models
Open platform connecting AI agents to tools via unified MCP server
PaddlePaddle End-to-End Development Toolkit
Democratizing AI scientists with ToolUniverse
GUI Exploration Lab. One of the best GUI agent solutions
Simplifies the local serving of AI models from any source
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
GUI/CLI tool for downloading Xiaohongshu
Leaderboard Comparing LLM Performance at Producing Hallucinations
A minimal yet professional single agent demo project