Showing 78 open source projects for "functional testing tool"

View related business solutions
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Gemini MCP Tool

    Gemini MCP Tool

    MCP server that enables AI assistants to interact with Google Gemini

    ...It supports workflows where users can reference files or directories using simple syntax, allowing the AI to process entire projects or documents in a single request. The tool also includes sandbox execution features, which allow safe testing of code or commands in an isolated environment without affecting the host system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    CyberStrikeAI

    CyberStrikeAI

    CyberStrikeAI is an AI-native security testing platform built in Go

    ...It supports role-based testing, letting teams define security roles with tailored tool access and prompts, and includes a skills system that encapsulates specialized testing strategies that the AI can incorporate into its planning. Through comprehensive lifecycle management, results are tracked, aggregated, and visualized, with support for versioned persistence, search, and risk severity scoring.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 3
    MCPJam

    MCPJam

    Postman for MCPs - A tool for testing and debugging MCPs

    Inspector by MCPJam is a visual developer tool—akin to Postman—for testing and debugging MCP servers, with capabilities to simulate and trace tool execution via various transports and LLM integrations.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    BruteForceAI

    BruteForceAI

    Advanced LLM-powered brute-force tool combining AI intelligence

    BruteForceAI is an open-source security testing tool that applies large language models to the analysis of login forms and authentication flows in web applications. At a high level, the project uses AI to inspect HTML content, identify the relevant form elements, and automate selector discovery so that a tester does not need to hand-map every field before evaluation. It combines that analysis layer with automated credential testing workflows, framing itself as a more adaptive alternative to older brute-force tooling that depends heavily on manual configuration. ...
    Downloads: 112 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Arcade AI

    Arcade AI

    Arcade Tool Development Kit (TDK), Worker, Evals, and CLI

    ...Core platform functionality and schemas. This repository contains the core Arcade libraries, organized as separate packages for maximum flexibility and modularity. Evaluation framework for testing tool performance. Test your MCP server's tools, resources, prompts, elicitation, and OAuth 2. MCPJam is compliant with the latest MCP specs. Connect to any MCP server. MCPJam inspector supports STDIO, SSE, and Streamable HTTP transports.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    serve-sim

    serve-sim

    The `npx serve` of Apple Simulators

    ...It can run locally, over a LAN, or through a remote Mac with tunneling. The web UI streams the simulator and forwards clicks, enabling browser-based end-to-end testing and debugging. serve-sim is best suited for iOS development, agent testing, remote simulator access, and mobile UI automation workflows.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Cactus Needle

    Cactus Needle

    26m function call model that runs on incredibly small devices

    Needle is an experimental 26-million-parameter function-calling model designed to run on extremely small devices such as phones, watches, glasses, and low-power personal AI hardware. It is based on a Simple Attention Network architecture and was distilled from a much larger model to focus on fast, compact tool-use behavior. The project provides open weights, training details, dataset generation resources, and a playground for testing the model with custom tools. Needle is optimized for single-shot function calling rather than broad conversational ability, so its core use case is selecting the right tool and producing structured arguments. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    OpenClaw Installer

    OpenClaw Installer

    ClawdBot one-click deployment tool

    OpenClaw Installer is an open-source one-click deployment and configuration tool for installing OpenClaw — a personal AI assistant — onto systems with minimal manual setup, giving users a streamlined path to get their own AI assistant running quickly. The project provides shell scripts and configuration menus that detect the host environment, install dependencies, download OpenClaw, configure core settings like AI models and identity channels, and start the server automatically. It supports...
    Downloads: 38 This Week
    Last Update:
    See Project
  • 9
    Expect

    Expect

    Let agents test your code in a real browser

    ...The design suggests a focus on productivity, reducing cognitive load when writing and reviewing tests or validation scripts. It is likely adaptable across multiple contexts, including unit testing, integration testing, and runtime assertions. By abstracting repetitive validation logic, expect helps developers focus on behavior rather than implementation details. Overall, it serves as a lightweight but powerful tool for improving software reliability and clarity in testing workflows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 10
    FuzzyAI Fuzzer

    FuzzyAI Fuzzer

    A powerful tool for automated LLM fuzzing

    ...The framework can be integrated into development pipelines to continuously test AI APIs and detect weaknesses before deployment. FuzzyAI provides testing tools, datasets, and evaluation workflows that help researchers measure how well models resist harmful instructions or attempts to bypass safety mechanisms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    CodeBurn

    CodeBurn

    See where your AI coding tokens go

    CodeBurn is a security-focused tool designed to evaluate and stress-test codebases using adversarial techniques, often leveraging AI to identify vulnerabilities and weaknesses. It simulates attack scenarios against code to uncover potential security risks, helping developers proactively identify issues before they reach production. The system is designed to integrate into development workflows, allowing continuous testing as code evolves.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 12
    kotaemon

    kotaemon

    An open-source RAG-based tool for chatting with your documents

    An open-source clean & customizable RAG UI for chatting with your documents. Built with both end users and developers in mind. This project serves as a functional RAG UI for both end users who want to do QA on their documents and developers who want to build their own RAG pipeline.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 13
    PySpur

    PySpur

    Visual tool for building, testing, and deploying AI agent workflows

    PySpur is a visual development environment designed to help AI engineers build, test, and iterate on agent-based workflows more efficiently. It provides a structured playground where users can define test cases, construct agents either through Python code or a graphical interface, and continuously refine their behavior. It addresses common challenges in AI agent development such as prompt tuning difficulties and lack of visibility into workflow execution. By offering a visual representation...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    agents-cli

    agents-cli

    CLI to turn coding assistants into expert at deploying AI agents

    agents-cli is a command-line tool developed to simplify the creation, management, and execution of AI agents directly from the terminal. It provides developers with a structured interface for defining agent behavior, configuring tools, and running workflows. The tool integrates with agent frameworks and supports modular extensions for adding new capabilities. It emphasizes productivity by enabling rapid iteration and testing of agent logic without complex setup. agents-cli is designed to fit into modern developer workflows, particularly those that rely on automation and scripting. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    jcode

    jcode

    Coding Agent Harness

    jcode is a lightweight developer tool designed to streamline Java coding workflows by simplifying compilation, execution, and testing processes. It provides a structured interface for managing Java programs without requiring complex IDE setups, making it ideal for quick experimentation and learning. The tool focuses on reducing friction for developers who want to run code snippets or small projects efficiently.
    Downloads: 18 This Week
    Last Update:
    See Project
  • 16
    Deepchecks

    Deepchecks

    Test Suites for validating ML models & data

    Deepchecks is the leading tool for testing and for validating your machine learning models and data, and it enables doing so with minimal effort. Deepchecks accompany you through various validation and testing needs such as verifying your data’s integrity, inspecting its distributions, validating data splits, evaluating your model and comparing between different models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Prompt Optimizer

    Prompt Optimizer

    A prompt word optimizer to help write high-quality prompt words

    Prompt-Optimizer is a high-impact AI prompt engineering tool designed to help users craft better, more effective prompts for large language models, boosting the quality and relevance of AI responses. It focuses on automating and streamlining the iterative refinement of prompts by analyzing examples, comparing original and optimized text, and guiding users through multi-round improvements that surface clarity, structure, and specificity.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 18
    Easy DataSet

    Easy DataSet

    A powerful tool for creating datasets for LLM fine-tuning

    ...The system includes automated question-generation capabilities, hierarchical label trees, and answer generation pipelines that use LLM APIs to produce coherent paired data with customizable templates. Beyond dataset creation, Easy-dataset also provides a built-in evaluation system with model testing and blind-test features, helping teams validate model performance using curated test sets.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    ToolUniverse

    ToolUniverse

    Democratizing AI scientists with ToolUniverse

    ToolUniverse is a comprehensive open-source ecosystem designed to transform any large language model into an autonomous “AI scientist” capable of performing real scientific research tasks through structured tool interaction. It standardizes how AI systems discover, select, and execute tools by introducing a unified AI-Tool Interaction Protocol that allows models to seamlessly connect with hundreds of scientific resources, including machine learning models, datasets, APIs, and analytical packages. Instead of requiring custom pipelines or fine-tuning, ToolUniverse wraps around existing models and enables them to reason, experiment, and iterate on complex workflows such as drug discovery, data analysis, and hypothesis testing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Mini Agent

    Mini Agent

    A minimal yet professional single agent demo project

    ...Mini-Agent also comes with “Claude Skills”-style predefined skills for tasks like document processing, design work, and testing, packaged as reusable behaviors that can be invoked by the agent as needed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Ollama Grid Search

    Ollama Grid Search

    A multi-platform desktop application to evaluate and compare LLM

    Ollama Grid Search is a desktop application designed to automate the evaluation and comparison of large language models, prompts, and inference parameters in a structured and repeatable way. Instead of manually testing combinations, the tool performs grid search experiments by iterating across different models, prompt variations, and parameter configurations, allowing users to quickly identify optimal setups for specific tasks. It provides a visual interface where experiment results can be inspected, compared, and refined, making it especially useful for prompt engineering and benchmarking workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Claw Code

    Claw Code

    AI agent harness for AI coding agents

    Claw Code is an open-source AI agent harness project focused on building better tools for orchestrating and managing autonomous coding agents. It originated as a clean-room reimplementation inspired by the architecture of Claude Code, aiming to replicate core concepts without using proprietary code. The project provides a Python-based foundation for experimenting with agent workflows, tool integration, and task execution pipelines. It emphasizes harness engineering—how agents are structured,...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 23
    AgentBench

    AgentBench

    A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

    ...These environments require agents to interpret instructions, take actions, and adapt their strategies based on feedback from the environment. AgentBench also includes an evaluation framework that measures success rates, rewards, and task completion performance across different agent implementations. By testing models across diverse scenarios, the benchmark highlights strengths and weaknesses in reasoning, long-term planning, and tool usage.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    MiMo-V2-Flash

    MiMo-V2-Flash

    MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation

    MiMo-V2-Flash is a large Mixture-of-Experts language model designed to deliver strong reasoning, coding, and agentic-task performance while keeping inference fast and cost-efficient. It uses an MoE setup where a very large total parameter count is available, but only a smaller subset is activated per token, which helps balance capability with runtime efficiency. The project positions the model for workflows that require tool use, multi-step planning, and higher throughput, rather than only...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 25
    MCP Server and GW

    MCP Server and GW

    An MCP stdio to HTTP SSE transport gateway with example server

    mcp-server-and-gw is a Model Context Protocol (MCP) gateway that bridges standard input/output to HTTP Server-Sent Events (SSE) transport. It includes an example MCP server and client, facilitating the development and testing of MCP implementations. This tool is particularly useful for integrating MCP servers with applications like Claude Desktop that may not natively support remote server connections. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next
Auth0 Logo