Showing 266 open source projects for "source testing unit testing"

View related business solutions
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    Expect

    Expect

    Let agents test your code in a real browser

    ...The design suggests a focus on productivity, reducing cognitive load when writing and reviewing tests or validation scripts. It is likely adaptable across multiple contexts, including unit testing, integration testing, and runtime assertions. By abstracting repetitive validation logic, expect helps developers focus on behavior rather than implementation details. Overall, it serves as a lightweight but powerful tool for improving software reliability and clarity in testing workflows.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    PentestGPT

    PentestGPT

    Automated Penetration Testing Agentic Framework Powered by LLMs

    PentestGPT is an AI-powered autonomous penetration testing agent designed to perform intelligent, end-to-end security assessments using large language models. Published at USENIX Security 2024, it combines advanced reasoning with an agentic workflow to automate tasks traditionally handled by human pentesters. The platform supports multiple penetration testing categories, including web security, cryptography, reversing, forensics, privilege escalation, and binary exploitation. PentestGPT runs...
    Downloads: 275 This Week
    Last Update:
    See Project
  • 3
    LangCheck

    LangCheck

    Simple, Pythonic building blocks to evaluate LLM applications

    Simple, Pythonic building blocks to evaluate LLM applications.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Giskard

    Giskard

    Collaborative & Open-Source Quality Assurance for all AI models

    The testing framework dedicated to ML models, from tabular to LLMs. Giskard is an open-source testing framework dedicated to ML models, from tabular models to LLMs. Testing Machine Learning applications can be tedious. Since ML models depend on data, testing scenarios depend on the domain specificities and are often infinite. At Giskard, we believe that Machine Learning needs its own testing framework.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 5
    OctoMind MCP

    OctoMind MCP

    An MCP server for octomind tools, resources and prompts

    The Octomind MCP Server is designed to integrate Octomind's end-to-end testing tools and resources into local development environments. It enables AI-powered interfaces to create, execute, and manage e2e tests, enhancing the testing workflow. ​
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    PentAGI

    PentAGI

    Perform penetration testing tasks

    PentAGI is a fully autonomous AI agent system designed to perform complex penetration testing tasks by orchestrating multiple intelligent components into a coordinated offensive security workflow. The platform aims to automate significant portions of the penetration testing lifecycle, including reconnaissance, vulnerability discovery, and exploitation planning, reducing the amount of manual effort required from security professionals. It leverages agent-based architecture and AI reasoning to...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 7
    DeepEval
    DeepEval is a simple-to-use, open-source LLM evaluation framework, for evaluating and testing large-language model systems. It is similar to Pytest but specialized for unit testing LLM outputs. DeepEval incorporates the latest research to evaluate LLM outputs based on metrics such as G-Eval, hallucination, answer relevancy, RAGAS, etc., which uses LLMs and various other NLP models that run locally on your machine for evaluation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Apache Hamilton

    Apache Hamilton

    Helps data scientists define testable self-documenting dataflows

    Apache Hamilton is an open-source Python framework designed to simplify the creation and management of dataflows used in analytics, machine learning pipelines, and data engineering workflows. The framework enables developers to define data transformations as simple Python functions, where each function represents a node in a dataflow graph and its parameters define dependencies on other nodes. Hamilton automatically analyzes these functions and constructs a directed acyclic graph...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    MCPJam

    MCPJam

    Postman for MCPs - A tool for testing and debugging MCPs

    Inspector by MCPJam is a visual developer tool—akin to Postman—for testing and debugging MCP servers, with capabilities to simulate and trace tool execution via various transports and LLM integrations.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 10
    CodeBurn

    CodeBurn

    See where your AI coding tokens go

    CodeBurn is a security-focused tool designed to evaluate and stress-test codebases using adversarial techniques, often leveraging AI to identify vulnerabilities and weaknesses. It simulates attack scenarios against code to uncover potential security risks, helping developers proactively identify issues before they reach production. The system is designed to integrate into development workflows, allowing continuous testing as code evolves. It emphasizes automation, enabling large-scale...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 11
    Strix

    Strix

    Open-source AI hackers to find and fix your app’s vulnerabilities

    Strix is an open source agent-driven security platform that uses autonomous AI agents to identify, investigate, and validate vulnerabilities in software applications. The system is designed to mimic the behavior of real attackers by executing dynamic testing and verifying findings through proof-of-concept exploitation. Unlike traditional vulnerability scanners that rely heavily on static analysis, Strix agents actively run code, probe systems, and attempt exploitation to confirm whether vulnerabilities are genuinely exploitable. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 12
    Ollama Grid Search

    Ollama Grid Search

    A multi-platform desktop application to evaluate and compare LLM

    Ollama Grid Search is a desktop application designed to automate the evaluation and comparison of large language models, prompts, and inference parameters in a structured and repeatable way. Instead of manually testing combinations, the tool performs grid search experiments by iterating across different models, prompt variations, and parameter configurations, allowing users to quickly identify optimal setups for specific tasks. It provides a visual interface where experiment results can be...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 13
    PentestAgent

    PentestAgent

    AI agent framework for black-box security testing

    PentestAgent is an open-source autonomous security testing platform designed to help organizations identify vulnerabilities and assess security posture by simulating real-world attack scenarios without manual intervention. It brings a modular and automated approach to penetration testing by orchestrating a suite of tools and scripts that can emulate common exploitation techniques, reconnaissance workflows, and post-exploitation activities across targets.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    FuzzyAI Fuzzer

    FuzzyAI Fuzzer

    A powerful tool for automated LLM fuzzing

    ...The framework can be integrated into development pipelines to continuously test AI APIs and detect weaknesses before deployment. FuzzyAI provides testing tools, datasets, and evaluation workflows that help researchers measure how well models resist harmful instructions or attempts to bypass safety mechanisms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Auto Claude

    Auto Claude

    Autonomous multi-session AI coding

    Auto-Claude is an autonomous, multi-agent coding framework that organizes software work into a structured workflow where agents plan, build, and validate code with minimal manual micromanagement. Instead of relying on a single chat thread to do everything, it uses coordinated agents and a task-driven approach so multiple steps—like investigation, implementation, and testing—can be executed systematically. The project aims to make “agentic software engineering” feel like running a small...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 16
    Genkit

    Genkit

    An open source framework for building AI-powered apps

    Genkit is an open-source framework developed by Firebase for building AI-powered applications using familiar code-centric patterns. It simplifies the development, integration, and testing of AI features, providing observability and evaluation tools, and supports various models and platforms for versatile AI application development. ​
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17
    Kheish

    Kheish

    Kheish: A multi-role LLM agent for tasks like code auditing

    Kheish is a framework designed for cybersecurity professionals to automate penetration testing tasks, providing tools to streamline security assessments.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 18
    GoogleTest

    GoogleTest

    Google Testing and Mocking Framework

    GoogleTest is Google's C++ mocking and test framework. It's used by many internal projects at Google, as well as a number of notable projects such as The Chromium projects, the OpenCV computer vision library, and the LLVM compiler. This GoogleTest project is actually a union of what used to be two separate projects: the old GoogleTest and GoogleMock, an extension of GoogleTest for writing and using C++ mock classes. Since they were so closely related, they were merged to create an even...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 19
    Rogue

    Rogue

    AI Agent Evaluator & Red Team Platform

    ...The system allows developers to define specific scenarios, expected outcomes, and business rules so that the framework can verify whether an agent behaves according to required policies. During testing, Rogue records conversations and produces detailed reports that explain whether the agent passed or failed each scenario. These reports include reasoning and evidence, helping developers understand why a particular failure occurred.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 20
    kMCP

    kMCP

    Kubernetes Controller for building, testing and deploying MCP servers

    KMCP is a companion toolchain for building, testing, and deploying MCP servers with a workflow that spans local development through Kubernetes production deployments. It includes a CLI for day-to-day development tasks like scaffolding new MCP projects, managing tools, building container images, and running an MCP server locally for validation. For cluster operations, it includes a Kubernetes controller that manages MCP server lifecycles using a dedicated Custom Resource Definition (CRD),...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 21
    Claude Code Haha

    Claude Code Haha

    Claude Code leaked source - locally runnable version

    Claude Code Haha is an experimental and often humorous adaptation of Claude-style coding agents, designed to explore and demonstrate how agentic coding systems behave under different configurations and prompts. While it retains the core functionality of analyzing and modifying codebases, the project introduces variations that highlight both the strengths and quirks of autonomous coding assistants. It serves as a sandbox for testing how agents interpret instructions, manage context, and...
    Downloads: 19 This Week
    Last Update:
    See Project
  • 22
    adversarial-spec

    adversarial-spec

    A Claude Code plugin that iteratively refines product specifications

    adversarial-spec is a framework focused on designing and testing systems using adversarial thinking to uncover weaknesses and improve robustness. It encourages developers to define specifications that anticipate failure modes, edge cases, and malicious inputs before implementing solutions. The project emphasizes proactive design, ensuring that systems are built with resilience in mind from the beginning. It provides structured approaches for identifying vulnerabilities and stress-testing...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Prompt Optimizer

    Prompt Optimizer

    A prompt word optimizer to help write high-quality prompt words

    Prompt-Optimizer is a high-impact AI prompt engineering tool designed to help users craft better, more effective prompts for large language models, boosting the quality and relevance of AI responses. It focuses on automating and streamlining the iterative refinement of prompts by analyzing examples, comparing original and optimized text, and guiding users through multi-round improvements that surface clarity, structure, and specificity. With support for different deployment modes including...
    Downloads: 20 This Week
    Last Update:
    See Project
  • 24
    MCP Server and GW

    MCP Server and GW

    An MCP stdio to HTTP SSE transport gateway with example server

    mcp-server-and-gw is a Model Context Protocol (MCP) gateway that bridges standard input/output to HTTP Server-Sent Events (SSE) transport. It includes an example MCP server and client, facilitating the development and testing of MCP implementations. This tool is particularly useful for integrating MCP servers with applications like Claude Desktop that may not natively support remote server connections. ​
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    LitterBox

    LitterBox

    A secure sandbox environment for malware developers and red teamers

    LitterBox is a controlled malware-analysis and payload-testing sandbox aimed at red teams who need to validate evasions and behaviors before deployment. It provides an isolated environment to exercise payloads against modern detection stacks, verify signatures and heuristics, and observe runtime characteristics without leaking binaries to third-party vendors. The README frames typical use cases: testing evasion, validating detections, analyzing behavior, and keeping sensitive tooling...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB