Showing 45 open source projects for "test cases"

View related business solutions
  • Stop vibe-debugging. Icon
    Stop vibe-debugging.

    Plug Claude into your app's actual errors.

    AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.
    Free 30 days.
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 1
    Hypothesis

    Hypothesis

    The property-based testing library for Python

    Hypothesis is a powerful library for property-based testing in Python. Instead of writing specific test cases, users define properties and Hypothesis generates random inputs to uncover edge cases and bugs. It integrates with unittest and pytest, shrinking failing examples to minimal reproducible cases. Widely adopted in production systems, Hypothesis boosts code reliability by exploring input spaces far beyond manually crafted tests.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    django-webpack-loader

    django-webpack-loader

    Transparently use webpack with django

    Use webpack to generate your static bundles without django's static files or opaque wrappers. Django webpack loader consumes the output generated by webpack-bundle-tracker and lets you use the generated bundles in Django. Test cases cover Django>=2.0 on Python>=3.5. 100% code coverage is the target so we can be sure everything works anytime. It should probably work on older versions of Django as well but the package does not ship any test cases for them. Before configuring django-webpack-loader, let's first configure what's necessary on the webpack-bundle-tracker side. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    Schemathesis

    Schemathesis

    Guarantee flawless API functionality with test scenarios

    Guarantee flawless API functionality with thorough, high-quality test scenarios generated from your API specification. Schemathesis is a specification-centric API testing tool for Open API and GraphQL-based applications. It reads the application schema and generates test cases, which will ensure that your application is compliant with its schema and never crashes. The application under test could be written in any language; the only thing you need is a valid API schema in a supported format. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    ScanAPI

    ScanAPI

    Automated Integration Testing and Live Documentation for your API

    Given an API specification, written in YAML/JSON format, ScanAPI hits the specified endpoints, runs the test cases, and generates a detailed report of this execution - which can also be used as the API documentation itself. With almost no Python knowledge, the user can define endpoints to be hit, the expected behavior for each response and will receive a full real-time diagnostic report of the API.
    Downloads: 9 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    frida

    frida

    Dynamic instrumentation toolkit for developers

    ...We want to empower the next generation of developer tools, and help other free software developers achieve interoperability through reverse engineering. We are proud that NowSecure is using Frida to do fast, deep analysis of mobile apps at scale. Frida has a comprehensive test-suite and has gone through years of rigorous testing across a broad range of use-cases.
    Downloads: 478 This Week
    Last Update:
    See Project
  • 6
    Mobly

    Mobly

    E2E test framework for tests with complex environment requirements

    Mobly is a Python-based test framework that specializes in supporting test cases that require multiple devices, complex environments, or custom hardware setups. P2P data transfer between two devices. Conference calls across three phones. Wearable device interacting with a phone. Internet-Of-Things devices interacting with each other. Testing RF characteristics of devices with special equipment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Union Pandera

    Union Pandera

    Light-weight, flexible, expressive statistical data testing library

    ...A simple, zero-configuration data testing framework for data scientists and ML engineers seeking correctness. Access a comprehensive suite of built-in tests, or easily create your own validation rules for your specific use cases. Validate the functions that produce your data by automatically generating test cases for them. Integrate seamlessly with the Python ecosystem. Overcome the initial hurdle of defining a schema by inferring one from clean data, then refine it over time. Identify the critical points in your data pipeline, and validate data going in and out of them. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    PySpur

    PySpur

    Visual tool for building, testing, and deploying AI agent workflows

    PySpur is a visual development environment designed to help AI engineers build, test, and iterate on agent-based workflows more efficiently. It provides a structured playground where users can define test cases, construct agents either through Python code or a graphical interface, and continuously refine their behavior. It addresses common challenges in AI agent development such as prompt tuning difficulties and lack of visibility into workflow execution.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 9
    invisible_playwright

    invisible_playwright

    Anti-Detect Browser that passes every bot detection test

    invisible_playwright is a stealth browser automation project that presents itself as a drop-in Playwright replacement built around a patched Firefox browser. It is designed to preserve Playwright-style automation while reducing the browser fingerprints that automated sessions often expose. The project targets advanced browser testing, AI browser workflows, and automation scenarios where normal headless browsers may be blocked or classified as bots. It includes claims about passing common...
    Downloads: 38 This Week
    Last Update:
    See Project
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 10
    Rogue

    Rogue

    AI Agent Evaluator & Red Team Platform

    Rogue is an open-source evaluation and red-team framework designed to test the reliability, safety, and policy compliance of AI agents. The platform automatically interacts with an AI agent by generating dynamic scenarios and multi-turn conversations that simulate real-world interactions. Instead of relying solely on static test scripts, Rogue uses an agent-as-a-judge architecture where one agent probes another agent to detect failures or unexpected behaviors. The system allows developers to...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 11
    ASSERT

    ASSERT

    Requirement-driven evaluation harness for AI agents and LLM

    ...It turns natural-language specifications, policies, product requirements, and launch criteria into structured tests that can be reviewed, executed, scored, and improved. The pipeline derives behavior categories, generates single-turn and multi-turn test cases, runs them against a target system, and uses an LLM judge to score conversations against the stated policies. It can evaluate hosted models, custom agents, multi-agent systems, REST clients, and frameworks such as LangGraph, CrewAI, AutoGen, DSPy, LlamaIndex, and OpenAI Agents SDK. ASSERT is designed to close the gap between what a system is supposed to do and what evaluation actually measures. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    AICGSecEval

    AICGSecEval

    A.S.E (AICGSecEval) is a repository-level AI-generated code security

    ...The project was developed to address concerns that AI-assisted programming tools may produce insecure code containing vulnerabilities such as injection flaws or unsafe logic. The framework constructs evaluation tasks based on real-world software repositories and known vulnerability cases derived from CVE records. By simulating realistic development scenarios, the benchmark assesses how well AI code generation systems handle security-sensitive programming tasks. AICGSecEval combines static and dynamic evaluation techniques to analyze generated code for vulnerabilities and functional correctness. The framework includes datasets, test cases, and evaluation metrics that measure how AI programming tools perform across multiple programming languages and vulnerability categories.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    GoPay Workflow Orchestrator

    GoPay Workflow Orchestrator

    Lightweight process orchestration framework for regional payment

    ...It includes an orchestrator-style structure and supporting utilities, including an OTP-forwarding component for controlled testing environments. Because it interacts with payment and verification flows, it should only be used with accounts, systems, and test cases the user is authorized to operate. Its main purpose is payment integration analysis, automated testing, and workflow observability.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    Atheris

    Atheris

    A Coverage-Guided, Native Python Fuzzer

    Atheris is a coverage-guided fuzzer for CPython that treats Python as a first-class fuzzing target, enabling rapid discovery of crashes and logic errors in pure-Python code and native extensions. It hooks into Python’s interpreter to collect fine-grained coverage and uses that signal to evolve inputs, pushing programs into previously unexplored code paths. Because many Python libraries are thin wrappers over C/C++ code, Atheris is equally adept at surfacing memory safety issues in extension...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    aisuite

    aisuite

    Simple, unified interface to multiple Generative AI providers

    ...Using an interface similar to OpenAI's, aisuite makes it easy to interact with the most popular LLMs and compare the results. It is a thin wrapper around Python client libraries and allows creators to seamlessly swap out and test responses from different LLM providers without changing their code. Today, the library is primarily focused on chat completions. We will expand it to cover more use cases in the near future. Currently supported providers are - OpenAI, Anthropic, Azure, Google, AWS, Groq, Mistral, HuggingFace and Ollama. To maximize stability, aisuite uses either the HTTP endpoint or the SDK for making calls to the provider.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    AWS X-Ray SDK for Python

    AWS X-Ray SDK for Python

    AWS X-Ray SDK for the Python programming language

    ...If customer wants to enable generating secure random trace and entity id for all the (sampled/unsampled) requests (this is applicable for trace id injection into logs use case) then they should set the AWS_XRAY_NOOP_ID environment variable as False. Oftentimes, it may be useful to be able to disable X-Ray for specific use cases, whether to stop X-Ray from sending traces at any moment or to test code functionality that originally depended on X-Ray instrumented packages to begin segments prior to the code call. For example, if your application relied on an XRayMiddleware to instrument incoming web requests, and you have a method that begins subsegments based on the segment generated.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    Manim Skills Repository

    Manim Skills Repository

    Agent skills for Manim to create 3Blue1Brown style animations

    ...The project aims to wrap common animation logic into structured skills that can be invoked programmatically, reducing boilerplate and allowing creators to focus on creative content rather than low-level animation setup. With clear installation instructions and a collection of test cases, it’s approachable for developers comfortable with Python and interested in educational or technical visualization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Aden Hive

    Aden Hive

    Outcome driven agent development framework that evolves

    Hive is an open-source agent development framework that helps developers build autonomous, reliable, self-improving AI agents by letting them describe goals in ordinary natural language instead of hand-coding detailed workflows. Rather than manually defining execution graphs, Hive’s coding agent generates the agent graph, connection code, and test cases based on your high-level objectives, enabling outcome-driven agent creation that fits real business processes. Once deployed, agents can capture failure data, evolve automatically to meet their success criteria, and redeploy without constant manual intervention, delivering continual improvement over time. The framework also includes human-in-the-loop nodes, credential management, cost and budget controls, and real-time observability so teams can monitor execution and intervene as needed. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 19
    Yao Open Prompts

    Yao Open Prompts

    A Chinese AI prompt vocabulary covering work, learning, content, etc.

    ...It is designed as a practical catalog that users can browse, copy, adapt, and test in their preferred AI model. The project also includes templates, references, maintenance checklists, scripts, and a complete catalog for easier navigation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    FlowLens MCP

    FlowLens MCP

    Open-source MCP server that gives your coding agent

    FlowLens MCP Server is an open-source tool designed to give AI-powered coding agents (like Claude Code, Cursor, GitHub Copilot / Codex, and others) full, replayable browser context to dramatically improve debugging, bug reporting, and regression testing for web applications. It works together with a companion browser extension: when a user reproduces a bug or a complicated UI interaction, the extension captures a rich session log, including screen/video recording, network traffic, console...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Encord Active

    Encord Active

    The toolkit to test, validate, and evaluate your models and surface

    ...Discover errors, outliers, and edge-cases within your data - all in one open source toolkit. Get a high level overview of your data distribution, explore it by customizable quality metrics, and discover any anomalies. Use powerful similarity search to find more examples of edge-cases or outliers.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 22
    Aviary

    Aviary

    Ray Aviary - evaluate multiple LLMs easily

    Aviary is an LLM serving solution that makes it easy to deploy and manage a variety of open source LLMs. Providing an extensive suite of pre-configured open source LLMs, with defaults that work out of the box. Supporting Transformer models hosted on Hugging Face Hub or present on local disk. Aviary has native support for autoscaling and multi-node deployments thanks to Ray and Ray Serve. Aviary can scale to zero and create new model replicas (each composed of multiple GPU workers) in...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    OWASP Mobile Application Security

    OWASP Mobile Application Security

    Manual for mobile app security testing and reverse engineering

    The OWASP Mobile Application Security (MAS) flagship project provides a security standard for mobile apps (OWASP MASVS) and a comprehensive testing guide (OWASP MASTG) that covers the processes, techniques, and tools used during a mobile app security test, as well as an exhaustive set of test cases that enables testers to deliver consistent and complete results. MAS Advocates are industry adopters of the OWASP MASVS and MASTG who have invested a significant and consistent amount of resources to push the project forward by providing consistent high-impact contributions and continuously spreading the word.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 24
    CodeContests

    CodeContests

    Large dataset of coding contests designed for AI and ML model training

    ...CodeContests aggregates problems and human-written solutions from multiple programming competition platforms, including AtCoder, Codeforces, CodeChef, Aizu, and HackerEarth. Each problem includes structured metadata, problem descriptions, paired input/output test cases, and multiple correct and incorrect solutions in various programming languages. The dataset is distributed in Riegeli format using Protocol Buffers, with separate training, validation, and test splits for reproducible machine learning experiments.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25

    Classic HWUT

    Software Unit Tests (Language Independent Approach)

    ...Tests are run by a simple call to hwut in a base directory of a project. In particular for C, HWUT supports make file generation using 'sos' and 'sols' modes. Remote control-able function stubs may be generated using the 'stub' mode. Test cases can be generated using the 'gen' mode, and state machine walkers by the 'sm_walker' mode. Over the last decade HWUT has matured towards a full fledged unit test tool for a wide variety of applications.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Auth0 Logo