Showing 87 open source projects for "tests"

View related business solutions
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • Stop vibe-debugging. Icon
    Stop vibe-debugging.

    Plug Claude into your app's actual errors.

    AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.
    Free 30 days.
  • 1
    GoogleTest

    GoogleTest

    Google Testing and Mocking Framework

    ...Since they were so closely related, they were merged to create an even better GoogleTest. GoogleTest features an xUnit test framework, a rich set of assertions, user-defined assertions, death tests, among many others. It's been used on a variety of platforms, including Cygwin, Symbian, MinGW and PlatformIO.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 2
    OctoMind MCP

    OctoMind MCP

    An MCP server for octomind tools, resources and prompts

    The Octomind MCP Server is designed to integrate Octomind's end-to-end testing tools and resources into local development environments. It enables AI-powered interfaces to create, execute, and manage e2e tests, enhancing the testing workflow. ​
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    Qodo Cover

    Qodo Cover

    AI tool that generates tests to improve code coverage quickly

    Qodo Cover is an open source developer tool designed to automate the creation of unit tests using generative AI, helping teams improve code coverage with minimal manual effort. It operates as a command-line interface and can also be integrated into continuous integration workflows, making it adaptable to different development environments. It analyzes an existing codebase, identifies gaps in test coverage, and generates new tests that target uncovered or weakly tested areas. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Micro Agent

    Micro Agent

    AI CLI agent that writes code by iterating until tests pass

    Micro Agent is a command-line tool designed to generate and refine code using a test-driven approach powered by large language models. Instead of producing one-shot code outputs, it creates or uses test cases and repeatedly iterates on the generated code until those tests pass successfully. This workflow emphasizes reliability by using structured feedback from failing tests to guide improvements, reducing the need for manual debugging and iteration. Micro Agent intentionally limits its scope to a focused task, avoiding complex multi-file operations or full project automation in order to minimize compounding errors. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Telegram.Bot

    Telegram.Bot

    .NET Client for Telegram Bot API

    ...The guides here can even be useful to bot developers using other languages/platforms as it shows best practices in developing Telegram chatbots with examples. This project is fully tested using Unit tests and Systems Integration tests before each release. In fact, our test cases are self-documenting and serve as examples for Bot API methods. Once you learn the basics of Telegram chatbots, you will be able to easily understand the code in examples and use it in your own bot program.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    CodiumAI Cover-Agent

    CodiumAI Cover-Agent

    CodiumAI Cover-Agent: An AI-Powered Tool for Automated Test Generation

    CodiumAI Cover Agent aims to help efficiently increasing code coverage, by automatically generating qualified tests to enhance existing test suites.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    HumanEval

    HumanEval

    Code for the paper "Evaluating Large Language Models Trained on Code"

    human-eval is a benchmark dataset and evaluation framework created by OpenAI for measuring the ability of language models to generate correct code. It consists of hand-written programming problems with unit tests, designed to assess functional correctness rather than superficial metrics like text similarity. Each task includes a natural language prompt and a function signature, requiring the model to generate an implementation that passes all provided tests. The benchmark has become a standard for evaluating code generation models, including those in the Codex and GPT families. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    AutoMaker

    AutoMaker

    Start directing AI agents

    Automaker is an autonomous AI development studio designed to transform how software is built by allowing developers to describe features, then watching AI agents implement code, tests, commits, and more with minimal manual typing. Instead of writing every line of code by hand, users add feature cards to a Kanban board with natural language descriptions, and AI agents powered by the Claude Agent SDK handle multi-step tasks such as planning, generating code, running tests, and committing to an isolated git worktree. ...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 9
    HunyuanVideo

    HunyuanVideo

    HunyuanVideo: A Systematic Framework For Large Video Generation Model

    ...The framework aims to push the boundaries of video generation quality, incorporating multiple innovative approaches to improve the realism and coherence of the generated content. Release of FP8 model weights to reduce GPU memory usage / improve efficiency. Parallel inference code to speed up sampling, utilities and tests included.
    Downloads: 9 This Week
    Last Update:
    See Project
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 10
    AI Runner

    AI Runner

    Offline inference engine for art, real-time voice conversations

    ...The project has a strong focus on developer ergonomics, with thorough development guidelines, environment configuration using .env variables, and a clear structure for tests, tools and agents.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 11
    Evidently

    Evidently

    Evaluate and monitor ML models from validation to production

    Evidently is an open-source Python library for data scientists and ML engineers. It helps evaluate, test, and monitor ML models from validation to production. It works with tabular, text data and embeddings.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    Fara-7B

    Fara-7B

    An Efficient Agentic Model for Computer Use

    ...It provides stakeholders with a way to benchmark and evaluate models across dimensions such as fairness, robustness, security, privacy, and ethical considerations. Rather than relying on ad-hoc or manual review processes, FARA enables organizations to profile AI behavior using standardized tests, metrics, and reporting templates, making evaluations reproducible and comparable over time. The framework supports plugin-based modules that can be tailored to industry-specific concerns or regulatory requirements, helping compliance teams, auditors, and engineers collaborate on shared assessment goals.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    MCPJam

    MCPJam

    Postman for MCPs - A tool for testing and debugging MCPs

    Inspector by MCPJam is a visual developer tool—akin to Postman—for testing and debugging MCP servers, with capabilities to simulate and trace tool execution via various transports and LLM integrations.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 14
    rtk

    rtk

    CLI proxy that reduces LLM token consumption

    rtk is an open-source command-line proxy designed to optimize interactions between AI coding agents and the terminal by reducing unnecessary token consumption. When AI assistants execute shell commands during software development tasks, the resulting terminal output often contains large amounts of repetitive or irrelevant information that can overwhelm the model’s context window. RTK intercepts these command outputs and compresses them into concise summaries before sending them to the...
    Downloads: 44 This Week
    Last Update:
    See Project
  • 15
    AutoBE

    AutoBE

    AI Vibe Coding Agent of TS backend server

    ...Its main value is giving developers and non-programmers a structured way to generate backend systems from requirements while still producing documentation and tests.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Anthropic's Original Performance

    Anthropic's Original Performance

    Anthropic's original performance take-home, now open for you to try

    ...The project sets up a baseline performance problem where participants work to reduce simulated “clock cycles” required to run a given workload, effectively challenging them to engineer faster code under constraints. This take-home includes starter code, tests, and tools to debug performance, aiming to measure how effectively one can apply algorithmic improvements and optimizations. Because it’s framed around beating baseline scores — and even outperforming previous automated systems — it encourages both deep knowledge of Python and creative problem-solving.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    CLI Printing Press

    CLI Printing Press

    Reads official API docs, studies CLI and MCP servers

    ...It supports multiple entry paths, including direct specs, URLs without documentation, and HAR imports from browser developer tools. The project includes validation stages such as scorecards, dogfooding, proof-of-behavior checks, and optional live read-only smoke tests. Overall, CLI Printing Press aims to turn any useful web service into a practical automation surface for both humans and AI agents.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Happy Coder

    Happy Coder

    Mobile and Web client for Codex and Claude Code, with realtime voice

    ...You can start a coding session locally through the Happy CLI or connect from a phone or browser, allowing developers to inspect, interact with, and guide the AI as it generates, tests, or explains code. The project includes components like a dedicated backend server for encrypted sync, a rich front-end experience across web and native apps, and support for push notifications when your coding agent encounters permission requests or errors. Happy prioritizes security with end-to-end encryption so your code and interactions remain private and auditable.
    Downloads: 39 This Week
    Last Update:
    See Project
  • 19
    agents.md

    agents.md

    A simple, open format for guiding coding agents

    ...Instead of putting everything in README or doc files (which are more human-oriented and might mix high-level narrative), AGENTS.md is intended to surface agent-relevant details that help them “do the right thing” (tests, style, project structure, tooling).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Agentless

    Agentless

    An agentless approach to automatically solve software development

    ...It then generates multiple candidate patches for the identified locations using language model reasoning and diff-style edits. In the final stage, the framework validates potential patches by running regression tests and additional reproduction tests to confirm whether the fix resolves the original error. Based on these results, the system ranks the candidate patches and selects the most reliable solution to submit.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    ArXiv MCP Server

    ArXiv MCP Server

    A Model Context Protocol server for searching and analyzing arXiv

    ...With simple tools like “search” and “fetch,” an agent can find papers, pull abstracts, and download PDFs for downstream summarization or analysis. The project includes packaging and CI to publish to PyPI, plus tests and linting for reliability. Issue threads show feature requests such as extracting embedded LaTeX and improving markdown conversion, reflecting active community use in research flows. It’s designed to be drop-in for MCP clients, giving them typed inputs/outputs and predictable errors around a well-known academic corpus. For developers building research copilots, it removes the glue work of wiring arXiv APIs into an agent toolchain.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Kiro

    Kiro

    Kiro is an agentic IDE that works alongside you from prototype

    ...It introduces the concept of “specs,” which act as executable documentation that defines system behavior, constraints, and acceptance criteria, allowing developers to guide AI agents more precisely. Kiro also incorporates event-driven automation through “hooks,” enabling AI agents to perform tasks such as generating tests, updating documentation, or optimizing code whenever specific development events occur.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 23
    pixelmatch

    pixelmatch

    The smallest, simplest JavaScript pixel-level image comparison library

    The smallest, simplest and fastest JavaScript pixel-level image comparison library, originally created to compare screenshots in tests. Features accurate anti-aliased pixels detection and perceptual color difference metrics. Inspired by Resemble.js and Blink-diff. Unlike these libraries, pixelmatch is around 150 lines of code, has no dependencies, and works on raw typed arrays of image data, so it's blazing fast and can be used in any environment (Node or browsers).
    Downloads: 6 This Week
    Last Update:
    See Project
  • 24
    AI Chatbot Framework

    AI Chatbot Framework

    Python chatbot framework with Natural Language Understanding

    Building a chatbot can sound daunting, but it’s totally doable. AI Chatbot Framework is an AI powered conversational dialog interface built in Python. With this tool, it’s easy to create Natural Language conversational scenarios with no coding efforts whatsoever. The smooth UI makes it effortless to create and train conversations to the bot and it continuously gets smarter as it learns from conversations it has with people. AI Chatbot Framework can live on any channel of your choice (such as...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 25
    Emdash

    Emdash

    Emdash is the Open-Source Agentic Development Environment

    ...Emdash integrates deeply with development workflows by enabling users to pass tasks directly from issue trackers like GitHub, Jira, or Linear to agents, which can then generate code, run tests, and create pull requests automatically. It also supports remote development through SSH, allowing agents to operate on remote servers while maintaining the same interface and workflow.
    Downloads: 12 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next
Auth0 Logo