Showing 93 open source projects for "tests"

View related business solutions
  • Auth0 B2B Essentials: SSO, MFA, and RBAC Built In Icon
    Auth0 B2B Essentials: SSO, MFA, and RBAC Built In

    Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

    Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.
    Sign Up Free
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 1
    Emdash

    Emdash

    Emdash is the Open-Source Agentic Development Environment

    ...Emdash integrates deeply with development workflows by enabling users to pass tasks directly from issue trackers like GitHub, Jira, or Linear to agents, which can then generate code, run tests, and create pull requests automatically. It also supports remote development through SSH, allowing agents to operate on remote servers while maintaining the same interface and workflow.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 2
    rep+

    rep+

    Burp-style HTTP Repeater for Chrome DevTools with built‑in AI

    rep+ is a lightweight browser extension for Chrome DevTools that brings a Burp Suite-style HTTP repeater directly into the developer console, enhanced with built-in AI to help explain requests and suggest tests. It captures HTTP traffic from the inspected page without needing a proxy, allowing users to replay, modify, and analyze individual requests with fine-grained control over headers, bodies, and methods. The tool offers hierarchical grouping, tagging, and filtering of captured requests so that developers and security testers can manage complex traffic flows efficiently. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 3
    CodiumAI PR-Agent

    CodiumAI PR-Agent

    AI-Powered tool for automated pull request analysis

    CodiumAI PR-Agent is an open-source tool aiming to help developers review pull requests faster and more efficiently. It automatically analyzes the pull request and can provide several types of commands. See the Usage Guide for instructions how to run the different tools from CLI, online usage, Or by automatically triggering them when a new PR is opened. You can try GPT-4 powered PR-Agent, on your public GitHub repository, instantly. Just mention @CodiumAI-Agent and add the desired command in...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    ArrayFire

    ArrayFire

    ArrayFire, a general purpose GPU library

    ...Together we can fulfill The ArrayFire Mission under an excellent Code of Conduct that promotes a respectful and friendly building experience. Rigorous benchmarks and tests ensuring top performance and numerical accuracy. Cross-platform compatibility with support for CUDA, OpenCL, and native CPU on Windows, Mac, and Linux. Built-in visualization functions through Forge.
    Downloads: 7 This Week
    Last Update:
    See Project
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 5
    darwin-skill

    darwin-skill

    Autoresearch-inspired autonomous skill optimization for Claude Code

    darwin-skill is an experimental framework designed to automatically improve AI agent “skills” through iterative evaluation and optimization loops inspired by machine learning training processes. Instead of treating prompts or skill definitions as static assets, the system applies a continuous improvement cycle that evaluates performance, proposes changes, tests outcomes, and either retains or reverts modifications. The framework introduces a scoring system across multiple dimensions, enabling quantitative assessment of skill quality and ensuring that only improvements are preserved over time. It incorporates a “ratchet mechanism” similar to version control workflows, guaranteeing that performance never degrades as iterations progress. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    Serena

    Serena

    Agent toolkit providing semantic retrieval and editing capabilities

    Serena is a coding-focused agent toolkit that turns an LLM into a practical software-engineering agent with semantic retrieval and editing over real repositories. It operates as an MCP server (and other integrations), exposing IDE-like tools so agents can locate symbols, reason about code structure, make targeted edits, and validate changes. The toolkit is LLM-agnostic and framework-agnostic, positioning itself as a drop-in capability for different chat UIs, orchestrators, or custom agent...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 7
    AIPex

    AIPex

    AI browser automation assistant, no migration and privacy first

    AIPex is an AI-augmented development toolkit and workflow platform that aims to accelerate software productivity by integrating intelligent assistants, code generation tools, and customizable automation patterns directly into developer workflows. Rather than treating AI as a separate helper, AIPex embeds AI capabilities into common tasks like scaffolding components, generating tests, analyzing code quality, and performing refactors, allowing developers to stay in flow while benefiting from model-assisted insights. It supports modular plugin architecture so teams can extend or customize how assistants behave based on project conventions, code standards, or tooling preferences. AIPex also includes orchestration pipelines that let teams define multi-step AI-driven transformations — for example, generating code then running validation, producing documentation, and opening change requests — all within a unified pattern.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 8
    pmdarima

    pmdarima

    Statistical library designed to fill the void in Python's time series

    A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    GLM-4.5

    GLM-4.5

    GLM-4.5: Open-source LLM for intelligent agents by Z.ai

    GLM-4.5 is a cutting-edge open-source large language model designed by Z.ai for intelligent agent applications. The flagship GLM-4.5 model has 355 billion total parameters with 32 billion active parameters, while the compact GLM-4.5-Air version offers 106 billion total parameters and 12 billion active parameters. Both models unify reasoning, coding, and intelligent agent capabilities, providing two modes: a thinking mode for complex reasoning and tool usage, and a non-thinking mode for...
    Downloads: 37 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 10
    LLM Colosseum

    LLM Colosseum

    Benchmark LLMs by fighting in Street Fighter 3

    ...The system places language models inside the environment of the classic video game Street Fighter III, where they must interpret the game state and decide which actions to perform during combat. This setup creates a dynamic environment that tests reasoning, situational awareness, and decision-making abilities in real time. Instead of relying purely on reward signals as in reinforcement learning agents, the models analyze contextual information and generate strategic actions based on the game environment. Performance is evaluated using a competitive ranking system that assigns models an ELO rating based on their results across matches against other models.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 11
    Smile

    Smile

    Statistical machine intelligence and learning engine

    Smile is a fast and comprehensive machine learning engine. With advanced data structures and algorithms, Smile delivers the state-of-art performance. Compared to this third-party benchmark, Smile outperforms R, Python, Spark, H2O, xgboost significantly. Smile is a couple of times faster than the closest competitor. The memory usage is also very efficient. If we can train advanced machine learning models on a PC, why buy a cluster? Write applications quickly in Java, Scala, or any JVM...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 12
    gTTS

    gTTS

    Python library and CLI tool to interface with Google Translate

    gTTS (Google Text-to-Speech) is a Python library and command-line tool that wraps the speech functionality of Google Translate. It lets you send text to the Google Translate TTS endpoint and receive spoken audio back as MP3 data, either written to a file, a file-like object, or standard output. The library is designed to handle long texts, using a speech-specific sentence tokenizer that keeps intonation and punctuation natural while splitting requests into acceptable chunks. It supports...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 13
    Bifrost

    Bifrost

    The Fastest LLM Gateway with built in OTel observability

    ...It abstracts away the complexity of working directly with multiple backend providers (OpenAI, Anthropic, AWS Bedrock, Google Vertex, etc.), enabling you to plug in providers and switch between them without touching your client code. It is built to be high performance: in benchmark tests at 5,000 requests per second, it reportedly adds only microseconds of overhead and achieves perfect success rates with no failed requests. Bifrost supports features such as automatic fallback (failover between providers), load balancing across API keys/providers, and semantic caching to reduce latency and cost. It also includes observability with built-in metrics, tracing, logging, and supports governance features like rate limiting, access control, and cost budgeting. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    NVIDIA Earth2Studio

    NVIDIA Earth2Studio

    Open-source deep-learning framework

    ...Users can extend Earth2Studio with optional model packs, advanced data interfaces, statistical operators, and backend integrations that support flexible workflows from simple tests to large-scale operational inference.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    Expect

    Expect

    Let agents test your code in a real browser

    ...It may support chaining conditions, enabling complex validation logic without introducing unnecessary verbosity. The design suggests a focus on productivity, reducing cognitive load when writing and reviewing tests or validation scripts. It is likely adaptable across multiple contexts, including unit testing, integration testing, and runtime assertions. By abstracting repetitive validation logic, expect helps developers focus on behavior rather than implementation details. Overall, it serves as a lightweight but powerful tool for improving software reliability and clarity in testing workflows.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    OpenReview

    OpenReview

    An open-source, self-hosted AI code review bot powered by Vercel

    ...Built by Vercel Labs, it integrates directly with GitHub workflows, allowing developers to trigger intelligent code reviews by simply mentioning a bot in a pull request. The system operates in a sandboxed environment with access to the repository, enabling it to run linters, tests, and formatting tools as part of its review process. It provides detailed, line-by-line feedback and can suggest or even apply fixes directly to the codebase. OpenReview is designed for extensibility, supporting custom review skills that can be tailored to specific development needs or coding standards. Its architecture leverages Vercel’s infrastructure for scalable and reliable execution, ensuring that reviews can be resumed or retried if interrupted.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Robyn

    Robyn

    Experimental, AI/ML-powered and open sourced Marketing Mix Modeling

    Robyn is an open-source, AI/ML-powered Marketing Mix Modeling (MMM) toolkit developed by Meta Marketing Science under the “facebookexperimental” GitHub umbrella. Its goal is to democratize rigorous MMM: what traditionally required expert statisticians and expensive consulting becomes accessible to any company with data. Robyn takes in historical data (spends on different marketing channels, conversions, or revenue, and optional context or organic-media variables) and uses a combination of...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 18
    AI Marketing Skills

    AI Marketing Skills

    Open-source AI marketing skills for Claude Code

    ...The system is organized into multiple domains such as growth experimentation, sales pipeline generation, content production, outbound marketing, SEO optimization, and financial analysis, effectively covering the entire revenue lifecycle of a business. Each skill functions as an executable capability that can be invoked on demand, enabling users to perform tasks like running A/B tests, generating high-quality content, or analyzing conversion funnels with minimal manual effort.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    Devon

    Devon

    Open source AI pair programmer for coding, debugging, automation

    ...Devon integrates with multiple large language models, allowing users to choose between different providers for performance, cost, and latency considerations. It is capable of performing tasks such as debugging, writing tests, analyzing code structure, and navigating complex repositories. Devon also includes features for session management, enabling users to start, pause, and revert actions while maintaining context.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    AWS Deep Learning Containers

    AWS Deep Learning Containers

    A set of Docker images for training and serving models in TensorFlow

    AWS Deep Learning Containers (DLCs) are a set of Docker images for training and serving models in TensorFlow, TensorFlow 2, PyTorch, and MXNet. Deep Learning Containers provide optimized environments with TensorFlow and MXNet, Nvidia CUDA (for GPU instances), and Intel MKL (for CPU instances) libraries and are available in the Amazon Elastic Container Registry (Amazon ECR). The AWS DLCs are used in Amazon SageMaker as the default vehicles for your SageMaker jobs such as training, inference,...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    ASSERT

    ASSERT

    Requirement-driven evaluation harness for AI agents and LLM

    ASSERT is a requirement-driven evaluation harness for AI agents and LLM applications. It turns natural-language specifications, policies, product requirements, and launch criteria into structured tests that can be reviewed, executed, scored, and improved. The pipeline derives behavior categories, generates single-turn and multi-turn test cases, runs them against a target system, and uses an LLM judge to score conversations against the stated policies. It can evaluate hosted models, custom agents, multi-agent systems, REST clients, and frameworks such as LangGraph, CrewAI, AutoGen, DSPy, LlamaIndex, and OpenAI Agents SDK. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    MiroThinker

    MiroThinker

    MiroThinker is an open source deep research agent

    ...The platform is optimized for research tasks such as financial forecasting, knowledge discovery, and large-scale information synthesis. MiroThinker has been evaluated on several agent benchmarks and has demonstrated strong performance on tests designed to measure deep research capabilities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    VibeTensor

    VibeTensor

    Our first fully AI generated deep learning system

    ...What makes VibeTensor remarkable is that every major component, from core libraries and dispatch systems to CUDA runtime support, caching allocators, and language bindings, was created and validated by coding agents using automated builds and tests rather than manual line-by-line human coding. The system includes both a Python frontend via a torch-like API and an experimental Node.js/TypeScript interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Claude Code Subagents Command Collection

    Claude Code Subagents Command Collection

    Claude Code Subagents & Commands Collection + CLI Tool

    This repository aggregates a large set of specialized subagents and slash commands designed for Claude Code, giving developers domain-focused “teammates” they can summon on demand. Each subagent is defined by a concise role, tools, and behaviors, and ships as Markdown you can drop into your .claude/agents/ directory. The collection targets common developer workflows such as scaffolding, refactoring, test writing, documentation, security checks, and project management. It includes a CLI...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    promptmap2

    promptmap2

    A security scanner for custom LLM applications

    promptmap is an automated security scanner for custom LLM applications that focuses on prompt injection and related attack classes. The project supports both white-box and black-box testing, which means it can either run tests directly against a known model and system prompt configuration or attack an external HTTP endpoint without internal access. Its scanning workflow uses a dual-LLM architecture in which one model acts as the target being tested and another acts as a controller that evaluates whether an attack succeeded. The repository emphasizes broad coverage, including test rules for prompt stealing, jailbreaks, harmful content generation, hate-related outputs, social bias, and distraction attacks. ...
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo