Showing 66 open source projects for "performance testing"

  • 1
    Made With ML

    Learn how to develop, deploy and iterate on production-grade ML

    ...The project focuses on bridging the gap between experimental machine learning notebooks and real-world software systems that can be deployed, monitored, and maintained at scale. It provides structured lessons and practical code examples that demonstrate how to design machine learning workflows, manage datasets, train models, evaluate performance, and deploy inference services. The repository organizes these concepts into modular Python scripts that follow software engineering best practices such as testing, configuration management, logging, and version control. Through a combination of tutorials, notebooks, and production-ready scripts, the project demonstrates how machine learning applications should be developed as maintainable systems rather than isolated experiments.
    Downloads: 1 This Week
  • 2
    MiniMax-M2.5

    State of the art LLM and coding model

    MiniMax-M2.5 is a state-of-the-art foundation model extensively trained with reinforcement learning across hundreds of thousands of real-world environments. It delivers leading performance in coding, agentic tool use, search, and complex office workflows, achieving top benchmark scores such as 80.2% on SWE-Bench Verified and 76.3% on BrowseComp. Designed to reason efficiently and decompose tasks like an experienced architect, M2.5 plans features, structure, and system design before generating code. The model supports full-stack development across web, mobile, and desktop platforms, covering the entire lifecycle from system design to testing and code review. ...
    Downloads: 11 This Week
  • 3
    Hallucination Leaderboard

    Leaderboard Comparing LLM Performance at Producing Hallucinations

    Hallucination Leaderboard is an open research project that tracks and compares the tendency of large language models to produce hallucinated or inaccurate information when generating summaries. The project provides a standardized benchmark that evaluates different models using a dedicated hallucination detection system known as the Hallucination Evaluation Model. Each model is tested on document summarization tasks to measure how often generated responses introduce information that is not...
    Downloads: 1 This Week
  • 4
    Kiln

    Open source platform for managing, testing, and deploying AI apps

    ...Kiln emphasizes reproducibility, enabling users to track changes to prompts and models while comparing outputs across different configurations. Kiln also supports systematic testing of AI systems by defining evaluation criteria and running experiments to assess performance over time. Its workflow-oriented approach helps teams move from experimentation to production by organizing assets and results in a consistent format. It is particularly useful for teams working with large language models who need visibility into how changes impact outputs and overall system quality.
    Downloads: 0 This Week
  • 5
    ComfyUI-WanVideoWrapper

    ComfyUI wrapper nodes for WanVideo and related models

    The ComfyUI-WanVideoWrapper project is a custom node extension for ComfyUI that enables advanced video generation workflows using WanVideo diffusion models. It acts as a standalone wrapper layer that allows developers and creators to integrate experimental features and models without modifying the core ComfyUI codebase. This design makes it easier to rapidly test new capabilities such as text-to-video and image-to-video generation while avoiding compatibility issues with the main framework....
    Downloads: 0 This Week
  • 6
    MiMo-V2-Flash

    MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation

    MiMo-V2-Flash is a large Mixture-of-Experts language model designed to deliver strong reasoning, coding, and agentic-task performance while keeping inference fast and cost-efficient. It uses an MoE setup where a very large total parameter count is available, but only a smaller subset is activated per token, which helps balance capability with runtime efficiency. The project positions the model for workflows that require tool use, multi-step planning, and higher throughput, rather than only...
    Downloads: 6 This Week
  • 7
    ChainForge

    An open-source visual programming environment

    ChainForge is an open-source visual programming environment designed to help developers systematically test, compare, and evaluate prompts and outputs across multiple large language models in a structured and scalable way. Instead of relying on isolated prompt experimentation, it introduces a dataflow-based interface that allows users to create complex prompt pipelines and evaluate them across different models, parameters, and datasets simultaneously. The platform enables rapid...
    Downloads: 1 This Week
  • 8
    Browserbase MCP Server

    Allow LLMs to control a browser with Browserbase and Stagehand

    ...The project provides a standardized interface for connecting AI systems to real-world web environments, allowing them to navigate pages, extract structured data, and perform user-like actions such as clicking, typing, and form submission. It leverages Browserbase infrastructure along with Stagehand to deliver high-performance browser automation with improved speed and efficiency through caching and optimized execution pipelines. The system supports multiple AI models and integrates seamlessly into agent workflows, making it suitable for applications such as web scraping, testing, and intelligent automation. It also includes advanced capabilities such as screenshot capture, DOM analysis, and session persistence, enabling complex interactions across multiple browsing sessions.
    Downloads: 1 This Week
  • 9
    Humanoid-Gym

    Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real

    Humanoid-Gym is a reinforcement learning framework designed to train locomotion and control policies for humanoid robots using high-performance simulation environments. The system is built on top of NVIDIA Isaac Gym, which allows large-scale parallel simulation of robotic environments directly on GPU hardware. Its primary goal is to enable efficient training of humanoid robots in simulation while enabling policies to transfer effectively to real-world hardware without additional training....
    Downloads: 0 This Week
  • 10
    Responsible AI Toolbox

    Responsible AI Toolbox is a suite of tools providing model

    Responsible AI Toolbox is a software framework designed to help developers evaluate and improve the reliability, fairness, and transparency of machine learning systems. The project provides tools that assist in analyzing model behavior, detecting bias, improving robustness, and explaining predictions produced by AI systems. It is designed to integrate with common machine learning frameworks, especially PyTorch, allowing developers to apply responsible AI techniques within existing workflows....
    Downloads: 0 This Week
  • 11
    Arthur Bench

    Bench is a tool for evaluating LLMs for production use cases

    Whether you are comparing different LLMs, considering different prompts, or testing generation hyperparameters such as temperature and number of tokens, Bench provides one touch point for all your LLM performance evaluation.
    Downloads: 2 This Week
  • 12
    CSM (Conversational Speech Model)

    A Conversational Speech Generation Model

    The CSM (Conversational Speech Model) is a speech generation model developed by Sesame AI that creates RVQ audio codes from text and audio inputs. It uses a Llama backbone and a smaller audio decoder to produce audio codes for realistic speech synthesis. The model has been fine-tuned for interactive voice demos and is hosted on platforms like Hugging Face for testing. CSM offers a flexible setup and is compatible with CUDA-enabled GPUs for efficient execution.
    Downloads: 4 This Week
  • 13
    GB PavementMat

    An AI-based software for prediction of pavement material properties

    During pavement quality control, numerous laboratory tests such as California Bearing Ratio (CBR), Proctor compaction test, and Plasticity Index (PI) are conducted to evaluate material performance and suitability. GB PavementMat leverages advanced AI-based models developed using artificial neural networks to accurately predict these critical properties directly from sieve analysis data. This significantly reduces laboratory testing time, cost, and delays while maintaining high reliability. The combined prediction model achieves an accuracy of 97%, while the dedicated Proctor test model reaches an accuracy of 98%, delivering fast, intelligent, and dependable pavement material assessment.
    Downloads: 0 This Week
  • 14
    ZeroStep

    Supercharge your Playwright tests with AI

    ZeroStep is a tool that enhances Playwright tests with AI capabilities, aiming to improve the efficiency and effectiveness of end-to-end testing.
    Downloads: 2 This Week
  • 15
    HttpRunner

    Testing framework that began with API and performance testing

    HttpRunner is an open-source testing framework that began with API and performance testing and has evolved into a general, extensible test platform. The current major version is implemented in Go, with the legacy Python edition split to a separate repository; this shift emphasizes a single, fast, cross-platform runtime for modern pipelines. It provides declarative test cases, data-driven parametrization, and plugin mechanisms so teams can compose reusable steps and validations at scale. ...
    Downloads: 3 This Week
  • 16
    Alink

    Alink is the Machine Learning algorithm platform based on Flink

    Alink is Alibaba’s scalable machine learning algorithm platform built on Apache Flink, designed for batch and stream data processing. It provides a wide variety of ready-to-use ML algorithms for tasks like classification, regression, clustering, recommendation, and more. Written in Java and Scala, Alink is suitable for enterprise-grade big data applications where performance and scalability are crucial. It supports model training, evaluation, and deployment in real-time environments and...
    Downloads: 2 This Week
  • 17
    WebArena

    Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

    WebArena is a realistic web environment designed for building and testing autonomous agents, providing a platform for developing web-based AI agents.
    Downloads: 1 This Week
  • 18
    spaGO

    Self-contained Machine Learning and Natural Language Processing lib

    ...Except for a few linear algebra operations written in assembly for optimal performance (a bit of copying from Gonum), it's straightforward Go code, so you don't have to worry.
    Downloads: 0 This Week
  • 19
    AnyTrading

    The most simple, flexible, and comprehensive OpenAI Gym trading

    gym-anytrading is an OpenAI Gym-compatible environment designed for developing and testing reinforcement learning algorithms on trading strategies. It simulates trading environments for financial markets, including stocks and forex.
    Downloads: 3 This Week
  • 20
    Coframe

    Coframe brings your UX to life with AI-powered optimization

    ...Coframe brings the content of your app or website to life through AI-powered optimization, personalization, and overall self-improvement. It takes minutes to integrate, and the ROI is clear to measure. Your website or app gains self-enhancing abilities with Coframe, learning from real-world performance. It's A/B testing, but with a serious upgrade. Coframe uses the latest in AI to generate copy that is tailored to your users. Resulting performance data is fed back in to continuously improve your platform's content. With Coframe, your website or app works for you 24/7, not the other way around. All it takes to get up and running is a few lines of code. ...
    Downloads: 1 This Week
  • 21
    Hyperformer

    Hypergraph Transformer for Skeleton-based Action Recognition

    This is the official implementation of our paper "Hypergraph Transformer for Skeleton-based Action Recognition." Skeleton-based action recognition aims to recognize human actions given human joint coordinates with skeletal interconnections. By defining a graph with joints as vertices and their natural connections as edges, previous works successfully adopted Graph Convolutional Networks (GCNs) to model joint co-occurrences and achieved superior performance. More recently, a limitation of...
    Downloads: 0 This Week
  • 22
    OpenNN - Open Neural Networks Library

    Machine learning algorithms for advanced analytics

    OpenNN is a software library written in C++ for advanced analytics. It implements neural networks, the most successful machine learning method. Some typical applications of OpenNN are business intelligence (customer segmentation, churn prevention…), health care (early diagnosis, microarray analysis…) and engineering (performance optimization, predictive maintenance…). OpenNN does not deal with computer vision or natural language processing. The main advantage of OpenNN is its...
    Downloads: 4 This Week
  • 23
    flutter_ume

    UME is an in-app debug-kit platform for Flutter

    flutter_ume is an in-app debug-kit platform for Flutter applications, developed by ByteDance’s Flutter Infra team. It lets developers embed a suite of debugging tools directly into a Flutter app (during development or debug builds), enabling inspection, performance monitoring, UI debugging, network request inspection, widget hierarchy introspection, and more — all from within the running app. UME bundles multiple “plugin kits” (e.g., UI inspector, performance monitor, device info panel, network inspector via Dio, console log viewer, code viewer, touch indicator) so developers can enable only the tools they need. ...
    Downloads: 3 This Week
  • 24
    Procgen

    Procedurally-Generated Game-Like Gym-Environments

    Procgen (short for Procedural Generation Benchmark) is a suite of 16 procedurally generated, game-like reinforcement learning environments designed to evaluate generalization and sample efficiency in RL agents. Unlike fixed, deterministic environments, Procgen generates new levels (layouts, obstacles, visual variation) each episode, making it impossible for an agent to simply memorize trajectories. The environments are designed to run very quickly (thousands of steps per second on a single...
    Downloads: 4 This Week
  • 25
    Machine Learning Financial Laboratory

    MlFinLab helps portfolio managers and traders

    ...Its architecture emphasizes reproducibility, robust testing, and well-documented code so that researchers and practitioners can reliably experiment with financial machine learning models.
    Downloads: 1 This Week