performance testing free download

28 projects for "performance testing" with 2 filters applied:

Artificial Intelligence ChromeOS Clear Filters & Widen Search

Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
1

Nestia

NestJS Helper + AI Chatbot Development

Nestia is a high-performance toolkit and ecosystem built on top of NestJS that enhances backend development by introducing strongly typed APIs, automated SDK generation, and advanced tooling for scalable server applications. It is designed to eliminate much of the boilerplate typically associated with API development by leveraging pure TypeScript types to automatically generate validation logic, API documentation, and client SDKs. One of its defining advantages is its focus on performance...

Downloads: 7 This Week

Last Update: 2026-05-02
See Project
2

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

...These environments require agents to interpret instructions, take actions, and adapt their strategies based on feedback from the environment. AgentBench also includes an evaluation framework that measures success rates, rewards, and task completion performance across different agent implementations. By testing models across diverse scenarios, the benchmark highlights strengths and weaknesses in reasoning, long-term planning, and tool usage.

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
3

Beehave

Behavior tree AI for Godot Engine

Beehave is a powerful AI behavior tree framework designed as an addon for the Godot game engine, enabling developers to create sophisticated and dynamic non-player character behaviors in games. It uses a node-based system that integrates directly into the Godot scene tree, allowing developers to visually design and organize complex AI logic in a structured and intuitive way. Behavior trees provide a modular approach to decision-making, making it easier to manage large and adaptive AI systems...

Downloads: 2 This Week

Last Update: 2026-03-18
See Project
4

TextWorld

TextWorld is a sandbox learning environment for the training

TextWorld is a learning environment designed to train reinforcement learning agents to play text-based games, where actions and observations are entirely in natural language. Developed by Microsoft Research, TextWorld focuses on language understanding, planning, and interaction in complex, narrative-driven environments. It generates games procedurally, enabling scalable testing of agents’ natural language processing and decision-making abilities.

Downloads: 5 This Week

Last Update: 2026-01-30
See Project
Build Securely on AWS with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
5

AutoAgent AI

Autonomous harness engineering

...The system is particularly useful for building domain-specific agents that need continuous performance improvement.

Downloads: 4 This Week

Last Update: 2026-04-28
See Project
6

Agent Behavior Monitoring

The open source post-building layer for agents

Agent Behavior Monitoring is an open-source framework designed to monitor, evaluate, and improve the behavior of AI agents operating in real or simulated environments. The system focuses on agent behavior monitoring by collecting interaction data and analyzing how agents perform across different scenarios and tasks. Developers can use the framework to observe agent actions in both online production environments and offline evaluation settings, making it useful for debugging and performance...

Downloads: 5 This Week

Last Update: 2026-04-09
See Project
7

LangWatch

The platform for LLM evaluations and AI agent testing

LangWatch is an open-source observability and monitoring platform designed to help developers evaluate and improve applications built with large language models. The platform provides tools for tracking model interactions, analyzing prompt behavior, and identifying issues such as hallucinations, latency problems, or unexpected responses. By collecting telemetry data from AI applications, LangWatch allows developers to understand how their systems perform in real-world usage scenarios. The...

Downloads: 2 This Week

Last Update: 5 days ago
See Project
8

Paddler

Open-source LLM load balancer and serving platform for hosting LLMs

Paddler is an open-source LLM infrastructure platform designed to deploy, manage, and scale large language models on private infrastructure. The system acts as a specialized load balancer and serving layer for language models, enabling organizations to run inference workloads without relying on external API providers. It supports running models locally through engines such as llama.cpp while distributing requests across multiple compute nodes to improve performance and reliability. The...

Downloads: 0 This Week

Last Update: 2026-04-30
See Project
9

NOFX

Open source AI trading OS for autonomous multi-model trading systems

NOFX is an open source AI-powered trading operating system designed to automate financial trading workflows using autonomous AI agents. It acts as an infrastructure layer that transforms market data into AI-driven trade decisions and execution. Instead of requiring users to manually configure machine learning models, data sources, and API integrations, the system allows AI components to perceive market conditions, select models, and perform trading actions automatically. It supports running...

Downloads: 5 This Week

Last Update: 24 hours ago
See Project
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.

Start Free
10

GLM-4.1V

GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

...Given its open-source availability under the same project repository, it provides an accessible entry point for testing multimodal reasoning and building proof-of-concept applications.

Downloads: 0 This Week

Last Update: 2026-04-06
See Project
11

Made With ML

Learn how to develop, deploy and iterate on production-grade ML

...The project focuses on bridging the gap between experimental machine learning notebooks and real-world software systems that can be deployed, monitored, and maintained at scale. It provides structured lessons and practical code examples that demonstrate how to design machine learning workflows, manage datasets, train models, evaluate performance, and deploy inference services. The repository organizes these concepts into modular Python scripts that follow software engineering best practices such as testing, configuration management, logging, and version control. Through a combination of tutorials, notebooks, and production-ready scripts, the project demonstrates how machine learning applications should be developed as maintainable systems rather than isolated experiments.

Downloads: 1 This Week

Last Update: 2026-03-10
See Project
12

Hallucination Leaderboard

Leaderboard Comparing LLM Performance at Producing Hallucinations

Hallucination Leaderboard is an open research project that tracks and compares the tendency of large language models to produce hallucinated or inaccurate information when generating summaries. The project provides a standardized benchmark that evaluates different models using a dedicated hallucination detection system known as the Hallucination Evaluation Model. Each model is tested on document summarization tasks to measure how often generated responses introduce information that is not...

Downloads: 1 This Week

Last Update: 2026-04-29
See Project
13

Kiln

Open source platform for managing, testing, and deploying AI apps

...Kiln emphasizes reproducibility, enabling users to track changes to prompts and models while comparing outputs across different configurations. Kiln also supports systematic testing of AI systems by defining evaluation criteria and running experiments to assess performance over time. Its workflow-oriented approach helps teams move from experimentation to production by organizing assets and results in a consistent format. It is particularly useful for teams working with large language models who need visibility into how changes impact outputs and overall system quality.

Downloads: 0 This Week

Last Update: 2026-04-23
See Project
14

MiMo-V2-Flash

MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation

MiMo-V2-Flash is a large Mixture-of-Experts language model designed to deliver strong reasoning, coding, and agentic-task performance while keeping inference fast and cost-efficient. It uses an MoE setup where a very large total parameter count is available, but only a smaller subset is activated per token, which helps balance capability with runtime efficiency. The project positions the model for workflows that require tool use, multi-step planning, and higher throughput, rather than only...

Downloads: 6 This Week

Last Update: 2026-01-08
See Project
15

ChainForge

An open-source visual programming environment

ChainForge is an open-source visual programming environment designed to help developers systematically test, compare, and evaluate prompts and outputs across multiple large language models in a structured and scalable way. Instead of relying on isolated prompt experimentation, it introduces a dataflow-based interface that allows users to create complex prompt pipelines and evaluate them across different models, parameters, and datasets simultaneously. The platform enables rapid...

Downloads: 1 This Week

Last Update: 2026-03-18
See Project
16

Browserbase MCP Server

Allow LLMs to control a browser with Browserbase and Stagehand

...The project provides a standardized interface for connecting AI systems to real-world web environments, allowing them to navigate pages, extract structured data, and perform user-like actions such as clicking, typing, and form submission. It leverages Browserbase infrastructure along with Stagehand to deliver high-performance browser automation with improved speed and efficiency through caching and optimized execution pipelines. The system supports multiple AI models and integrates seamlessly into agent workflows, making it suitable for applications such as web scraping, testing, and intelligent automation. It also includes advanced capabilities such as screenshot capture, DOM analysis, and session persistence, enabling complex interactions across multiple browsing sessions.

Downloads: 1 This Week

Last Update: 2026-03-31
See Project
17

Humanoid-Gym

Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real

Humanoid-Gym is a reinforcement learning framework designed to train locomotion and control policies for humanoid robots using high-performance simulation environments. The system is built on top of NVIDIA Isaac Gym, which allows large-scale parallel simulation of robotic environments directly on GPU hardware. Its primary goal is to enable efficient training of humanoid robots in simulation while enabling policies to transfer effectively to real-world hardware without additional training....

Downloads: 0 This Week

Last Update: 2026-03-15
See Project
18

Responsible AI Toolbox

Responsible AI Toolbox is a suite of tools providing model

Responsible AI Toolbox is a software framework designed to help developers evaluate and improve the reliability, fairness, and transparency of machine learning systems. The project provides tools that assist in analyzing model behavior, detecting bias, improving robustness, and explaining predictions produced by AI systems. It is designed to integrate with common machine learning frameworks, especially PyTorch, allowing developers to apply responsible AI techniques within existing workflows....

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
19

CSM (Conversational Speech Model)

A Conversational Speech Generation Model

The CSM (Conversational Speech Model) is a speech generation model developed by Sesame AI that creates RVQ audio codes from text and audio inputs. It uses a Llama backbone and a smaller audio decoder to produce audio codes for realistic speech synthesis. The model has been fine-tuned for interactive voice demos and is hosted on platforms like Hugging Face for testing. CSM offers a flexible setup and is compatible with CUDA-enabled GPUs for efficient execution.

Downloads: 4 This Week

Last Update: 2025-03-19
See Project
20

AnyTrading

The most simple, flexible, and comprehensive OpenAI Gym trading

gym-anytrading is an OpenAI Gym-compatible environment designed for developing and testing reinforcement learning algorithms on trading strategies. It simulates trading environments for financial markets, including stocks and forex.

Downloads: 3 This Week

Last Update: 2025-03-13
See Project
21

OpenNN - Open Neural Networks Library

Machine learning algorithms for advanced analytics

OpenNN is a software library written in C++ for advanced analytics. It implements neural networks, the most successful machine learning method. Some typical applications of OpenNN are business intelligence (customer segmentation, churn prevention…), health care (early diagnosis, microarray analysis…) and engineering (performance optimization, predictive maitenance…). OpenNN does not deal with computer vision or natural language processing. The main advantage of OpenNN is its...

8 Reviews

Downloads: 4 This Week

Last Update: 2026-04-23
See Project
22

flutter_ume

UME is an in-app debug kits platform for Flutter

flutter_ume is an in-app debug-kit platform for Flutter applications, developed by ByteDance’s Flutter Infra team. It lets developers embed a suite of debugging tools directly into a Flutter app (during development or debug builds), enabling inspection, performance monitoring, UI debugging, network request inspection, widget hierarchy introspection, and more — all from within the running app. UME bundles multiple “plugin kits” (e.g., UI inspector, performance monitor, device info panel, network inspector via Dio, console log viewer, code viewer, touch indicator) so developers can enable only the tools they need. ...

Downloads: 3 This Week

Last Update: 2025-12-02
See Project
23

Machine Learning Financial Laboratory

MlFinLab helps portfolio managers and traders

...Its architecture emphasizes reproducibility, robust testing, and well-documented code so that researchers and practitioners can reliably experiment with financial machine learning models.

Downloads: 1 This Week

Last Update: 2026-03-11
See Project
24
$Grade School Math$

Grade School Math

8.5K high quality grade school math problems

The grade-school-math repository (sometimes called GSM8K) is a curated dataset of 8,500 high-quality grade school math word problems intended for evaluating mathematical reasoning capabilities of language models. It is structured into 7,500 training problems and 1,000 test problems. These aren’t trivial exercises — many require multi-step reasoning, combining arithmetic operations, and handling intermediate steps (e.g. “If she sold half as many in May… how many in total?”). The problems are...

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
25

DeepTraffic

DeepTraffic is a deep reinforcement learning competition

DeepTraffic is a deep reinforcement learning simulation designed to teach and evaluate autonomous driving algorithms in a dense highway environment. The system presents a simulated multi-lane highway where an AI-controlled vehicle must navigate traffic while maximizing speed and avoiding collisions. Participants design neural network policies that determine the vehicle’s actions, such as accelerating, decelerating, changing lanes, or maintaining speed. The project was created as part of an...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project