Page 5 | benchmark free download

Showing 125 open source projects for "benchmark"

View related business solutions

Artificial Intelligence Python Clear Filters & Widen Search

$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
Build Securely on Azure with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
1

Chameleon LLM

Codes for "Chameleon: Plug-and-Play Compositional Reasoning

...By integrating various tools such as vision models, web search engines, Python functions, and rule-based modules, Chameleon delivers more accurate, up-to-date, and precise responses, making it a game-changer in the natural language processing landscape. With GPT-4 at its core, Chameleon has showcased exceptional improvements in accuracy on benchmark tasks, outperforming competitors and setting new industry standards.

Downloads: 0 This Week

Last Update: 2023-08-25
See Project
2

DIG

A library for graph deep learning research

...It includes unified implementations of data interfaces, common algorithms, and evaluation metrics for several advanced tasks. Our goal is to enable researchers to easily implement and benchmark algorithms.

Downloads: 0 This Week

Last Update: 2023-04-03
See Project
3

Merlion

A Machine Learning Framework for Time Series Intelligence

...It supports various time series learning tasks, including forecasting, anomaly detection, and change point detection for both univariate and multivariate time series. This library aims to provide engineers and researchers a one-stop solution to rapidly develop models for their specific time series needs, and benchmark them across multiple time series datasets.

Downloads: 0 This Week

Last Update: 2024-08-07
See Project
4

BCI

BCI: Breast Cancer Immunohistochemical Image Generation

...The routine evaluation of HER2 is conducted with immunohistochemical techniques (IHC), which is very expensive. Therefore, for the first time, we propose a breast cancer immunohistochemical (BCI) benchmark attempting to synthesize IHC data directly with the paired hematoxylin and eosin (HE) stained images.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
5

StudioGAN

StudioGAN is a Pytorch library providing implementations of networks

...StudioGAN aims to offer an identical playground for modern GANs so that machine learning researchers can readily compare and analyze a new idea. Moreover, StudioGAN provides an unprecedented-scale benchmark for generative models. The benchmark includes results from GANs (BigGAN-Deep, StyleGAN-XL), auto-regressive models (MaskGIT, RQ-Transformer), and Diffusion models (LSGM++, CLD-SGM, ADM-G-U). StudioGAN is a self-contained library that provides 7 GAN architectures, 9 conditioning methods, 4 adversarial losses, 13 regularization modules, 6 augmentation modules, 8 evaluation metrics, and 5 evaluation backbones. ...

Downloads: 0 This Week

Last Update: 2022-08-04
See Project
6

Fashion-MNIST

A MNIST-like fashion product database

...Each image has a resolution of 28 by 28 pixels and belongs to one of ten clothing classes, making it suitable for evaluating classification models. Because the dataset represents real-world objects rather than handwritten digits, it offers a more challenging benchmark for testing machine learning algorithms.

Downloads: 3 This Week

Last Update: 2026-03-10
See Project
7

RLCard

Reinforcement Learning / AI Bots in Card (Poker) Games

RLCard is a toolkit for reinforcement learning research on card games. It includes several popular card games and focuses on learning algorithms for imperfect information games like poker and blackjack.

Downloads: 0 This Week

Last Update: 2025-03-13
See Project
8

CodeSearchNet

Datasets, tools, and benchmarks for representation learning of code

CodeSearchNet is a large-scale dataset and research benchmark designed to advance the development of systems that retrieve source code using natural language queries. The project was created through collaboration between GitHub and Microsoft Research and aims to support research on semantic code search and program understanding. The dataset contains millions of pairs of source code functions and corresponding documentation comments extracted from open-source repositories.

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
9

GiantMIDI-Piano

Classical piano MIDI dataset

GiantMIDI-Piano is a large-scale symbolic classical piano music dataset built by applying the piano_transcription system on a vast collection of piano performance recordings. The dataset contains thousands of piano works, spanning a large number of composers and styles, with each piece transcribed into high-precision MIDI files capturing note events, pedal usage, velocities, etc. It provides a resource for music information retrieval (MIR), symbolic music modeling, composer classification,...

Downloads: 0 This Week

Last Update: 2025-12-02
See Project
Compliant and Reliable File Transfers Backed by Top Security Certifications
Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.

Start Free Trial
10
$Grade School Math$

Grade School Math

8.5K high quality grade school math problems

...The problems are written by human authors (not automatically generated) to ensure linguistic variety and realism. The repository maintains strict formatting (e.g. JSONL) for problem + answer pairs, and is used broadly in research to benchmark model performance under “word problem” settings. Issues are tracked (people report incorrect problems, ambiguous statements), and contributions are possible for cleaning or expanding the set.

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
11

Detectron2

Next-generation platform for object detection and segmentation

Detectron2 is Facebook AI Research's next generation software system that implements state-of-the-art object detection algorithms. It is a ground-up rewrite of the previous version, Detectron, and it originates from maskrcnn-benchmark. It is powered by the PyTorch deep learning framework. Includes more features such as panoptic segmentation, Densepose, Cascade R-CNN, rotated bounding boxes, PointRend, DeepLab, etc. Can be used as a library to support different projects on top of it. We'll open source more research projects in this way. It trains much faster. ...

Downloads: 0 This Week

Last Update: 2021-10-26
See Project
12

CleverHans

An adversarial example library for constructing attacks

This repository contains the source code for CleverHans, a Python library to benchmark machine learning systems' vulnerability to adversarial examples. You can learn more about such vulnerabilities on the accompanying blog. The CleverHans library is under continual development, always welcoming contributions of the latest attacks and defenses. In particular, we always welcome help with resolving the issues currently open.

Downloads: 0 This Week

Last Update: 2024-08-02
See Project
13

CRSLab

CRSLab is an open-source toolkit

CRSLab is an open-source toolkit for building Conversational Recommender System (CRS). It is developed based on Python and PyTorch. CRSLab has the following highlights. Comprehensive benchmark models and datasets: We have integrated commonly-used 6 datasets and 18 models, including graph neural network and pre-training models such as R-GCN, BERT and GPT-2. We have preprocessed these datasets to support these models, and release for downloading. Extensive and standard evaluation protocols: We support a series of widely-adopted evaluation protocols for testing and comparing different CRS. ...

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
14

XLM (Cross-lingual Language Model)

PyTorch original implementation of Cross-lingual Language Model

...Using a shared subword vocabulary, XLM learns language-agnostic features that work well for classification and sequence labeling tasks such as XNLI, NER, and POS without target-language supervision. The repository provides preprocessing pipelines, training code, and fine-tuning scripts so you can reproduce benchmark results or adapt models to your own multilingual corpora. Pretrained checkpoints cover dozens of languages and multiple model sizes, balancing quality and compute needs.

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
15

Awesome Graph Classification

Graph embedding, classification and representation learning papers

A collection of graph classification methods, covering embedding, deep learning, graph kernel and factorization papers with reference implementations. Relevant graph classification benchmark datasets are available. Similar collections about community detection, classification/regression tree, fraud detection, Monte Carlo tree search, and gradient boosting papers with implementations.

Downloads: 0 This Week

Last Update: 2021-12-16
See Project
16

MachineLearningStocks

Using python and scikit-learn to make stock predictions

...Using libraries such as pandas and scikit-learn, the repository shows how historical financial indicators can be transformed into machine learning features. The model attempts to predict whether specific stocks will outperform a benchmark index such as the S&P 500. The repository includes scripts for parsing financial statistics, building training datasets, and performing backtesting to evaluate model performance over historical periods. Because it is structured as a template project, developers are encouraged to extend or modify the pipeline to test different algorithms, features, or investment strategies.

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
17

gradslam

gradslam is an open source differentiable dense SLAM library

gradslam is an open-source framework providing differentiable building blocks for simultaneous localization and mapping (SLAM) systems. We enable the usage of dense SLAM subsystems from the comfort of PyTorch. The question of “representation” is central in the context of dense simultaneous localization and mapping (SLAM). Newer learning-based approaches have the potential to leverage data or task performance to directly inform the choice of representation. However, learning representations...

Downloads: 0 This Week

Last Update: 2022-08-22
See Project
18

TensorFlow 2.0 Tutorials

TensorFlow 2.x version's Tutorials and Examples

...Each section of the repository includes runnable code and structured experiments designed to illustrate how different architectures and algorithms function in real applications. The tutorials use well-known benchmark datasets such as MNIST, CIFAR, and Fashion-MNIST to demonstrate practical model training and evaluation workflows.

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
19

NLP-progress

Repository to track the progress in Natural Language Processing (NLP)

...It aims to cover both traditional and core NLP tasks such as dependency parsing and part-of-speech tagging as well as more recent ones such as reading comprehension and natural language inference. The main objective is to provide the reader with a quick overview of benchmark datasets and the state-of-the-art for their task of interest, which serves as a stepping stone for further research. To this end, if there is a place where results for a task are already published and regularly maintained, such as a public leaderboard, the reader will be pointed there.

Downloads: 0 This Week

Last Update: 2024-07-31
See Project
20

SMAC

SMAC: The StarCraft Multi-Agent Challenge

SMAC (StarCraft II Multi-Agent Challenge) is a benchmark environment for cooperative multi-agent reinforcement learning (MARL), based on real-time strategy (RTS) game scenarios in StarCraft II. It allows researchers to test algorithms where multiple units (agents) must collaborate to win battles against built-in game AI opponents. SMAC provides a controlled testbed for studying decentralized execution and centralized training paradigms in MARL.

Downloads: 0 This Week

Last Update: 2025-03-13
See Project
21

maskrcnn-benchmark

Fast, modular reference implementation of Instance Segmentation

Mask R-CNN Benchmark is a PyTorch-based framework that provides high-performance implementations of object detection, instance segmentation, and keypoint detection models. Originally built to benchmark Mask R-CNN and related models, it offers a clean, modular design to train and evaluate detection systems efficiently on standard datasets like COCO.

Downloads: 0 This Week

Last Update: 2025-10-06
See Project
22

SSD

A PyTorch Implementation of Single Shot MultiBox Detector

...The repository includes the major components needed for an object detection workflow, including training scripts, evaluation scripts, demos, and utility modules. It supports commonly used benchmark datasets such as PASCAL VOC and MS COCO, and it also provides scripts to simplify downloading and setting up those datasets. For training visibility, the project includes support for Visdom so users can monitor loss in real time through a browser-based interface. Its structure makes it useful both as a reference implementation for learning SSD and as a base for custom experimentation in detection research or practical computer vision projects.

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
23

HypSurGent

This program generates customizable hyper-surfaces (multi-dimensional input and output) and samples data from them to be used further as benchmark for response surface modeling tasks or optimization algorithms.

Downloads: 0 This Week

Last Update: 2016-09-05
See Project
24

DeepSeek-V3.2-Speciale

High-compute ultra-reasoning model surpassing model surpassing GPT-5

...DeepSeek-V3.2-Speciale contributed to gold-medal solutions in the 2025 IMO, IOI, ICPC World Finals, and CMO, demonstrating its ability to handle elite-level problem solving. It is released under the MIT license and includes curated benchmark solutions for community verification and analysis.

Downloads: 0 This Week

Last Update: 2025-12-01
See Project
25

Mellum-4b-base

JetBrains’ 4B parameter code model for completions

Mellum-4b-base is JetBrains’ first open-source large language model designed and optimized for code-related tasks. Built with 4 billion parameters and a LLaMA-style architecture, it was trained on over 4.2 trillion tokens across multiple programming languages, including datasets such as The Stack, StarCoder, and CommitPack. With a context window of 8,192 tokens, it excels at code completion, fill-in-the-middle tasks, and intelligent code suggestions for professional developer tools and IDEs....

Downloads: 0 This Week

Last Update: 2025-09-11
See Project