gpu max performance free download

DLRM

An implementation of a deep learning recommendation model (DLRM)

...The architecture combines dense (MLP) and sparse (embedding) branches, then interacts features via dot product or feature interactions before passing through further dense layers to predict click-through, ranking scores, or conversion probabilities. The implementation is optimized for performance at scale, supporting multi-GPU and multi-node execution, quantization, embedding partitioning, and pipelined I/O to feed huge embeddings efficiently. It includes data loaders for standard benchmarks (like Criteo), training scripts, evaluation tools, and capabilities like mixed precision, gradient compression, and memory fusion to maximize throughput.

Downloads: 0 This Week

Last Update: 2026-01-12

See Project

DeepSpeed

Deep learning optimization library: makes distributed training easy

DeepSpeed is an easy-to-use deep learning optimization software suite that enables unprecedented scale and speed for Deep Learning Training and Inference. With DeepSpeed you can: 1. Train/Inference dense or sparse models with billions or trillions of parameters 2. Achieve excellent system throughput and efficiently scale to thousands of GPUs 3. Train/Inference on resource constrained GPU systems 4. Achieve unprecedented low latency and high throughput for inference 5. Achieve extreme...

Downloads: 0 This Week

Last Update: 17 hours ago

See Project

pipeless

A computer vision framework to create and deploy apps in minutes

...You can easily use industry-standard models, such as YOLO, or load your custom model in one of the supported inference runtimes. Pipeless ships some of the most popular inference runtimes, such as the ONNX Runtime, allowing you to run inference with high performance on CPU or GPU out-of-the-box. You can deploy your Pipeless application with a single command to edge and IoT devices or the cloud.

Downloads: 0 This Week

Last Update: 2024-02-23

See Project

MLPACK C++ machine learning library

MLPACK is a C++ machine learning library with emphasis on scalability, speed, and ease-of-use. Its aim is to make machine learning possible for novice users by means of a simple, consistent API, while simultaneously exploiting C++ language features to provide maximum performance and flexibility for expert users. * More info + downloads: https://mlpack.org * Git repo: https://github.com/mlpack/mlpack

Downloads: 0 This Week

Last Update: 2023-06-28

See Project

Search Results for "gpu max performance"

4 projects for "gpu max performance" with 2 filters applied:

DLRM

DeepSpeed

pipeless

MLPACK C++ machine learning library

Search Results for "gpu max performance"

4 projects for "gpu max performance" with 2 filters applied:

DLRM

DeepSpeed

pipeless

MLPACK C++ machine learning library

Related Searches

Related Categories