throughput free download

TensorRT

C++ library for high performance inference on NVIDIA GPUs

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference. It includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for deep learning inference applications. TensorRT-based applications perform up to 40X faster than CPU-only platforms during inference. With TensorRT, you can optimize neural network models trained in all major frameworks, calibrate for lower precision with high accuracy, and deploy to hyperscale data centers, embedded, or automotive product platforms. ...

Downloads: 19 This Week

Last Update: 2026-03-25

See Project

OpenMLDB

OpenMLDB is an open-source machine learning database

...However, a feature engineering script developed by data scientists (Python scripts in most cases) cannot be directly deployed into production for online inference because it usually cannot meet the engineering requirements, such as low latency, high throughput and high availability.

Downloads: 0 This Week

Last Update: 2025-02-21

See Project

DALI

A GPU-accelerated library containing highly optimized building blocks

...DALI addresses the problem of the CPU bottleneck by offloading data preprocessing to the GPU. Additionally, DALI relies on its own execution engine, built to maximize the throughput of the input pipeline.

Downloads: 0 This Week

Last Update: 2026-04-16

See Project

OnnxStream

Lightweight inference library for ONNX files, written in C++

...The recommended minimum RAM/VRAM for Stable Diffusion 1.5 is typically 8GB. Generally, major machine learning frameworks and libraries are focused on minimizing inference latency and/or maximizing throughput, all of which at the cost of RAM usage. So I decided to write a super small and hackable inference library specifically focused on minimizing memory consumption: OnnxStream. OnnxStream is based on the idea of decoupling the inference engine from the component responsible for providing the model weights, which is a class derived from WeightsProvider. ...

Downloads: 24 This Week

Last Update: 2024-08-14

See Project

Search Results for "throughput"

Showing 4 open source projects for "throughput"

TensorRT

OpenMLDB

DALI

OnnxStream

Search Results for "throughput"

Showing 4 open source projects for "throughput"

TensorRT

OpenMLDB

DALI

OnnxStream

Related Searches

Related Categories