NVIDIA TensorRTNVIDIA
|
OpenCLThe Khronos Group
|
|||||
Related Products
|
||||||
About
NVIDIA TensorRT is an ecosystem of APIs for high-performance deep learning inference, encompassing an inference runtime and model optimizations that deliver low latency and high throughput for production applications. Built on the CUDA parallel programming model, TensorRT optimizes neural network models trained on all major frameworks, calibrating them for lower precision with high accuracy, and deploying them across hyperscale data centers, workstations, laptops, and edge devices. It employs techniques such as quantization, layer and tensor fusion, and kernel tuning on all types of NVIDIA GPUs, from edge devices to PCs to data centers. The ecosystem includes TensorRT-LLM, an open source library that accelerates and optimizes inference performance of recent large language models on the NVIDIA AI platform, enabling developers to experiment with new LLMs for high performance and quick customization through a simplified Python API.
|
About
OpenCL (Open Computing Language) is an open, royalty-free standard for cross-platform parallel programming of heterogeneous computing systems that lets developers accelerate computing tasks by leveraging diverse processors such as CPUs, GPUs, DSPs, and FPGAs across supercomputers, cloud servers, personal computers, mobile devices, and embedded platforms. It defines a programming framework including a C-based language for writing compute kernels and a runtime API to control devices, manage memory, and execute parallel code, giving portable and efficient access to heterogeneous hardware. OpenCL improves speed and responsiveness for a wide range of applications including creative tools, scientific and medical software, vision processing, and neural network training and inferencing by offloading compute-intensive work to accelerator processors.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
Machine learning engineers and data scientists seeking a tool to optimize their deep learning operations
|
Audience
Software developers and engineers building high-performance parallel applications that need portable, cross-platform computing across diverse processors
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
Free
Free Version
Free Trial
|
Pricing
No information available.
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationNVIDIA
Founded: 1993
United States
developer.nvidia.com/tensorrt
|
Company InformationThe Khronos Group
Founded: 2000
United States
www.khronos.org
|
|||||
Alternatives |
Alternatives |
|||||
|
|
||||||
|
|
||||||
|
|
||||||
Categories |
Categories |
|||||
Integrations
C
CUDA
Dataoorts GPU Cloud
GitHub
Google
Hugging Face
Kimi K2
LaunchX
MATLAB
NVIDIA Broadcast
|
Integrations
C
CUDA
Dataoorts GPU Cloud
GitHub
Google
Hugging Face
Kimi K2
LaunchX
MATLAB
NVIDIA Broadcast
|
|||||
|
|
|