Amazon Elastic InferenceAmazon
|
||||||
Related Products
|
||||||
About
Amazon Elastic Inference allows you to attach low-cost GPU-powered acceleration to Amazon EC2 and Sagemaker instances or Amazon ECS tasks, to reduce the cost of running deep learning inference by up to 75%. Amazon Elastic Inference supports TensorFlow, Apache MXNet, PyTorch and ONNX models. Inference is the process of making predictions using a trained model. In deep learning applications, inference accounts for up to 90% of total operational costs for two reasons. Firstly, standalone GPU instances are typically designed for model training - not for inference. While training jobs batch process hundreds of data samples in parallel, inference jobs usually process a single input in real time, and thus consume a small amount of GPU compute. This makes standalone GPU inference cost-inefficient. On the other hand, standalone CPU instances are not specialized for matrix operations, and thus are often too slow for deep learning inference.
|
About
Exafunction optimizes your deep learning inference workload, delivering up to a 10x improvement in resource utilization and cost. Focus on building your deep learning application, not on managing clusters and fine-tuning performance. In most deep learning applications, CPU, I/O, and network bottlenecks lead to poor utilization of GPU hardware. Exafunction moves any GPU code to highly utilized remote resources, even spot instances. Your core logic remains an inexpensive CPU instance. Exafunction is battle-tested on applications like large-scale autonomous vehicle simulation. These workloads have complex custom models, require numerical reproducibility, and use thousands of GPUs concurrently. Exafunction supports models from major deep learning frameworks and inference runtimes. Models and dependencies like custom operators are versioned so you can always be confident you’re getting the right results.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
IT teams that need an advanced Infrastructure as a Service solution
|
Audience
Enterprises searching for a solution to optimize their deep learning inference workload
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
No information available.
Free Version
Free Trial
|
Pricing
No information available.
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationAmazon
Founded: 2006
United States
aws.amazon.com/machine-learning/elastic-inference/
|
Company InformationExafunction
exafunction.com
|
|||||
Alternatives |
Alternatives |
|||||
|
|
||||||
|
|
|
|||||
|
|
||||||
|
|
||||||
Categories |
Categories |
|||||
Integrations
PyTorch
TensorFlow
Amazon EC2
Amazon EC2 G4 Instances
Amazon Web Services (AWS)
MXNet
|
Integrations
PyTorch
TensorFlow
Amazon EC2
Amazon EC2 G4 Instances
Amazon Web Services (AWS)
MXNet
|
|||||
|
|
|