Audience

IT teams that need an advanced Infrastructure as a Service solution

About Amazon Elastic Inference

Amazon Elastic Inference allows you to attach low-cost GPU-powered acceleration to Amazon EC2 and Sagemaker instances or Amazon ECS tasks, to reduce the cost of running deep learning inference by up to 75%. Amazon Elastic Inference supports TensorFlow, Apache MXNet, PyTorch and ONNX models. Inference is the process of making predictions using a trained model. In deep learning applications, inference accounts for up to 90% of total operational costs for two reasons. Firstly, standalone GPU instances are typically designed for model training - not for inference. While training jobs batch process hundreds of data samples in parallel, inference jobs usually process a single input in real time, and thus consume a small amount of GPU compute. This makes standalone GPU inference cost-inefficient. On the other hand, standalone CPU instances are not specialized for matrix operations, and thus are often too slow for deep learning inference.

Integrations

Ratings/Reviews

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Company Information

Amazon
Founded: 2006
United States
aws.amazon.com/machine-learning/elastic-inference/

Videos and Screen Captures

Other Useful Business Software
Gen AI apps are built with MongoDB Atlas Icon
Gen AI apps are built with MongoDB Atlas

Build gen AI apps with an all-in-one modern database: MongoDB Atlas

MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
Start Free

Product Details

Platforms Supported
Cloud
Training
Documentation
Support
Online

Amazon Elastic Inference Frequently Asked Questions

Q: What kinds of users and organization types does Amazon Elastic Inference work with?
Q: What languages does Amazon Elastic Inference support in their product?
Q: What kind of support options does Amazon Elastic Inference offer?
Q: What other applications or services does Amazon Elastic Inference integrate with?
Q: What type of training does Amazon Elastic Inference provide?

Amazon Elastic Inference Product Features

Infrastructure-as-a-Service (IaaS)

Data Migration
Load Balancing
Network Monitoring
Analytics / Reporting
Configuration Management
Data Security
Performance Monitoring
SLA Monitoring
Log Access