Audience

Companies looking for an advanced deep learning inference solution

About AWS Inferentia

AWS Inferentia accelerators are designed by AWS to deliver high performance at the lowest cost for your deep learning (DL) inference applications. The first-generation AWS Inferentia accelerator powers Amazon Elastic Compute Cloud (Amazon EC2) Inf1 instances, which deliver up to 2.3x higher throughput and up to 70% lower cost per inference than comparable GPU-based Amazon EC2 instances. Many customers, including Airbnb, Snap, Sprinklr, Money Forward, and Amazon Alexa, have adopted Inf1 instances and realized their performance and cost benefits. The first-generation Inferentia has 8 GB of DDR4 memory per accelerator and also features a large amount of on-chip memory. Inferentia2 offers 32 GB of HBM2e memory per accelerator, increasing total memory by 4x and memory bandwidth by 10x over the first-generation Inferentia.
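As a rough illustration of the figures quoted above, the following Python sketch works through the arithmetic. The function names and the example GPU baseline values are hypothetical and chosen only for illustration; the "up to" figures are best-case bounds from the description, not measured results.

```python
# Figures quoted in the product description above.
INF1_MEMORY_GB = 8          # DDR4 per first-generation Inferentia accelerator
INF2_MEMORY_GB = 32         # HBM2e per Inferentia2 accelerator
MAX_COST_REDUCTION = 0.70   # "up to 70% lower cost per inference"
MAX_THROUGHPUT_GAIN = 2.3   # "up to 2.3x higher throughput"


def memory_ratio(new_gb: float, old_gb: float) -> float:
    """Return how many times larger the new memory capacity is."""
    return new_gb / old_gb


def best_case_inf1_cost(gpu_cost_per_inference: float) -> float:
    """Lowest Inf1 cost per inference implied by the 'up to 70%' claim."""
    return gpu_cost_per_inference * (1.0 - MAX_COST_REDUCTION)


def best_case_inf1_throughput(gpu_throughput: float) -> float:
    """Highest Inf1 throughput implied by the 'up to 2.3x' claim."""
    return gpu_throughput * MAX_THROUGHPUT_GAIN


if __name__ == "__main__":
    # 32 GB / 8 GB confirms the 4x memory increase stated in the text.
    print(memory_ratio(INF2_MEMORY_GB, INF1_MEMORY_GB))  # 4.0

    # Hypothetical GPU baseline: $1.00 per million inferences, 1,000 inferences/s.
    print(round(best_case_inf1_cost(1.00), 2))
    print(round(best_case_inf1_throughput(1000.0), 1))
```

Actual savings depend on the model, batch size, and the instance types being compared, so these numbers should be read as upper bounds only.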

Company Information

Amazon
Founded: 2006
United States
aws.amazon.com/machine-learning/inferentia/

Product Details

Platforms Supported: SaaS
Training: Documentation
Support: Online

AWS Inferentia Product Features

Deep Learning

Visualization
Image Segmentation
Document Classification
ML Algorithm Library
Self-Learning
Neural Network Modeling
Model Training
Convolutional Neural Networks

Infrastructure-as-a-Service (IaaS)

Data Migration
Load Balancing
Network Monitoring
Analytics / Reporting
Configuration Management
Data Security
Performance Monitoring
SLA Monitoring
Log Access