AWS NeuronAmazon Web Services
|
Amazon EC2 P4 InstancesAmazon
|
|||||
Related Products
|
||||||
About
It supports high-performance training on AWS Trainium-based Amazon Elastic Compute Cloud (Amazon EC2) Trn1 instances. For model deployment, it supports high-performance and low-latency inference on AWS Inferentia-based Amazon EC2 Inf1 instances and AWS Inferentia2-based Amazon EC2 Inf2 instances. With Neuron, you can use popular frameworks, such as TensorFlow and PyTorch, and optimally train and deploy machine learning (ML) models on Amazon EC2 Trn1, Inf1, and Inf2 instances with minimal code changes and without tie-in to vendor-specific solutions. AWS Neuron SDK, which supports Inferentia and Trainium accelerators, is natively integrated with PyTorch and TensorFlow. This integration ensures that you can continue using your existing workflows in these popular frameworks and get started with only a few lines of code changes. For distributed model training, the Neuron SDK supports libraries, such as Megatron-LM and PyTorch Fully Sharded Data Parallel (FSDP).
|
About
Amazon EC2 P4d instances deliver high performance for machine learning training and high-performance computing applications in the cloud. Powered by NVIDIA A100 Tensor Core GPUs, they offer industry-leading throughput and low-latency networking, supporting 400 Gbps instance networking. P4d instances provide up to 60% lower cost to train ML models, with an average of 2.5x better performance for deep learning models compared to previous-generation P3 and P3dn instances. Deployed in hyperscale clusters called Amazon EC2 UltraClusters, P4d instances combine high-performance computing, networking, and storage, enabling users to scale from a few to thousands of NVIDIA A100 GPUs based on project needs. Researchers, data scientists, and developers can utilize P4d instances to train ML models for use cases such as natural language processing, object detection and classification, and recommendation engines, as well as to run HPC applications like pharmaceutical discovery and more.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
Organizations in need of an SDK solution with a compiler, runtime, and profiling tools that unlocks high-performance and cost-effective deep learning acceleration
|
Audience
Organizations wanting a cloud-based infrastructure to optimize their machine learning training processes
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
No information available.
Free Version
Free Trial
|
Pricing
$11.57 per hour
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationAmazon Web Services
Founded: 2006
United States
aws.amazon.com/machine-learning/neuron/
|
Company InformationAmazon
Founded: 1994
United States
aws.amazon.com/ec2/instance-types/p4/
|
|||||
Alternatives |
Alternatives |
|||||
|
|
|
|||||
|
|
|
|||||
|
|
|
|||||
|
|
||||||
Categories |
Categories |
|||||
Integrations
AWS Deep Learning AMIs
AWS Deep Learning Containers
AWS Trainium
Amazon EC2 Capacity Blocks for ML
Amazon EC2 G5 Instances
Amazon EC2 Inf1 Instances
Amazon EC2 P5 Instances
Amazon EC2 Trn1 Instances
Amazon EC2 Trn2 Instances
Amazon EC2 UltraClusters
|
Integrations
AWS Deep Learning AMIs
AWS Deep Learning Containers
AWS Trainium
Amazon EC2 Capacity Blocks for ML
Amazon EC2 G5 Instances
Amazon EC2 Inf1 Instances
Amazon EC2 P5 Instances
Amazon EC2 Trn1 Instances
Amazon EC2 Trn2 Instances
Amazon EC2 UltraClusters
|
|||||
|
|
|