AWS EC2 Trn3 InstancesAmazon
|
||||||
Related Products
|
||||||
About
Amazon EC2 Trn3 UltraServers are AWS’s newest accelerated computing instances, powered by the in-house Trainium3 AI chips and engineered specifically for high-performance deep-learning training and inference workloads. These UltraServers are offered in two configurations, a “Gen1” with 64 Trainium3 chips and a “Gen2” with up to 144 Trainium3 chips per UltraServer. The Gen2 configuration delivers up to 362 petaFLOPS of dense MXFP8 compute, 20 TB of HBM memory, and a staggering 706 TB/s of aggregate memory bandwidth, making it one of the highest-throughput AI compute platforms available. Interconnects between chips are handled by a new “NeuronSwitch-v1” fabric to support all-to-all communication patterns, which are especially important for large models, mixture-of-experts architectures, or large-scale distributed training.
|
About
Build, train, and deploy models faster at scale with fully managed infrastructure, tools, and workflows.
Deploy custom AI & LLMs on any infrastructure in seconds and scale inference with ease. Handle your most demanding tasks with batch job scheduling, only paying with per-second billing. Optimize costs with GPU usage, spot instances, and built-in automatic failover. Train with a single command with YAML, simplifying complex infrastructure setups. Automatically scale up workers during high traffic and scale down to zero during inactivity. Deploy cutting-edge models with persistent endpoints in a serverless environment, optimizing resource usage. Monitor system and inference metrics in real-time, including worker count, GPU utilization, latency, and throughput. Efficiently conduct A/B testing by splitting traffic among multiple models for evaluation.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
AI researchers, data scientists, and enterprises looking for a solution for training and deploying large language models, generative-AI systems, and other deep-learning workloads
|
Audience
High-performance ML teams
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
No information available.
Free Version
Free Trial
|
Pricing
$100 + compute/month
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationAmazon
Founded: 1994
United States
aws.amazon.com/ec2/instance-types/trn3/
|
Company InformationVESSL AI
Founded: 2020
United States
vessl.ai/
|
|||||
Alternatives |
Alternatives |
|||||
|
|
||||||
|
|
||||||
|
|
|
|||||
Categories |
Categories |
|||||
Integrations
Amazon Web Services (AWS)
AWS ParallelCluster
Amazon EKS
Amazon Elastic Container Service (Amazon ECS)
Amazon SageMaker
FLUX.1
FLUX.2
Gemma
Gemma 2
Google Cloud Platform
|
Integrations
Amazon Web Services (AWS)
AWS ParallelCluster
Amazon EKS
Amazon Elastic Container Service (Amazon ECS)
Amazon SageMaker
FLUX.1
FLUX.2
Gemma
Gemma 2
Google Cloud Platform
|
|||||
|
|
|