DeepSeed

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective. DeepSpeed delivers extreme-scale model training for everyone, from data scientists training on massive supercomputers to those training on low-end clusters or even on a single GPU. Using current generation of GPU clusters with hundreds of devices, 3D parallelism of DeepSpeed can efficiently train deep learning models with trillions of parameters. With just a single GPU, ZeRO-Offload of DeepSpeed can train models with over 10B parameters, 10x bigger than the state of arts, democratizing multi-billion-parameter model training such that many deep learning scientists can explore bigger and better models. Sparse attention of DeepSpeed powers an order-of-magnitude longer input sequence and obtains up to 6x faster execution comparing with dense transformers.

Features

10x larger models and 10x faster training
Minimal code change
Extremely memory efficient
Extremely long sequence length
Extremely communication efficient
An initiative to enable next-generation AI capabilities at scale

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow DeepSeed

DeepSeed Web Site

User Reviews

Be the first to post a review of DeepSeed!

Additional Project Details

Registered

2021-09-23

Similar Business Software

DeepSpeed

DeepSpeed is an open source deep learning optimization library for PyTorch. It's designed to reduce computing power and memory use, and to train large distributed models with better parallelism on existing computer hardware. DeepSpeed is optimized for low latency, high throughput...

See Software
AlxBlock

AIxBlock is a blockchain-based end-to-end platform for AI, harnessing unused computing resources from BTC miners and all idle global consumer GPUs. Our platform's core training method is a hybrid distributed machine learning approach, enabling simultaneous training across multiple nodes. We...

See Software
Amazon SageMaker Model Training

Amazon SageMaker Model Training reduces the time and cost to train and tune machine learning (ML) models at scale without the need to manage infrastructure. You can take advantage of the highest-performing ML compute infrastructure currently available, and SageMaker can automatically scale...

See Software

Report inappropriate content

DeepSeed

Deep learning optimization library making distributed training easy

Features

Project Samples

Project Activity

Categories

License

Follow DeepSeed

User Reviews

Additional Project Details

Operating Systems

Programming Language

Related Categories

Registered