DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective. DeepSpeed delivers extreme-scale model training for everyone, from data scientists training on massive supercomputers to those training on low-end clusters or even on a single GPU. Using current generation of GPU clusters with hundreds of devices, 3D parallelism of DeepSpeed can efficiently train deep learning models with trillions of parameters. With just a single GPU, ZeRO-Offload of DeepSpeed can train models with over 10B parameters, 10x bigger than the state of arts, democratizing multi-billion-parameter model training such that many deep learning scientists can explore bigger and better models. Sparse attention of DeepSpeed powers an order-of-magnitude longer input sequence and obtains up to 6x faster execution comparing with dense transformers.

Features

  • 10x larger models and 10x faster training
  • Minimal code change
  • Extremely memory efficient
  • Extremely long sequence length
  • Extremely communication efficient
  • An initiative to enable next-generation AI capabilities at scale

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow DeepSeed

DeepSeed Web Site

You Might Also Like
SKUDONET Open Source Load Balancer Icon
SKUDONET Open Source Load Balancer

Take advantage of Open Source Load Balancer to elevate your business security and IT infrastructure with a custom ADC Solution.

SKUDONET ADC, operates at the application layer, efficiently distributing network load and application load across multiple servers. This not only enhances the performance of your application but also ensures that your web servers can handle more traffic seamlessly.
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of DeepSeed!

Additional Project Details

Operating Systems

Windows

Programming Language

Python

Related Categories

Python Libraries, Python Machine Learning Software, Python Deep Learning Frameworks

Registered

2021-09-23