The Transformer architecture has improved the performance of deep learning models in domains such as computer vision and natural language processing. Better performance, however, comes with larger model sizes, which run up against the memory wall of current accelerator hardware such as GPUs. Training large models such as Vision Transformer, BERT, and GPT on a single GPU or a single machine is no longer practical, so there is an urgent demand to train models in a distributed environment. However, distributed training, especially model parallelism, often requires domain expertise in computer systems and architecture, and implementing complex distributed training solutions remains a challenge for AI researchers. Colossal-AI provides a collection of parallel components so that you can write your distributed deep learning models just like you write models on your laptop.
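
As a minimal sketch of what this looks like in practice, the snippet below follows the launch-and-initialize workflow from the project's legacy documentation. The entry points `colossalai.launch_from_torch` and `colossalai.initialize` come from that documented workflow; the toy model, random data, and the `config.py` file are hypothetical placeholders, not a definitive implementation.

```python
# train.py -- a minimal sketch of the Colossal-AI training workflow
# (legacy API). The toy model and random data are hypothetical placeholders.
import colossalai
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset


def main():
    # Set up the distributed environment from the launcher's environment
    # variables, reading parallel settings from a configuration file.
    colossalai.launch_from_torch(config='config.py')

    model = nn.Sequential(nn.Linear(32, 64), nn.ReLU(), nn.Linear(64, 10))
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
    criterion = nn.CrossEntropyLoss()
    dataset = TensorDataset(torch.randn(256, 32),
                            torch.randint(0, 10, (256,)))
    train_dataloader = DataLoader(dataset, batch_size=32)

    # Wrap model, optimizer, loss, and data into an engine that hides the
    # parallelism details behind a familiar training-loop interface.
    engine, train_dataloader, _, _ = colossalai.initialize(
        model, optimizer, criterion, train_dataloader)

    engine.train()
    for inputs, labels in train_dataloader:
        engine.zero_grad()
        outputs = engine(inputs)
        loss = engine.criterion(outputs, labels)
        engine.backward(loss)
        engine.step()


if __name__ == '__main__':
    main()
```

Launched with a standard distributed launcher (for example `torchrun --nproc_per_node 8 train.py`), the same training-loop code then runs under whatever parallelism the configuration file describes.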

Features

  • Heterogeneous Memory Management
  • Up to 24x larger model size on the same hardware
  • Pull from DockerHub
  • Build On Your Own
  • Parallelism strategies
  • Parallelism based on a configuration file (see the sketch after this list)
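
As a hypothetical illustration of the configuration-file approach, the sketch below uses the dict-style `config.py` format from the project's legacy documentation. The `parallel` field follows that documented format, but the specific sizes, the `2d` tensor-parallel mode, and the extra training settings are assumptions for illustration only.

```python
# config.py -- a hypothetical Colossal-AI configuration sketch in the
# legacy dict-style format; sizes and mode are illustrative assumptions.

# Run on 8 GPUs: 2 pipeline stages x 4-way tensor parallelism in 2D mode.
parallel = dict(
    pipeline=2,                      # number of pipeline stages
    tensor=dict(size=4, mode='2d'),  # tensor-parallel group size and mode
)

# Additional training settings the script can read after
# colossalai.launch_from_torch(config='config.py').
BATCH_SIZE = 128
NUM_EPOCHS = 10
```

The appeal of this design is that the parallelism strategy lives entirely in the configuration file, so switching from, say, pipeline parallelism to tensor parallelism does not require rewriting the training script itself.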

License

Apache License 2.0
