Megatron-LM is a GPU-optimized deep learning framework from NVIDIA for training very large transformer-based language models efficiently at scale. The repository provides both a reference training implementation and Megatron Core, a composable library of high-performance building blocks for custom large-model training pipelines. It supports advanced parallelism strategies, including tensor, pipeline, data, expert, and context parallelism, enabling training across large multi-GPU, multi-node clusters. The framework offers mixed-precision training in FP16, BF16, FP8, and FP4 to maximize throughput and memory efficiency on modern hardware. Megatron-LM is widely used in research and industry for pretraining GPT-, BERT-, T5-, and multimodal-style models, with tooling for checkpoint conversion and interoperability with Hugging Face. Overall, it is a production-grade system for organizations pushing the limits of large-scale language model training.
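
As a rough illustration of the Megatron Core style, the sketch below brings up tensor- and pipeline-parallel process groups on a single node. It assumes Megatron Core and a CUDA build of PyTorch are installed and the script is launched with torchrun; the module path and function signatures reflect recent releases and may differ in others.

    # Minimal sketch: initialize Megatron Core tensor/pipeline parallel groups.
    # Assumes a torchrun launch, so RANK, WORLD_SIZE and LOCAL_RANK are set.
    import os

    import torch
    from megatron.core import parallel_state


    def init_model_parallel(tp_size: int = 2, pp_size: int = 1) -> None:
        # Bind this process to its local GPU and join the NCCL process group.
        local_rank = int(os.environ["LOCAL_RANK"])
        torch.cuda.set_device(local_rank)
        torch.distributed.init_process_group(backend="nccl")
        # Carve the world into tensor- and pipeline-parallel groups.
        parallel_state.initialize_model_parallel(
            tensor_model_parallel_size=tp_size,
            pipeline_model_parallel_size=pp_size,
        )


    if __name__ == "__main__":
        init_model_parallel()
        print("tensor-parallel rank:", parallel_state.get_tensor_model_parallel_rank())

Launched on two GPUs (for example, torchrun --nproc_per_node=2), the two ranks would form a single 2-way tensor-parallel group and report tensor-parallel ranks 0 and 1.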

Features

  • GPU-optimized transformer training
  • Advanced parallelism strategies
  • Mixed precision training support (see the sketch after this list)
  • Composable Megatron Core library
  • Hugging Face checkpoint conversion
  • Multi-node scalable training pipelines
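
A rough sketch of how the parallelism and mixed-precision options above might be expressed as a Megatron Core configuration follows. The TransformerConfig field names are assumptions based on recent Megatron Core releases and may vary between versions.

    # Sketch: a small GPT-style configuration with BF16 mixed precision and
    # 2-way tensor parallelism. Field names assumed from Megatron Core.
    from megatron.core.transformer.transformer_config import TransformerConfig

    config = TransformerConfig(
        num_layers=12,                   # transformer blocks
        hidden_size=768,                 # model width
        num_attention_heads=12,
        bf16=True,                       # BF16 mixed-precision training
        tensor_model_parallel_size=2,    # split each layer across 2 GPUs
        pipeline_model_parallel_size=1,  # single pipeline stage
    )

Such a config object would then be handed to a Megatron Core model class together with the process groups set up earlier; the exact model constructors depend on the release in use.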

Categories

Research

License

MIT License

Additional Project Details

Programming Language

Python

Related Categories

Python Research Software

Registered

2026-02-25