NanoGPT is a minimal yet capable reimplementation of GPT-style transformers created by Andrej Karpathy for educational and research use. It distills the GPT architecture into a few hundred lines of Python, making it far easier to follow than large, production-scale implementations. The repo contains a training pipeline (dataset preprocessing, model definition, optimizer, training loop) and an inference script, so you can train a small GPT on text datasets such as Shakespeare or a custom corpus. The code emphasizes readability and clarity: the training loop is cleanly written and avoids heavy abstractions, letting students follow the architecture step by step. Despite its simplicity, it can train non-trivial models on modern GPUs and generate coherent text, and it has become widely used in tutorials, courses, and experiments by people learning how transformers work under the hood.
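To make the architecture concrete, here is a minimal sketch of the kind of pre-norm transformer block the codebase is built from, written in plain PyTorch. The class name `Block` and the hyperparameters are illustrative, and `nn.MultiheadAttention` is used as a convenience stand-in for the repo's own hand-written causal attention module, so this is not the project's exact code.

```python
import torch
import torch.nn as nn

class Block(nn.Module):
    """One pre-norm transformer block: causal self-attention followed by an MLP."""
    def __init__(self, n_embd: int, n_head: int):
        super().__init__()
        self.ln1 = nn.LayerNorm(n_embd)
        self.attn = nn.MultiheadAttention(n_embd, n_head, batch_first=True)
        self.ln2 = nn.LayerNorm(n_embd)
        self.mlp = nn.Sequential(
            nn.Linear(n_embd, 4 * n_embd),
            nn.GELU(),
            nn.Linear(4 * n_embd, n_embd),
        )

    def forward(self, x):
        # causal mask: each position may only attend to itself and earlier positions
        T = x.size(1)
        mask = torch.triu(torch.ones(T, T, dtype=torch.bool, device=x.device), diagonal=1)
        h = self.ln1(x)
        attn_out, _ = self.attn(h, h, h, attn_mask=mask, need_weights=False)
        x = x + attn_out            # residual connection around attention
        x = x + self.mlp(self.ln2(x))  # residual connection around the MLP
        return x

# example: a batch of 4 sequences, 16 tokens long, with 64-dim embeddings
x = torch.randn(4, 16, 64)
print(Block(n_embd=64, n_head=4)(x).shape)  # torch.Size([4, 16, 64])
```

A full GPT stacks several such blocks between a token/position embedding layer and a final linear head that produces next-token logits.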
Features
- Compact GPT transformer implementation in plain Python/PyTorch
- Data preprocessing pipeline for text datasets (e.g. Shakespeare)
- Training loop with clear optimizer and scheduler setup
- Inference script for text generation after training (a sketch of the end-to-end flow follows this list)
- Readable, educational codebase (few hundred lines)
- Supports running on modern GPUs for small to mid-sized models
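As a rough illustration of how these pieces fit together, the sketch below encodes a tiny corpus at the character level, trains a toy language model with AdamW, and then samples from it autoregressively. It is not the repo's actual scripts: `TinyLM` and `get_batch` are hypothetical names, and the model is a deliberately tiny stand-in for the real GPT, kept small so the data-prep / train / generate flow stays visible.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# --- character-level data preparation (in the spirit of the Shakespeare example) ---
text = "All the world's a stage, and all the men and women merely players."
chars = sorted(set(text))
stoi = {c: i for i, c in enumerate(chars)}
itos = {i: c for c, i in stoi.items()}
data = torch.tensor([stoi[c] for c in text], dtype=torch.long)

block_size = 16  # context length

def get_batch(batch_size=8):
    """Sample random (input, next-character target) windows from the corpus."""
    ix = torch.randint(len(data) - block_size - 1, (batch_size,))
    x = torch.stack([data[i:i + block_size] for i in ix])
    y = torch.stack([data[i + 1:i + 1 + block_size] for i in ix])
    return x, y

# --- toy model: a stand-in for the real GPT (embedding -> linear head) ---
class TinyLM(nn.Module):
    def __init__(self, vocab_size, n_embd=32):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, n_embd)
        self.head = nn.Linear(n_embd, vocab_size)

    def forward(self, idx):
        return self.head(self.emb(idx))  # (B, T, vocab_size) logits

model = TinyLM(len(chars))
opt = torch.optim.AdamW(model.parameters(), lr=3e-3)

# --- training loop: cross-entropy on next-character prediction ---
for step in range(200):
    xb, yb = get_batch()
    logits = model(xb)
    loss = F.cross_entropy(logits.view(-1, logits.size(-1)), yb.view(-1))
    opt.zero_grad()
    loss.backward()
    opt.step()

# --- autoregressive sampling, as an inference script would do ---
idx = torch.tensor([[stoi["A"]]])
for _ in range(50):
    logits = model(idx[:, -block_size:])          # crop to the context window
    probs = F.softmax(logits[:, -1, :], dim=-1)   # distribution over the next character
    idx = torch.cat([idx, torch.multinomial(probs, 1)], dim=1)
print("".join(itos[int(i)] for i in idx[0]))
```

The real project follows the same shape, with the toy model replaced by a multi-layer GPT, a larger dataset prepared ahead of time, and extras such as learning-rate scheduling, checkpointing, and GPU/mixed-precision support.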