Trainable models and NN optimization tools
Train machine learning models within Docker containers
Toolkit for running TensorFlow training scripts on SageMaker
High-level training, data augmentation, and utilities for PyTorch
A lightweight library for PyTorch training tools and utilities
Supercharge Your Model Training
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Train a 26M-parameter GPT from scratch in just 2h
Faster and easier training and deployments
State-of-the-art 2D and 3D Face Analysis Project
slime is an LLM post-training framework for RL Scaling
Training Large Language Models to Reason in a Continuous Latent Space
The simplest, fastest repository for training/finetuning models
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
An open-source, modern-design AI training tracking and visualization tool
Training data (data labeling, annotation, workflow) for all data types
Ongoing research training transformer models at scale
Reference PyTorch implementation and models for DINOv3
Feature Store for Machine Learning
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Reference implementations of MLPerf™ training benchmarks
AI agents running research on single-GPU nanochat training
Unified web UI for training and running open models locally
Train multi-step agents for real-world tasks using GRPO