Training neural networks on Apple Neural Engine via APIs
A Next-Generation Training Engine Built for Ultra-Large MoE Models
A lightweight library for PyTorch training tools and utilities
Powerful AI language model (MoE) optimized for efficiency/performance
slime is an LLM post-training framework for RL Scaling
The official repository for ERNIE 4.5 and ERNIEKit
Supercharge Your Model Training
Reference PyTorch implementation and models for DINOv3
Train any agents simply by 'talking'
Democratizing Reinforcement Learning for LLMs
Large Language Model Principles and Practice Tutorial from Scratch
Faster and easier training and deployments
Curated list of datasets and tools for post-training
Ongoing research training transformer models at scale
A general fine-tuning kit geared toward image/video/audio diffusion
PyTorch implementation of JiT
A simple, performant and scalable Jax LLM
An open-source, modern-design AI training tracking and visualization
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
Distributed AI Model Training and LLM Fine-Tuning on Kubernetes
dLLM: Simple Diffusion Language Modeling
Empowering Code Generation with OSS-Instruct
Robust recipes to align language models with human and AI preferences
Recipes to train reward model for RLHF
Deep learning optimization library: makes distributed training easy