EasyR1
An Efficient, Scalable, Multi-Modality RL Training Framework
...It emphasizes memory-efficient training strategies so that you can train long-context or reasoning-heavy models on commodity GPUs. The framework is also organized to make training strategies easy to compare (e.g., pure SFT vs. preference optimization), so you can see which ones actually improve metrics on math, code, and multi-step reasoning tasks. For teams exploring open reasoning models, EasyR1 provides an opinionated yet flexible path from dataset to deployable checkpoint.