Implementation of RLHF (Reinforcement Learning from Human Feedback)
Volcano Engine Reinforcement Learning for LLMs
An Easy-to-use, Scalable and High-performance RLHF Framework
Language Model Reinforcement Learning Environment Frameworks
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Optical-packet node transceiver frequency allocation