Open-source, high-performance AI model with advanced reasoning
Powerful mixture-of-experts (MoE) AI language model optimized for efficiency and performance
Implementation of RLHF (Reinforcement Learning from Human Feedback)
Cosmos-RL is a flexible and scalable Reinforcement Learning framework
Reinforcement learning environment frameworks for language models
Jupyter Notebook tutorials for REINVENT 3.2
Reinforced Recommendation toolkit built around PyTorch 1.7