Automatic code review tool for GitLab, based on large language models
GLM-4 series: Open Multilingual Multimodal Chat LMs
Seamlessly integrate LLMs into scikit-learn
Constrained Value Alignment via Safe Reinforcement Learning
Recipes to train reward models for RLHF
Scalable RL solution for advanced reasoning of language models
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Robust recipes to align language models with human and AI preferences
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Official Repo for ICML 2024 paper
An implementation of model parallel GPT-2 and GPT-3-style models