CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
CodeGeeX2: A More Powerful Multilingual Code Generation Model
LLM training code for MosaicML foundation models
Curated list of datasets and tools for post-training
Code for Language models can explain neurons in language models paper
Training Language Models to Follow Instructions with Human Feedback