Beyond the Imitation Game collaborative benchmark for measuring
Code for the paper "Language models can explain neurons in language models"
Evals is a framework for evaluating LLMs and LLM systems
CSGHub is a brand-new open-source platform for managing LLMs
Dramatron uses large language models to generate coherent scripts
New set of lightweight state-of-the-art, open foundation models
Collection of tutorials for Prompt Engineering techniques
Curated list of datasets and tools for post-training
DeepSeek LLM: Let there be answers
Implementations for various Generative AI Agent techniques
Fully private LLM chatbot that runs entirely in a browser
Course to get into Large Language Models (LLMs)
Open-source, high-performance Mixture-of-Experts large language model
Open-Source Financial Large Language Models!
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
An Open Bilingual Chat LLM
Open source large language model by Alibaba
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Editing large language models within 10 seconds
Implementation of model parallel autoregressive transformers on GPUs
Code for the paper "Fine-Tuning Language Models from Human Preferences"
Training and serving large-scale neural networks
Training Language Models to Follow Instructions with Human Feedback
8.5K high-quality grade school math problems