Scalable data pre processing and curation toolkit for LLMs
Operating LLMs in production
Access large language models from the command-line
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
ChatGLM-6B: An Open Bilingual Dialogue Language Model
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Technical principles related to large models
lightweight package to simplify LLM API calls
Inference code for CodeLlama models
The official repo of Qwen chat & pretrained large language model
Simple, Pythonic building blocks to evaluate LLM applications
A high-performance ML model serving framework, offers dynamic batching
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
Toolkit for conversational AI
Ongoing research training transformer models at scale
Adding guardrails to large language models
Visual Instruction Tuning: Large Language-and-Vision Assistant
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
LLM
Concatenate a directory full of files into a single prompt
File Parser optimised for LLM Ingestion with no loss
Open source libraries and APIs to build custom preprocessing pipelines
Framework to easily create LLM powered bots over any dataset