tiktoken is a fast BPE tokeniser for use with OpenAI's models
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Implementation of model parallel autoregressive transformers on GPUs
An implementation of model parallel GPT-2 and GPT-3-style models
CTC-based forced aligner for audio-text in 158 languages