Ongoing research training transformer models at scale
Large language model and vision-language model based on linear attention
Frameworks for language model reinforcement learning environments
Implementation of model-parallel autoregressive transformers on GPUs
Original PyTorch implementation of Cross-lingual Language Model
Toolkit for efficient experimentation with Speech Recognition
Parallel Optimization Library for Java
CRFSharp is a .NET (C#) implementation of Conditional Random Fields