Simple, Pythonic building blocks to evaluate LLM applications
PyTorch library of curated Transformer models and their components
Bringing BERT into modernity via both architecture changes and scaling
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Concatenate a directory full of files into a single prompt
MoBA: Mixture of Block Attention for Long-Context LLMs
Low-code framework for building custom LLMs, neural networks
Neural Network architecture based on ideas of the original LSTM