Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Deep learning optimization library: makes distributed training easy
A graphical manager for ollama that can manage your LLMs
Run 100B+ language models at home, BitTorrent-style
Toolbox of models, callbacks, and datasets for AI/ML researchers
Implementation of model parallel autoregressive transformers on GPUs
Guide to deploying deep-learning inference networks
Training & Implementation of chatbots leveraging GPT-like architecture