A high-throughput and memory-efficient inference and serving engine
Implementation of model parallel autoregressive transformers on GPUs
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
Python Optimal Transport
Autonomous GPT-4 agent platform
Extensible, parallel implementations of t-SNE
Run 100B+ language models at home, BitTorrent-style
Differentiable SDE solvers with GPU support and efficient sensitivity
Seamlessly integrate LLMs as Python functions
Fault-tolerant, highly scalable GPU orchestration
Fast Python collaborative filtering for implicit feedback datasets
Ongoing research training transformer models at scale
A text generation library with pre-trained language models github.com
Django friendly finite state machine support
Pytorch domain library for recommendation systems
Making large AI models cheaper, faster and more accessible
A unified framework for scalable computing
Stanford NLP Python library for many human languages
Distributed Deep learning with Keras & Spark
Facebook AI Research Sequence-to-Sequence Toolkit written in Python
Machine learning tool that allows you to train and test models
An implementation of model parallel GPT-2 and GPT-3-style models
PAddle PARAllel text-to-speech toolKIT
Fast & easy transfer learning for NLP
End-to-end object detection with transformers