Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
GPU environment management and cluster orchestration
Framework that is dedicated to making neural data processing
Open-source tool designed to enhance the efficiency of workloads
State-of-the-art Parameter-Efficient Fine-Tuning
PyTorch extensions for fast R&D prototyping and Kaggle farming
Probabilistic reasoning and statistical analysis in TensorFlow
The Triton Inference Server provides an optimized cloud
Low-latency REST API for serving text-embeddings
Replace OpenAI GPT with another LLM in your app
Open platform for training, serving, and evaluating language models
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Tensor search for humans
Run 100B+ language models at home, BitTorrent-style
OpenMMLab Model Deployment Framework
A toolkit to optimize ML models for deployment for Keras & TensorFlow
A lightweight vision library for performing large object detection
Create HTML profiling reports from pandas DataFrame objects
Serve machine learning models within a Docker container
Powering Amazon custom machine learning chips
Implementation of "Tree of Thoughts
Toolbox of models, callbacks, and datasets for AI/ML researchers
Lightweight anchor-free object detection model
Implementation of model parallel autoregressive transformers on GPUs
Sequence-to-sequence framework, focused on Neural Machine Translation