Images to inference with no labeling
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods
GPU environment management and cluster orchestration
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere
Framework that is dedicated to making neural data processing
PyTorch extensions for fast R&D prototyping and Kaggle farming
Probabilistic reasoning and statistical analysis in TensorFlow
Low-latency REST API for serving text-embeddings
Tensor search for humans
A toolkit to optimize Keras & TensorFlow ML models for deployment
High quality, fast, modular reference implementation of SSD in PyTorch
Create HTML profiling reports from pandas DataFrame objects
Library for serving Transformers models on Amazon SageMaker
Serve machine learning models within a Docker container
Implementation of "Tree of Thoughts"
Toolbox of models, callbacks, and datasets for AI/ML researchers
Implementation of model parallel autoregressive transformers on GPUs
Sequence-to-sequence framework, focused on Neural Machine Translation
A computer vision framework to create and deploy apps in minutes
OpenMMLab Video Perception Toolbox
Training & implementation of chatbots leveraging a GPT-like architecture
Guide to deploying deep-learning inference networks
Toolkit for inference and serving with MXNet in SageMaker
CPU/GPU inference server for Hugging Face transformer models
Deploy an ML inference service on a budget in 10 lines of code