GPU environment management and cluster orchestration
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere
Open-source tool designed to enhance the efficiency of workloads
PyTorch extensions for fast R&D prototyping and Kaggle farming
Probabilistic reasoning and statistical analysis in TensorFlow
Low-latency REST API for serving text-embeddings
Easy-to-use Speech Toolkit including Self-Supervised Learning model
A toolkit to optimize ML models for deployment for Keras & TensorFlow
High quality, fast, modular reference implementation of SSD in PyTorch
Library for serving Transformers models on Amazon SageMaker
Serve machine learning models within a Docker container
Implementation of "Tree of Thoughts
Toolbox of models, callbacks, and datasets for AI/ML researchers
Lightweight anchor-free object detection model
Implementation of model parallel autoregressive transformers on GPUs
Sequence-to-sequence framework, focused on Neural Machine Translation
OpenFieldAI is an AI based Open Field Test Rodent Tracker
A computer vision framework to create and deploy apps in minutes
OpenMMLab Video Perception Toolbox
Training & Implementation of chatbots leveraging GPT-like architecture
Toolkit for allowing inference and serving with MXNet in SageMaker
CPU/GPU inference server for Hugging Face transformer models
Deploy a ML inference service on a budget in 10 lines of code