Library for serving Transformers models on Amazon SageMaker
A set of Docker images for training and serving models in TensorFlow
Standardized Serverless ML Inference Platform on Kubernetes
Optimizing inference proxy for LLMs
State-of-the-art Parameter-Efficient Fine-Tuning
Tensor search for humans
Probabilistic reasoning and statistical analysis in TensorFlow
Powering Amazon custom machine learning chips
Low-latency REST API for serving text-embeddings
Database system for building simpler and faster AI-powered applications
Lightweight Python library for adding real-time multi-object tracking
Uncover insights, surface problems, monitor, and fine-tune your LLM
High-quality, fast, modular reference implementation of SSD in PyTorch
Open platform for training, serving, and evaluating language models
Libraries for applying sparsification recipes to neural networks
Gaussian processes in TensorFlow
Visual Instruction Tuning: Large Language-and-Vision Assistant
A library to communicate with ChatGPT, Claude, Copilot, Gemini
OpenAI-style API for open large language models
A Unified Library for Parameter-Efficient Learning
Images to inference with no labeling
Adversarial Robustness Toolbox (ART) - Python Library for ML security
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods
Open-source tool designed to enhance the efficiency of workloads
OpenMMLab Model Deployment Framework