A library for accelerating Transformer models on NVIDIA GPUs
Library for OCR-related tasks powered by Deep Learning
Images to inference with no labeling
A set of Docker images for training and serving models in TensorFlow
Low-latency REST API for serving text-embeddings
Standardized Serverless ML Inference Platform on Kubernetes
High quality, fast, modular reference implementation of SSD in PyTorch
A graphical manager for ollama that can manage your LLMs
Database system for building simpler and faster AI-powered application
Serve machine learning models within a Docker container
Toolbox of models, callbacks, and datasets for AI/ML researchers
Lightweight anchor-free object detection model
Sequence-to-sequence framework, focused on Neural Machine Translation
Toolkit for allowing inference and serving with MXNet in SageMaker