A high-performance ML model serving framework, offers dynamic batching
Unified Model Serving Framework
Low-latency REST API for serving text-embeddings
Integrate, train and manage any AI models and APIs with your database
State-of-the-art diffusion models for image and audio generation
PyTorch extensions for fast R&D prototyping and Kaggle farming
A lightweight vision library for performing large object detection
Library for serving Transformers models on Amazon SageMaker
Lightweight Python library for adding real-time multi-object tracking
MII makes low-latency and high-throughput inference possible
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Tensor search for humans
A unified framework for scalable computing
AIMET is a library that provides advanced quantization and compression
PyTorch library of curated Transformer models and their components
A library to communicate with ChatGPT, Claude, Copilot, Gemini
A toolkit to optimize ML models for deployment for Keras & TensorFlow
An easy-to-use LLMs quantization package with user-friendly apis
The unofficial python package that returns response of Google Bard
A graphical manager for ollama that can manage your LLMs
Images to inference with no labeling
Open platform for training, serving, and evaluating language models
Visual Instruction Tuning: Large Language-and-Vision Assistant
High quality, fast, modular reference implementation of SSD in PyTorch
OpenMMLab Model Deployment Framework