Unified Model Serving Framework
Deep learning optimization library: makes distributed training easy
Low-latency REST API for serving text-embeddings
A library for accelerating Transformer models on NVIDIA GPUs
Standardized Serverless ML Inference Platform on Kubernetes
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
LLM training code for MosaicML foundation models
A lightweight vision library for performing large object detection
Create HTML profiling reports from pandas DataFrame objects
Library for serving Transformers models on Amazon SageMaker
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Tensor search for humans
Powering Amazon custom machine learning chips
OpenMLDB is an open-source machine learning database
A GPU-accelerated library containing highly optimized building blocks
Serve machine learning models within a Docker container
LLMFlows - Simple, Explicit and Transparent LLM Apps
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere
Framework for Accelerating LLM Generation with Multiple Decoding Heads
A graphical manager for ollama that can manage your LLMs
Run 100B+ language models at home, BitTorrent-style
OpenFieldAI is an AI based Open Field Test Rodent Tracker
A computer vision framework to create and deploy apps in minutes
Implementation of "Tree of Thoughts
Toolbox of models, callbacks, and datasets for AI/ML researchers