Open platform for training, serving, and evaluating language models
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Tensor search for humans
Run 100B+ language models at home, BitTorrent-style
A toolkit to optimize Keras & TensorFlow ML models for deployment
High quality, fast, modular reference implementation of SSD in PyTorch
Create HTML profiling reports from pandas DataFrame objects
Library for serving Transformers models on Amazon SageMaker
Serve machine learning models within a Docker container
OpenMLDB is an open-source machine learning database
A GPU-accelerated library containing highly optimized building blocks
Implementation of "Tree of Thoughts"
Toolbox of models, callbacks, and datasets for AI/ML researchers
Implementation of model parallel autoregressive transformers on GPUs
Sequence-to-sequence framework, focused on Neural Machine Translation
A graphical manager for Ollama that can manage your LLMs
A computer vision framework to create and deploy apps in minutes
OpenMMLab Video Perception Toolbox
Training & Implementation of chatbots leveraging GPT-like architecture
Guide to deploying deep-learning inference networks
Toolkit for allowing inference and serving with MXNet in SageMaker
CPU/GPU inference server for Hugging Face transformer models
Deploy an ML inference service on a budget in 10 lines of code