Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
A unified framework for scalable computing
Easiest and laziest way for building multi-agent LLMs applications
Lightweight Python library for adding real-time multi-object tracking
Bring the notion of Model-as-a-Service to life
Pytorch domain library for recommendation systems
Large Language Model Text Generation Inference
Phi-3.5 for Mac: Locally-run Vision and Language Models
Multilingual Automatic Speech Recognition with word-level timestamps
Open platform for training, serving, and evaluating language models
MII makes low-latency and high-throughput inference possible
High quality, fast, modular reference implementation of SSD in PyTorch
A computer vision framework to create and deploy apps in minutes
Serve machine learning models within a Docker container
Lightweight anchor-free object detection model