PyTorch library of curated Transformer models and their components
Port of OpenAI's Whisper model in C/C++
Easiest and laziest way to build multi-agent LLM applications
Multilingual Automatic Speech Recognition with word-level timestamps
Tensor search for humans
Easy-to-use deep learning framework with 3 key features
PyTorch domain library for recommendation systems
A toolkit to optimize Keras and TensorFlow ML models for deployment
Unified Model Serving Framework
A set of Docker images for training and serving models in TensorFlow
Superduper: Integrate AI models and machine learning workflows
20+ high-performance LLMs with recipes to pretrain and finetune at scale
GPU environment management and cluster orchestration
Lightweight Python library for adding real-time multi-object tracking
Fast inference engine for Transformer models
Low-latency REST API for serving text-embeddings
A library for accelerating Transformer models on NVIDIA GPUs
An MLOps framework to package, deploy, monitor and manage models
Create HTML profiling reports from pandas DataFrame objects
Open platform for training, serving, and evaluating language models
Unofficial Go (Golang) bindings for the Hugging Face Inference API
A GPU-accelerated library containing highly optimized building blocks
High-performance neural network inference framework for mobile
MII makes low-latency and high-throughput inference possible
Self-hosted, community-driven, local OpenAI-compatible API