Multilingual Automatic Speech Recognition with word-level timestamps
Run 100B+ language models at home, BitTorrent-style
Library for OCR-related tasks powered by Deep Learning
Uplift modeling and causal inference with machine learning algorithms
A unified framework for scalable computing
Standardized Serverless ML Inference Platform on Kubernetes
State-of-the-art diffusion models for image and audio generation
Deep learning optimization library: makes distributed training easy
Official inference library for Mistral models
State-of-the-art Parameter-Efficient Fine-Tuning
MII makes low-latency and high-throughput inference possible
Bring the notion of Model-as-a-Service to life
A lightweight vision library for performing large object detection
LLMFlows - Simple, Explicit and Transparent LLM Apps
20+ high-performance LLMs with recipes to pretrain, finetune at scale
A high-performance ML model serving framework, offers dynamic batching
Replace OpenAI GPT with another LLM in your app
LLM training code for MosaicML foundation models
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
An easy-to-use LLMs quantization package with user-friendly apis
Pytorch domain library for recommendation systems
Trainable models and NN optimization tools
Tensor search for humans
A set of Docker images for training and serving models in TensorFlow
The official Python client for the Huggingface Hub