Run Local LLMs on Any Device. Open-source
Data manipulation and transformation for audio signal processing
Everything you need to build state-of-the-art foundation models
A high-throughput and memory-efficient inference and serving engine
Standardized Serverless ML Inference Platform on Kubernetes
The official Python client for the Huggingface Hub
State-of-the-art diffusion models for image and audio generation
Training and deploying machine learning models on Amazon SageMaker
Replace OpenAI GPT with another LLM in your app
Easiest and laziest way for building multi-agent LLMs applications
A Pythonic framework to simplify AI service building
Deep learning optimization library: makes distributed training easy
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Gaussian processes in TensorFlow
Optimizing inference proxy for LLMs
Phi-3.5 for Mac: Locally-run Vision and Language Models
Operating LLMs in production
Uncover insights, surface problems, monitor, and fine tune your LLM
Powering Amazon custom machine learning chips
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
FlashInfer: Kernel Library for LLM Serving
An MLOps framework to package, deploy, monitor and manage models
Lightweight Python library for adding real-time multi-object tracking
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
AIMET is a library that provides advanced quantization and compression