Adversarial Robustness Toolbox (ART) - Python Library for ML security
The Triton Inference Server provides an optimized cloud
Run Local LLMs on Any Device. Open-source
Data manipulation and transformation for audio signal processing
Everything you need to build state-of-the-art foundation models
A high-throughput and memory-efficient inference and serving engine
Standardized Serverless ML Inference Platform on Kubernetes
The official Python client for the Huggingface Hub
Training and deploying machine learning models on Amazon SageMaker
Easiest and laziest way for building multi-agent LLMs applications
Replace OpenAI GPT with another LLM in your app
State-of-the-art diffusion models for image and audio generation
A set of Docker images for training and serving models in TensorFlow
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Gaussian processes in TensorFlow
Optimizing inference proxy for LLMs
A Pythonic framework to simplify AI service building
Phi-3.5 for Mac: Locally-run Vision and Language Models
Operating LLMs in production
Uncover insights, surface problems, monitor, and fine tune your LLM
Powering Amazon custom machine learning chips
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
FlashInfer: Kernel Library for LLM Serving
An MLOps framework to package, deploy, monitor and manage models
Lightweight Python library for adding real-time multi-object tracking