Training and deploying machine learning models on Amazon SageMaker
Port of Facebook's LLaMA model in C/C++
Run Local LLMs on Any Device. Open-source
Powering Amazon custom machine learning chips
A high-throughput and memory-efficient inference and serving engine
Easy-to-use deep learning framework with 3 key features
A set of Docker images for training and serving models in TensorFlow
Create HTML profiling reports from pandas DataFrame objects
The official Python client for the Hugging Face Hub
Optimizing inference proxy for LLMs
Fast inference engine for Transformer models
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Multilingual Automatic Speech Recognition with word-level timestamps
Uncover insights, surface problems, monitor, and fine-tune your LLM
State-of-the-art diffusion models for image and audio generation
OpenMMLab Model Deployment Framework
Unified Model Serving Framework
Uplift modeling and causal inference with machine learning algorithms
Deep learning optimization library: makes distributed training easy
A unified framework for scalable computing
Easiest and laziest way to build multi-agent LLM applications
Replace OpenAI GPT with another LLM in your app
Adversarial Robustness Toolbox (ART) - Python Library for ML security
Integrate, train and manage any AI models and APIs with your database
Database system for building simpler and faster AI-powered applications