Everything you need to build state-of-the-art foundation models
A toolkit to optimize Keras & TensorFlow ML models for deployment
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Uncover insights, surface problems, monitor, and fine-tune your LLM
Build your chatbot within minutes on your favorite device
Probabilistic reasoning and statistical analysis in TensorFlow
Run Local LLMs on Any Device. Open-source
Adversarial Robustness Toolbox (ART) - Python Library for ML security
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods
Trainable models and NN optimization tools
An MLOps framework to package, deploy, monitor and manage models
A high-throughput and memory-efficient inference and serving engine
Powering Amazon custom machine learning chips
A library for accelerating Transformer models on NVIDIA GPUs
GPU environment management and cluster orchestration
Training and deploying machine learning models on Amazon SageMaker
Operating LLMs in production
Multilingual Automatic Speech Recognition with word-level timestamps
Replace OpenAI GPT with another LLM in your app
A set of Docker images for training and serving models in TensorFlow
OpenAI-style API for open large language models
Phi-3.5 for Mac: Locally-run Vision and Language Models
A Unified Library for Parameter-Efficient Learning
State-of-the-art Parameter-Efficient Fine-Tuning
Python Package for ML-Based Heterogeneous Treatment Effects Estimation