Easy-to-use Speech Toolkit including Self-Supervised Learning model
A library for accelerating Transformer models on NVIDIA GPUs
Lightweight Python library for adding real-time multi-object tracking
A set of Docker images for training and serving models in TensorFlow
AIMET is a library that provides advanced quantization and compression
Open-source tool designed to enhance the efficiency of workloads
Simplifies the local serving of AI models from any source
LLM training code for MosaicML foundation models
Optimizing inference proxy for LLMs
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Neural Network Compression Framework for enhanced OpenVINO
OpenAI-style API for open large language models
Probabilistic reasoning and statistical analysis in TensorFlow
Libraries for applying sparsification recipes to neural networks
Single-cell analysis in Python
Sparsity-aware deep learning inference runtime for CPUs
Large Language Model Text Generation Inference
Efficient few-shot learning with Sentence Transformers
Superduper: Integrate AI models and machine learning workflows
MII makes low-latency and high-throughput inference possible
Official inference library for Mistral models
20+ high-performance LLMs with recipes to pretrain and finetune at scale
Adversarial Robustness Toolbox (ART) - Python Library for ML security
A lightweight vision library for performing large object detection
Library for serving Transformers models on Amazon SageMaker