Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Build your chatbot within minutes on your favorite device
Create HTML profiling reports from pandas DataFrame objects
Fast inference engine for Transformer models
State-of-the-art Parameter-Efficient Fine-Tuning
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Simplifies the local serving of AI models from any source
A toolkit to optimize ML models for deployment for Keras & TensorFlow
Unified Model Serving Framework
GPU environment management and cluster orchestration
Low-latency REST API for serving text-embeddings
A library for accelerating Transformer models on NVIDIA GPUs
LLM training code for MosaicML foundation models
An MLOps framework to package, deploy, monitor and manage models
A lightweight vision library for performing large object detection
Library for serving Transformers models on Amazon SageMaker
Tensor search for humans
AIMET is a library that provides advanced quantization and compression
Open-Source AI Camera. Empower any camera/CCTV
A GPU-accelerated library containing highly optimized building blocks
Deep learning optimization library: makes distributed training easy
OpenMLDB is an open-source machine learning database
Easy-to-use deep learning framework with 3 key features
PyTorch library of curated Transformer models and their components