INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
An MLOps framework to package, deploy, monitor and manage models
Fast inference engine for Transformer models
State-of-the-art Parameter-Efficient Fine-Tuning
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Simplifies the local serving of AI models from any source
A toolkit to optimize ML models for deployment for Keras & TensorFlow
Unified Model Serving Framework
AIMET is a library that provides advanced quantization and compression
Low-latency REST API for serving text-embeddings
A library for accelerating Transformer models on NVIDIA GPUs
LLM training code for MosaicML foundation models
A lightweight vision library for performing large object detection
Create HTML profiling reports from pandas DataFrame objects
Library for serving Transformers models on Amazon SageMaker
GPU environment management and cluster orchestration
Open-Source AI Camera. Empower any camera/CCTV
Tensor search for humans
A GPU-accelerated library containing highly optimized building blocks
Deep learning optimization library: makes distributed training easy
OpenMLDB is an open-source machine learning database
Easy-to-use deep learning framework with 3 key features
PyTorch library of curated Transformer models and their components
A library to communicate with ChatGPT, Claude, Copilot, Gemini
A graphical manager for ollama that can manage your LLMs