A toolkit to optimize ML models for deployment for Keras & TensorFlow
Trainable models and NN optimization tools
Neural Network Compression Framework for enhanced OpenVINO
Deep learning optimization library: makes distributed training easy
Libraries for applying sparsification recipes to neural networks
Optimizing inference proxy for LLMs
Uplift modeling and causal inference with machine learning algorithms
A high-performance ML model serving framework, offers dynamic batching
A unified framework for scalable computing
Build your chatbot within minutes on your favorite device
Database system for building simpler and faster AI-powered application
CPU/GPU inference server for Hugging Face transformer models