A toolkit to optimize ML models for deployment for Keras & TensorFlow
Trainable models and NN optimization tools
Neural Network Compression Framework for enhanced OpenVINO
A high-performance inference system for large language models
OpenVINO™ Toolkit repository
Open standard for machine learning interoperability
Framework which allows you transform your Vector Database
Libraries for applying sparsification recipes to neural networks
Fast inference engine for Transformer models
Optimizing inference proxy for LLMs
Deep learning optimization library: makes distributed training easy
Connect home devices into a powerful cluster to accelerate LLM
An Open-Source Programming Framework for Agentic AI
Bolt is a deep learning library with high performance
Uplift modeling and causal inference with machine learning algorithms
A high-performance ML model serving framework, offers dynamic batching
A unified framework for scalable computing
Build your chatbot within minutes on your favorite device
Database system for building simpler and faster AI-powered application
CPU/GPU inference server for Hugging Face transformer models
Deep learning inference framework optimized for mobile platforms
Uniform deep learning inference framework for mobile
Fast and user-friendly runtime for transformer inference