Deep Learning API and Server in C++14 with support for Caffe, PyTorch
Everything you need to build state-of-the-art foundation models
An RWKV management and startup tool, fully automated, only 8 MB
Uncover insights, surface problems, monitor, and fine-tune your LLM
Unified Model Serving Framework
A set of Docker images for training and serving models in TensorFlow
Private OpenAI on Kubernetes
Build Production-ready Agentic Workflows with Natural Language
A lightweight vision library for performing large object detection
Single-cell analysis in Python
Bayesian inference with probabilistic programming
Trainable models and NN optimization tools
Easiest and laziest way to build multi-agent LLM applications
A unified framework for scalable computing
Bolt is a deep learning library with high performance
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
Replace OpenAI GPT with another LLM in your app
Framework that is dedicated to making neural data processing
MII makes low-latency and high-throughput inference possible
Serving system for machine learning models
Optimizing inference proxy for LLMs
Neural Network Compression Framework for enhanced OpenVINO
Efficient few-shot learning with Sentence Transformers
Large Language Model Text Generation Inference
On-device AI across mobile, embedded and edge for PyTorch