A GPU-accelerated library containing highly optimized building blocks
Standardized Serverless ML Inference Platform on Kubernetes
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Data manipulation and transformation for audio signal processing
Openai style api for open large language models
Large Language Model Text Generation Inference
OpenVINO™ Toolkit repository
Unified Model Serving Framework
Easy-to-use deep learning framework with 3 key features
A unified framework for scalable computing
Database system for building simpler and faster AI-powered application
Toolkit for allowing inference and serving with MXNet in SageMaker