The AI-native (edge and LLM) proxy for agents
On-device AI across mobile, embedded and edge for PyTorch
Open-Source AI Camera. Empower any camera/CCTV
OpenVINO™ Toolkit repository
Replace OpenAI GPT with another LLM in your app
A toolkit to optimize Keras & TensorFlow ML models for deployment
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
The Triton Inference Server provides an optimized cloud
Standardized Serverless ML Inference Platform on Kubernetes
A scalable inference server for models optimized with OpenVINO
Sparsity-aware deep learning inference runtime for CPUs
A Pythonic framework to simplify AI service building
Run local LLMs like Llama, DeepSeek, Kokoro, etc. inside your browser
Easy-to-use Speech Toolkit including Self-Supervised Learning model
AIMET is a library that provides advanced quantization and compression
Tensor search for humans
A unified framework for scalable computing
Images to inference with no labeling
A computer vision framework to create and deploy apps in minutes