Port of OpenAI's Whisper model in C/C++
Serve, optimize and scale PyTorch models in production
Connect home devices into a powerful cluster to accelerate LLM
A scalable inference server for models optimized with OpenVINO
Database system for building simpler and faster AI-powered application