A scalable inference server for models optimized with OpenVINO
Fast LLM speculative inference server for consumer hardware
LLM inference in C/C++
High-speed Large Language Model Serving for Local Deployment
Optical-packet node transceiver frequency allocation
Tool to remotely activate Text-To-Speech (TTS) on a server