support system java free download

TorchServe

Serve, optimize and scale PyTorch models in production

...Out-of-the-box support for system-level metrics with Prometheus exports, custom metrics, and PyTorch profiler support.

Downloads: 1 This Week

Last Update: 2024-09-30


whisper.cpp

whisper.cpp is a lightweight C/C++ reimplementation of OpenAI’s Whisper automatic speech recognition (ASR) model, designed for efficient, standalone transcription without external dependencies. The entire high-level implementation of the model is contained in whisper.h and whisper.cpp. The rest of the code is part of the ggml machine learning library. The command downloads the base.en model converted to custom ggml format and runs the inference on all .wav samples in the folder samples....

Downloads: 365 This Week

Last Update: 2026-03-19


Distributed Llama

Connect home devices into a powerful cluster to accelerate LLM inference

Distributed Llama is an open-source project that enables users to connect multiple home devices into a powerful cluster to accelerate Large Language Model (LLM) inference. By leveraging tensor parallelism and high-speed synchronization over Ethernet, it allows for faster performance as more devices are added to the cluster. The system supports various operating systems, including Linux, macOS, and Windows, and is optimized for both ARM and x86_64 AVX2 CPUs.

Downloads: 3 This Week

Last Update: 2026-02-02


ScaleLLM

A high-performance inference system for large language models

ScaleLLM is a high-performance inference system tailored for Large Language Models (LLMs), specifically designed for production environments. It focuses on optimizing inference processes to handle large-scale deployments efficiently, ensuring low latency and high throughput. ScaleLLM supports various LLM architectures and integrates with existing infrastructures, providing a scalable solution for deploying LLMs in real-world applications.

Downloads: 0 This Week

Last Update: 2025-09-13


OpenVINO Model Server

A scalable inference server for models optimized with OpenVINO

OpenVINO™ Model Server is a high-performance inference serving system designed to host and serve machine learning models that have been optimized with the OpenVINO toolkit. It’s implemented in C++ for scalability and efficiency, making it suitable for both edge and cloud deployments where inference workloads must be reliable and high-throughput. The server exposes model inference via standard network protocols like REST and gRPC, allowing any client that speaks those protocols to request...

Downloads: 2 This Week

Last Update: 2026-04-08


EvaDB

Database system for building simpler and faster AI-powered applications

Over the last decade, AI models have radically changed the world of natural language processing and computer vision. They are accurate on various tasks ranging from question answering to object tracking in videos. To use an AI model, the user needs to program against multiple low-level libraries, like PyTorch, Hugging Face, OpenAI, etc. This tedious process often leads to a complex AI app that glues together these libraries to accomplish the given task. This programming complexity prevents...

Downloads: 3 This Week

Last Update: 2023-11-19


Search Results for "support system java"

Showing 6 open source projects for "support system java"

TorchServe

whisper.cpp

Distributed Llama

ScaleLLM

OpenVINO Model Server

EvaDB

