server performance free download

Showing 71 open source projects for "server performance"

View related business solutions

Artificial Intelligence Windows Clear Filters & Widen Search

Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
Earn up to 16% annual interest with Nexo.
Access competitive interest rates on your digital assets.

Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.

Get started with Nexo.
1

Velocity server

The modern, next-generation Minecraft server proxy

...The software is designed with a focus on performance, scalability, and modern architecture, allowing it to handle thousands of simultaneous players efficiently. Velocity also includes a plugin API that allows developers to extend the proxy with custom functionality and integrate it with existing server tools. Compared with older proxy solutions, the project emphasizes improved performance, reliability, and cleaner software design.

Downloads: 3 This Week

Last Update: 2026-05-09
See Project
2

VikingDB MCP Server

A mcp server for vikingdb store and search

An MCP server that interfaces with VikingDB, a high-performance vector database developed by ByteDance, enabling efficient vector storage and search capabilities.

Downloads: 0 This Week

Last Update: 2025-04-07
See Project
3

Tiledesk Server

Tiledesk Server is the main API component of the Tiledesk platform

Tiledesk Server is the backend component of the Tiledesk platform, providing a comprehensive open-source live chat system with integrated chatbot capabilities for customer support and engagement.

Downloads: 0 This Week

Last Update: 2025-08-27
See Project
4

DB MCP Server

A powerful multi-database server implementing the MCP

The DB MCP Server is a powerful multi-database server implementing the Model Context Protocol (MCP) to provide AI assistants with structured access to databases. Built on the FreePeak/cortex framework, it enables execution of SQL queries, transaction management, schema exploration, and performance analysis across different database systems through a unified interface.

Downloads: 0 This Week

Last Update: 2026-04-11
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

MCP Server MySQL

A Model Context Protocol server

A Model Context Protocol server that provides access to MySQL databases, enabling Large Language Models to inspect database schemas and execute SQL queries, facilitating seamless database interactions.

Downloads: 1 This Week

Last Update: 2026-01-27
See Project
6

csghub-server

csghub-server is the backend server for CSGHub

csghub-server is the backend component of the CSGHub platform, an open-source infrastructure designed to manage and operate large language models, datasets, and AI development workflows within a private deployment environment. The server acts as a centralized management layer that allows teams to store, organize, and operate AI assets such as models, datasets, and machine learning applications in a manner similar to artifact repositories used in software engineering. Built primarily in the...

Downloads: 0 This Week

Last Update: 2026-05-06
See Project
7

Triton Inference Server

The Triton Inference Server provides an optimized cloud

...Triton delivers optimized performance for many query types, including real-time, batched, ensembles, and audio/video streaming. Provides Backend API that allows adding custom backends and pre/post-processing operations. Model pipelines using Ensembling or Business Logic Scripting (BLS). HTTP/REST and GRPC inference protocols based on the community-developed KServe protocol.

Downloads: 1 This Week

Last Update: 2026-04-28
See Project
8

Browserbase MCP Server

Allow LLMs to control a browser with Browserbase and Stagehand

Browserbase MCP Server is a server implementation of the Model Context Protocol (MCP) that enables large language models to interact with web browsers programmatically through cloud-based automation. The project provides a standardized interface for connecting AI systems to real-world web environments, allowing them to navigate pages, extract structured data, and perform user-like actions such as clicking, typing, and form submission.

Downloads: 0 This Week

Last Update: 2026-03-31
See Project
9

OpenVINO Model Server

A scalable inference server for models optimized with OpenVINO

OpenVINO™ Model Server is a high-performance inference serving system designed to host and serve machine learning models that have been optimized with the OpenVINO toolkit. It’s implemented in C++ for scalability and efficiency, making it suitable for both edge and cloud deployments where inference workloads must be reliable and high throughput. The server exposes model inference via standard network protocols like REST and gRPC, allowing any client that speaks those protocols to request predictions remotely, abstracting away the complexity of where and how the model runs. ...

Downloads: 0 This Week

Last Update: 2026-04-08
See Project
Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
10

Google Workspace MCP Server

Control Gmail, Google Calendar, Docs, Sheets, Slides, Chat, Forms

Google Workspace MCP is an open-source server that connects AI assistants to Google Workspace services through the Model Context Protocol (MCP), allowing large language models to interact directly with productivity tools. The project exposes a wide set of Google services including Gmail, Google Drive, Docs, Sheets, Slides, Calendar, Chat, and other Workspace components as structured tools that an AI system can call programmatically. By acting as a bridge between AI clients and the Google...

Downloads: 0 This Week

Last Update: 5 days ago
See Project
11

Lemonade

Lemonade helps users run local LLMs with the highest performance

Lemonade is a local LLM runtime that aims to deliver the highest possible performance on your own hardware by auto-configuring state-of-the-art inference engines for both NPUs and GPUs. The project positions itself as a “local LLM server” you can run on laptops and workstations, abstracting away backend differences while giving you a single place to serve and manage models. Its README emphasizes real-world adoption across startups, research groups, and large companies, signaling a focus on practical deployments rather than toy demos. ...

Downloads: 8 This Week

Last Update: 1 day ago
See Project
12

Nestia

NestJS Helper + AI Chatbot Development

Nestia is a high-performance toolkit and ecosystem built on top of NestJS that enhances backend development by introducing strongly typed APIs, automated SDK generation, and advanced tooling for scalable server applications. It is designed to eliminate much of the boilerplate typically associated with API development by leveraging pure TypeScript types to automatically generate validation logic, API documentation, and client SDKs.

Downloads: 1 This Week

Last Update: 2026-05-13
See Project
13

Text Embeddings Inference

High-performance inference server for text embeddings models API layer

Text Embeddings Inference is a high-performance server designed to serve text embedding models efficiently in production environments. It focuses on delivering fast and scalable embedding generation by leveraging optimized inference techniques and modern hardware acceleration. It is built to support transformer-based embedding models, making it suitable for tasks such as semantic search, clustering, and retrieval-augmented systems.

Downloads: 2 This Week

Last Update: 2026-03-23
See Project
14

Chrome DevTools MCP

Chrome DevTools for coding agents

...The repository spells out environment requirements and cautions that exposing a live browser to agents grants powerful access, so sensitive data should be handled carefully. Beyond static inspection, it exposes operational tools like starting a performance trace that an agent can later analyze to propose optimizations. The server is intended to slot into MCP-capable assistants and IDEs, giving them reliable, typed tools and resource endpoints rather than ad-hoc automation. Documentation from the Chrome team explains how the server augments agents with real debugging capabilities.

Downloads: 6 This Week

Last Update: 4 days ago
See Project
15

mistral.rs

Fast, flexible LLM inference

mistral.rs is a fast and flexible LLM inference engine implemented in Rust, designed to run and serve modern language models with an emphasis on performance and practical deployment. It provides multiple entry points for developers, including a CLI for running models locally and an HTTP server that exposes an OpenAI-compatible API surface for easy integration with existing clients. The project includes hardware-aware tooling that can benchmark a system and choose sensible quantization and device-mapping strategies, helping users get strong performance without manual tuning. ...

Downloads: 1 This Week

Last Update: 2026-04-02
See Project
16

Shadcn UI v4 MCP Server

A mcp server to allow LLMS gain context about shadcn ui component

...The server supports multiple frontend frameworks including React, Svelte, Vue, and React Native, making it highly versatile for cross-platform development. It includes smart caching and efficient GitHub API usage to optimize performance and handle rate limits during component retrieval. The system also supports multiple transport modes such as standard input/output and Server-Sent Events, enabling both local and distributed deployments.

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
17

OpenJarvis

Personal AI, On Personal Devices

OpenJarvis is an open-source framework designed to build personal AI agents that run primarily on local devices rather than relying on cloud infrastructure. Developed as part of the Intelligence Per Watt research initiative, it focuses on improving the efficiency and practicality of on-device AI systems. The framework provides shared primitives for building local-first agents, along with evaluation tools that measure performance using metrics such as energy consumption, latency, cost, and...

Downloads: 248 This Week

Last Update: 6 days ago
See Project
18

Zed

High-performance, multiplayer code editor from the creators of Atom

Zed is a next-generation code editor designed for high-performance collaboration with humans and AI. Written from scratch in Rust to efficiently leverage multiple CPU cores and your GPU. Integrate upcoming LLMs into your workflow to generate, transform, and analyze code. Chat with teammates, write notes together, and share your screen and project. Multibuffers compose excerpts from across the codebase in one editable surface.

Downloads: 47 This Week

Last Update: 23 hours ago
See Project
19

MongoDB Lens

MongoDB Lens: Full Featured MCP Server for MongoDB Databases

MongoDB Lens is a local Model Context Protocol (MCP) server offering full-featured access to MongoDB databases using natural language via large language models (LLMs). It enables users to perform queries, run aggregations, optimize performance, and more through conversational interactions.

Downloads: 0 This Week

Last Update: 2025-04-23
See Project
20

DramaBox

super expressive prompting model based on ltx2.3

...It generates speech from prompts that control not only the spoken text, but also speaker identity, emotion, delivery style, laughs, sighs, pauses, and transitions. Users can optionally provide a voice reference of around 10 seconds or more to clone the target timbre while still guiding performance through scene-style prompting. The project includes a warm inference server, a CLI workflow, and a Gradio app for interactive generation. It also supports additional LoRA training on top of DramaBox, making it possible to adapt the model for a specific speaker, language flavor, or performance style. DramaBox is aimed at developers, researchers, and audio creators who need highly expressive English TTS for character dialogue, narrative audio, prototyping, or voice experimentation.

Downloads: 5 This Week

Last Update: 2026-05-14
See Project
21

Angel

A Flexible and Powerful Parameter Server for large-scale ML

Angel is a high-performance distributed machine learning and graph computing platform based on the philosophy of Parameter Server. It is tuned for performance with big data from Tencent and has a wide range of applicability and stability, demonstrating an increasing advantage in handling higher-dimension models. Angel is jointly developed by Tencent and Peking University, taking account of both high availability in industry and innovation in academia.

Downloads: 0 This Week

Last Update: 2025-09-03
See Project
22

FastSD CPU

Fast stable diffusion on CPU and AI PC

...The repository contains multiple interfaces including a desktop GUI for simple generation, an advanced web-based UI with support for extensions like LoRA and ControlNet, and a command-line interface for scripted usage or server deployments. With support for performance-oriented libraries such as OpenVINO and hardware acceleration on platforms like Intel AI PCs, FastSD CPU aims to shrink generation times dramatically compared with naive CPU implementations.

Downloads: 35 This Week

Last Update: 2026-05-02
See Project
23

Text Generation Inference

Large Language Model Text Generation Inference

Text Generation Inference is a high-performance inference server for text generation models, optimized for Hugging Face's Transformers. It is designed to serve large language models efficiently with optimizations for performance and scalability.

Downloads: 0 This Week

Last Update: 2025-12-18
See Project
24

Fantasy PL MCP

Fantasy Premier League MCP Server

Fantasy Premier League MCP Server is a Model Context Protocol (MCP) server that provides access to Fantasy Premier League (FPL) data and tools. It allows interaction with FPL data in MCP-compatible clients, enabling users to manage their fantasy teams effectively.

Downloads: 0 This Week

Last Update: 2025-07-31
See Project
25

ds4.c

DeepSeek 4 Flash local inference engine for Metal

...Unlike general-purpose inference runtimes, the project is intentionally optimized for a specific model family, enabling highly efficient execution and simplified architecture. The engine includes DS4-specific model loading, KV cache management, prompt rendering, and OpenAI-compatible server APIs for local deployment workflows. Built as a native low-level implementation, it focuses on performance, reduced abstraction overhead, and direct integration with Apple GPU acceleration through Metal compute graphs. The project also supports streaming inference behavior and local API serving for integration with external tools and AI applications. ...

Downloads: 2 This Week

Last Update: 2026-05-15
See Project