server performance free download

VikingDB MCP Server

A mcp server for vikingdb store and search

An MCP server that interfaces with VikingDB, a high-performance vector database developed by ByteDance, enabling efficient vector storage and search capabilities.

Downloads: 0 This Week

Last Update: 2025-04-07

See Project

Triton Inference Server

The Triton Inference Server provides an optimized cloud

...Triton delivers optimized performance for many query types, including real-time, batched, ensembles, and audio/video streaming. Provides Backend API that allows adding custom backends and pre/post-processing operations. Model pipelines using Ensembling or Business Logic Scripting (BLS). HTTP/REST and GRPC inference protocols based on the community-developed KServe protocol.

Downloads: 1 This Week

Last Update: 2026-04-28

See Project

Google Workspace MCP Server

Control Gmail, Google Calendar, Docs, Sheets, Slides, Chat, Forms

Google Workspace MCP is an open-source server that connects AI assistants to Google Workspace services through the Model Context Protocol (MCP), allowing large language models to interact directly with productivity tools. The project exposes a wide set of Google services including Gmail, Google Drive, Docs, Sheets, Slides, Calendar, Chat, and other Workspace components as structured tools that an AI system can call programmatically. By acting as a bridge between AI clients and the Google...

Downloads: 0 This Week

Last Update: 6 days ago

See Project

Lemonade

Lemonade helps users run local LLMs with the highest performance

Lemonade is a local LLM runtime that aims to deliver the highest possible performance on your own hardware by auto-configuring state-of-the-art inference engines for both NPUs and GPUs. The project positions itself as a “local LLM server” you can run on laptops and workstations, abstracting away backend differences while giving you a single place to serve and manage models. Its README emphasizes real-world adoption across startups, research groups, and large companies, signaling a focus on practical deployments rather than toy demos. ...

Downloads: 8 This Week

Last Update: 1 day ago

See Project

Text Embeddings Inference

High-performance inference server for text embeddings models API layer

Text Embeddings Inference is a high-performance server designed to serve text embedding models efficiently in production environments. It focuses on delivering fast and scalable embedding generation by leveraging optimized inference techniques and modern hardware acceleration. It is built to support transformer-based embedding models, making it suitable for tasks such as semantic search, clustering, and retrieval-augmented systems.

Downloads: 2 This Week

Last Update: 2026-03-23

See Project

FastHX

FastAPI server-side rendering with built-in HTMX support.

FastHX is a high-performance HTTP and WebSocket server framework designed for Haxe, enabling fast and scalable web application development.

Downloads: 0 This Week

Last Update: 2025-12-31

See Project

pg_activity

pg_activity is a top like application for PostgreSQL server activity

Command line tool for PostgreSQL server activity monitoring. pg_activity is a PostgreSQL monitoring tool that provides real-time insights into database performance, helping database administrators manage and troubleshoot PostgreSQL instances.

Downloads: 0 This Week

Last Update: 2025-06-03

See Project

Agent360

360 monitoring agent

360 Monitoring is a web service that monitors and displays statistics of your server performance. Agent360 is OS-agnostic software compatible with Python 3.7 and 3.8. It's been optimized to have low CPU consumption and comes with an extendable set of useful plugins.

Downloads: 0 This Week

Last Update: 2026-02-02

See Project

OpenJarvis

Personal AI, On Personal Devices

OpenJarvis is an open-source framework designed to build personal AI agents that run primarily on local devices rather than relying on cloud infrastructure. Developed as part of the Intelligence Per Watt research initiative, it focuses on improving the efficiency and practicality of on-device AI systems. The framework provides shared primitives for building local-first agents, along with evaluation tools that measure performance using metrics such as energy consumption, latency, cost, and...

Downloads: 248 This Week

Last Update: 6 days ago

See Project

waitress

A WSGI server for Python 3

Waitress is meant to be a production-quality pure-Python WSGI server with very acceptable performance. It has no dependencies except ones which live in the Python standard library. It runs on CPython on Unix and Windows under Python 3.7+. It is also known to run on PyPy 3 (python version 3.7+) on UNIX. It supports HTTP/1.0 and HTTP/1.1. Waitress now validates that chunked encoding extensions are valid, and don't contain invalid characters that are not allowed.

Downloads: 1 This Week

Last Update: 2024-11-16

See Project

FastSD CPU

Fast stable diffusion on CPU and AI PC

...The repository contains multiple interfaces including a desktop GUI for simple generation, an advanced web-based UI with support for extensions like LoRA and ControlNet, and a command-line interface for scripted usage or server deployments. With support for performance-oriented libraries such as OpenVINO and hardware acceleration on platforms like Intel AI PCs, FastSD CPU aims to shrink generation times dramatically compared with naive CPU implementations.

Downloads: 35 This Week

Last Update: 2026-05-02

See Project

socketify.py

Bringing Http/Https and WebSockets High Performance servers for PyPy3

Socketify.py is a reliable, high-performance Python web framework for building large-scale app backends and microservices. With no precedents websocket performance and a really fast HTTP server that can delivery encrypted TLS 1.3 quicker than most alternative servers can do even unencrypted, cleartext messaging.

Downloads: 0 This Week

Last Update: 2024-10-29

See Project

ty

An extremely fast Python type checker and language server

ty is an extremely fast Python type checker and language server built in Rust, designed to provide highly responsive and accurate static analysis for modern Python development workflows. It is positioned as a next-generation alternative to tools such as mypy and Pyright, offering significantly faster performance through incremental analysis and optimized execution. The tool is designed from the ground up to power editor integrations, enabling real-time feedback as developers write code with minimal latency. ty includes advanced type system capabilities such as intersection types, improved type inference, and detailed diagnostics that help identify issues even in partially typed codebases. ...

Downloads: 0 This Week

Last Update: 12 hours ago

See Project

Text Generation Inference

Large Language Model Text Generation Inference

Text Generation Inference is a high-performance inference server for text generation models, optimized for Hugging Face's Transformers. It is designed to serve large language models efficiently with optimizations for performance and scalability.

Downloads: 0 This Week

Last Update: 2025-12-18

See Project

Fantasy PL MCP

Fantasy Premier League MCP Server

Fantasy Premier League MCP Server is a Model Context Protocol (MCP) server that provides access to Fantasy Premier League (FPL) data and tools. It allows interaction with FPL data in MCP-compatible clients, enabling users to manage their fantasy teams effectively.

Downloads: 0 This Week

Last Update: 2025-07-31

See Project

Scrapling

An adaptive Web Scraping framework

...The framework includes advanced fetchers capable of bypassing anti-bot protections such as Cloudflare Turnstile using stealth and browser automation techniques. Its powerful spider system supports multi-session crawling, pause and resume functionality, and real-time streaming of scraped data. Scrapling combines high performance, memory efficiency, and extensive async support to deliver blazing-fast scraping workflows. With a developer-friendly API, CLI tools, MCP server integration for AI-assisted extraction, and Docker support, it offers a complete solution for modern web scrapers.

Downloads: 6 This Week

Last Update: 2026-05-11

See Project

Maltrail

Malicious traffic detection system

Maltrail is a malicious traffic detection system, utilizing publicly available (black)lists containing malicious and/or generally suspicious trails, along with static trails compiled from various AV reports and custom user-defined lists, where trail can be anything from domain name, URL, IP address (e.g. 185.130.5.231 for the known attacker) or HTTP User-Agent header value (e.g. sqlmap for automatic SQL injection and database takeover tool). Also, it uses (optional) advanced heuristic...

Downloads: 3 This Week

Last Update: 2026-04-30

See Project

Sanic

Async Python 3.6+ web server/framework

Build fast, run fast with Sanic! Sanic is a Python 3.6+ web server and web framework designed to go fast. It provides a way to get a highly performant HTTP server up and running fast, while also making it easy to build, expand, and eventually scale. Sanic aspires to be as simple as possible while delivering the performance that you require. It allows the usage of the async/await syntax added in Python 3.5, so your code is guaranteed to be non-blocking and speedy. ...

Downloads: 1 This Week

Last Update: 2025-12-31

See Project

asyncpg

A fast PostgreSQL Database Client Library for Python/asyncio

asyncpg is a high-performance PostgreSQL client library designed for Python's asyncio framework. It offers a clean and efficient implementation of the PostgreSQL server binary protocol, enabling developers to execute database operations asynchronously. This approach allows for scalable and responsive applications that can handle numerous concurrent database connections.

Downloads: 0 This Week

Last Update: 2025-11-24

See Project

django-health-check

a pluggable app that runs a full check on the deployment

The primary intended use case is to monitor conditions via HTTP(S), with responses available in HTML and JSON formats. When you get back a response that includes one or more problems, you can then decide the appropriate course of action, which could include generating notifications and/or automating the replacement of a failing node with a new one. If you are monitoring health in a high-availability environment with a load balancer that returns responses from multiple nodes, please note that...

Downloads: 0 This Week

Last Update: 2026-05-15

See Project

NVIDIA cuOpt

GPU accelerated decision optimization

...Built primarily in C++, cuOpt leverages NVIDIA GPUs to deliver near real-time solutions for optimization tasks involving millions of variables and constraints. The platform provides multiple interfaces, including C, Python, and server APIs, allowing developers to integrate optimization capabilities into applications and services. cuOpt is designed for high-performance environments and can be deployed across cloud, hybrid, or on-premise infrastructures. By combining GPU acceleration with scalable APIs, cuOpt enables organizations to solve large optimization challenges in logistics, operations research, and decision-making systems.

Downloads: 0 This Week

Last Update: 2026-04-09

See Project

Chandra

OCR model for complex documents with layout-aware structured outputs

...It is capable of handling over 40 languages and is optimized to read difficult inputs such as messy handwriting and multi-column layouts. Chandra can be run locally using transformer-based inference or deployed with a high-performance server setup for large-scale processing. It also includes command-line tools and optional web-based interfaces to simplify interaction and batch processing workflows.

Downloads: 3 This Week

Last Update: 2026-03-18

See Project

TensorRT Node for ComfyUI

Enables the best performance on NVIDIA RTX Graphics Cards

ComfyUI_TensorRT is an extension that lets ComfyUI run AI inference through NVIDIA’s TensorRT, aiming to get faster, more efficient execution on supported GPUs. It bridges the gap between ComfyUI’s flexible, node-based workflows and TensorRT’s highly optimized engine format. The result is that complex diffusion or image-processing graphs can be accelerated without the user having to rewrite the pipeline. The repo typically includes instructions for converting models to TensorRT engines and...

Downloads: 1 This Week

Last Update: 2025-10-30

See Project

proxy.py

Utilize all available CPU cores for accepting new client connections

proxy.py is made with performance in mind. By default, proxy.py will try to utilize all available CPU cores to it for accepting new client connections. This is achieved by starting AcceptorPool which listens on configured server port. Then, AcceptorPool starts Acceptor processes (--num-acceptors) to accept incoming client connections. Alongside, if --threadless is enabled, ThreadlessPool is setup which starts Threadless processes (--num-workers) to handle the incoming client connections. ...

Downloads: 0 This Week

Last Update: 2025-02-18

See Project

Arcade AI

Arcade Tool Development Kit (TDK), Worker, Evals, and CLI

...This repository contains the core Arcade libraries, organized as separate packages for maximum flexibility and modularity. Evaluation framework for testing tool performance. Test your MCP server's tools, resources, prompts, elicitation, and OAuth 2. MCPJam is compliant with the latest MCP specs. Connect to any MCP server. MCPJam inspector supports STDIO, SSE, and Streamable HTTP transports.

Downloads: 0 This Week

Last Update: 1 day ago

See Project

Search Results for "server performance"

Showing 95 open source projects for "server performance"

VikingDB MCP Server

Triton Inference Server

Google Workspace MCP Server

Lemonade

Text Embeddings Inference

FastHX

pg_activity

Agent360

OpenJarvis

waitress

FastSD CPU

socketify.py

ty

Text Generation Inference

Fantasy PL MCP

Scrapling

Maltrail

Sanic

asyncpg

django-health-check

NVIDIA cuOpt

Chandra

TensorRT Node for ComfyUI

proxy.py

Arcade AI

Search Results for "server performance"

Showing 95 open source projects for "server performance"

VikingDB MCP Server

Triton Inference Server

Google Workspace MCP Server

Lemonade

Text Embeddings Inference

FastHX

pg_activity

Agent360

OpenJarvis

waitress

FastSD CPU

socketify.py

ty

Text Generation Inference

Fantasy PL MCP

Scrapling

Maltrail

Sanic

asyncpg

django-health-check

NVIDIA cuOpt

Chandra

TensorRT Node for ComfyUI

proxy.py

Arcade AI

Related Searches

Related Categories