web server local free download

147 projects for "web server local" with 2 filters applied:

Artificial Intelligence BSD Clear Filters & Widen Search

$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
1

WhatsApp MCP Server

WhatsApp MCP server enabling AI access to chats and messaging

whatsapp-mcp is an open source Model Context Protocol (MCP) server that enables AI agents to interact directly with a user’s WhatsApp account through a structured interface. It acts as a bridge between WhatsApp and large language models, allowing controlled access to messages, chats, and contacts. whatsapp-mcp is composed of two main components: a Go-based bridge that connects to the WhatsApp Web API and stores data locally, and a Python-based MCP server that exposes tools for AI interaction. ...

Downloads: 1 This Week

Last Update: 2026-03-17
See Project
2

Markdownify MCP Server

Convert files and web content into clean, usable Markdown easily

Markdownify MCP is a Model Context Protocol server that converts many types of files and web content into clean Markdown. It supports formats such as PDFs, images, audio with transcription, DOCX, XLSX, and PPTX, along with web sources like YouTube transcripts, Bing results, and general webpages. Markdownify MCP is designed to simplify content extraction and make data easier to read, share, and reuse in structured workflows.

Downloads: 2 This Week

Last Update: 3 days ago
See Project
3

AingDesk

AI assistant that supports knowledge bases, model APIs

AingDesk is an open-source desktop and server-based AI assistant platform designed to provide a user-friendly environment for interacting with language models and building AI-powered tools. The software enables users to run local AI models or connect to external model APIs through a unified interface. One of its primary goals is to simplify the process of building knowledge-based assistants by allowing users to create local knowledge bases that the AI can search and analyze. ...

Downloads: 1 This Week

Last Update: 2026-03-06
See Project
4

RAG Web UI

RAG Web UI is an intelligent dialogue system based on RAG

RAG Web UI is an open-source intelligent dialogue system built on retrieval-augmented generation technology, designed to enable users to create AI-powered question answering systems grounded in their own knowledge bases. It combines document retrieval with large language models to provide accurate, context-aware responses based on indexed data rather than generic model knowledge. The platform supports ingestion of multiple document formats, including PDFs, Word files, Markdown, and plain...

Downloads: 0 This Week

Last Update: 2026-04-06
See Project
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime
General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.

Try Free
5

Browserbase MCP Server

Allow LLMs to control a browser with Browserbase and Stagehand

Browserbase MCP Server is a server implementation of the Model Context Protocol (MCP) that enables large language models to interact with web browsers programmatically through cloud-based automation. The project provides a standardized interface for connecting AI systems to real-world web environments, allowing them to navigate pages, extract structured data, and perform user-like actions such as clicking, typing, and form submission.

Downloads: 0 This Week

Last Update: 2026-03-31
See Project
6

BrowserAI

Run local LLMs like llama, deepseek, kokoro etc. inside your browser

BrowserAI is a cutting-edge platform that allows users to run large language models (LLMs) directly in their web browser without the need for a server. It leverages WebGPU for accelerated performance and supports offline functionality, making it a highly efficient and privacy-conscious solution. The platform provides a developer-friendly SDK with pre-configured popular models, and it allows for seamless switching between MLC and Transformer engines.

Downloads: 7 This Week

Last Update: 4 days ago
See Project
7

shimmy

Python-free Rust inference server

The shimmy project is a lightweight local inference server designed to run large language models with minimal overhead. Written primarily in Rust, the tool provides a small standalone binary that exposes an API compatible with the OpenAI interface, allowing existing applications to interact with local models without significant code changes. This compatibility enables developers to replace remote AI services with locally hosted models while keeping their existing software architecture intact. ...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
8

Shadcn UI v4 MCP Server

A mcp server to allow LLMS gain context about shadcn ui component

...The server supports multiple frontend frameworks including React, Svelte, Vue, and React Native, making it highly versatile for cross-platform development. It includes smart caching and efficient GitHub API usage to optimize performance and handle rate limits during component retrieval. The system also supports multiple transport modes such as standard input/output and Server-Sent Events, enabling both local and distributed deployments.

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
9

Secret Llama

Fully private LLM chatbot that runs entirely with a browser

Secret Llama is a privacy-first large-language-model chatbot that runs entirely inside your web browser, meaning no server is required and your conversation data never leaves your device. It focuses on open-source model support, letting you load families like Llama and Mistral directly in the client for fully local inference. Because everything happens in-browser, it can work offline once models are cached, which is helpful for air-gapped environments or travel. ...

Downloads: 1 This Week

Last Update: 2025-11-07
See Project
AI-generated apps that pass security review
Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.

Try Retool free
10

PentestGPT

Automated Penetration Testing Agentic Framework Powered by LLMs

...Built with a modular and extensible architecture, PentestGPT supports cloud and local LLMs, making it suitable for research, education, and authorized security testing.

Downloads: 376 This Week

Last Update: 2025-12-24
See Project
11

Harbor LLM

Run a full local LLM stack with one command using Docker

...It also includes tools for web retrieval, image generation, voice interaction, and workflow automation. Built on Docker, Harbor allows services to run in isolated containers while communicating over a local network. It is intended for local development and experimentation rather than production deployment, giving developers a flexible way to explore AI systems, test configurations, and manage complex LLM stacks without manual wiring or setup overhead.

Downloads: 4 This Week

Last Update: 5 days ago
See Project
12

NextJS Ollama LLM UI

Fully-featured web interface for Ollama LLMs

NextJS Ollama LLM UI is a web-based frontend interface built with Next.js to make interacting with Ollama-hosted large language models easy and fast. Its goal is to remove the complexity of setting up and managing UI components for local or offline LLM usage by providing a straightforward chat experience with support for responsive layouts, light and dark themes, and local chat history storage in the browser.

Downloads: 0 This Week

Last Update: 2026-01-29
See Project
13

kMCP

Kubernetes Controller for building, testing and deploying MCP servers

KMCP is a companion toolchain for building, testing, and deploying MCP servers with a workflow that spans local development through Kubernetes production deployments. It includes a CLI for day-to-day development tasks like scaffolding new MCP projects, managing tools, building container images, and running an MCP server locally for validation. For cluster operations, it includes a Kubernetes controller that manages MCP server lifecycles using a dedicated Custom Resource Definition (CRD), allowing MCP servers to be represented as native Kubernetes objects you can operate with familiar kubectl-driven patterns. ...

Downloads: 1 This Week

Last Update: 2026-03-30
See Project
14

Sled

Teleport Claude Code, Codex or Gemini CLI to your phone

Sled under the layercodedev account appears to be a lightweight web and mobile UI aimed at interacting with local coding agents, likely including AI-assisted coding models or remote execution integrations; it’s designed in TypeScript/JavaScript and intended to let developers use or control coding agents from various devices. Although specific details in the repository are limited without direct project documentation, context and related online mentions indicate it functions as a local interface layer that abstracts development agent workflows and Teleport-style interactions, bringing parts of modern assistant capabilities to phone or web UIs. ...

Downloads: 0 This Week

Last Update: 2026-01-26
See Project
15

Agent Chat UI

Web app for interacting with any LangGraph agent (PY & TS) via a chat

Agent Chat UI is an open-source web application that provides a graphical interface for interacting with AI agents built using LangGraph and related frameworks. The project is implemented as a modern Next.js application and allows users to chat with agent workflows running on remote or local LangGraph servers. Through a simple configuration process, developers can connect the interface to a deployed agent by specifying the server URL, assistant identifier, and authentication credentials. ...

Downloads: 0 This Week

Last Update: 2026-03-31
See Project
16

mcpo

A simple, secure MCP-to-OpenAPI proxy server

mcpo is a minimal bridge that exposes any MCP tool as an OpenAPI-compatible HTTP server. Instead of writing glue code, you point mcpo at an MCP server command and it generates REST endpoints and an OpenAPI spec that other systems (or LLM agent frameworks) can call immediately. This design lets you reuse a growing library of MCP servers with platforms that only understand HTTP+OpenAPI, unifying tool access across ecosystems.

Downloads: 0 This Week

Last Update: 2026-02-27
See Project
17

Pocket TTS

A TTS that fits in your CPU (and pocket)

Pocket TTS is a lightweight text-to-speech project designed to run efficiently on CPUs, targeting developers who want local speech generation without depending on GPUs or hosted web APIs. It is built to feel practical in everyday applications, where installation and usage should be as simple as adding a dependency and calling a function. The project focuses on keeping the runtime footprint manageable while still producing natural-sounding speech, which makes it attractive for offline tools, prototypes, and privacy-sensitive workflows. ...

Downloads: 8 This Week

Last Update: 2026-02-16
See Project
18

Farfalle

AI search engine - self-host with local or cloud LLMs

...The project integrates large language models with multiple search APIs so that the system can gather information from external sources and synthesize responses into concise answers. It can run either with local language models or with cloud-based providers, allowing developers to deploy it privately or integrate with hosted AI services. The architecture separates the frontend and backend, using modern web technologies such as Next.js and FastAPI to deliver an interactive interface and scalable server logic. Farfalle also includes an agent-based search workflow that plans queries and executes multiple search steps to produce more accurate results than traditional keyword searches. ...

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
19

Colab-MCP

An MCP server for interacting with Google Colab

...This approach bridges the gap between local AI agents and remote high-performance compute environments, allowing users to offload heavy workloads such as machine learning training, data analysis, and dependency-heavy tasks to Colab’s GPU and TPU resources. By exposing Colab as an MCP server, the tool enables seamless integration with a wide range of AI assistants and agent frameworks, creating a standardized interface for tool use and execution.

Downloads: 0 This Week

Last Update: 2026-03-27
See Project
20

wllama

WebAssembly binding for llama.cpp - Enabling on-browser LLM inference

wllama is a WebAssembly-based library that enables large language model inference directly inside a web browser. Built as a binding for the llama.cpp inference engine, the project allows developers to run LLM models locally without requiring a server backend or dedicated GPU hardware. The library leverages WebAssembly SIMD capabilities to achieve efficient execution within modern browsers while maintaining compatibility across platforms.

Downloads: 1 This Week

Last Update: 2026-03-10
See Project
21

MCP Proxy

A TypeScript SSE proxy for MCP servers that use stdio transport

mcp-proxy is a lightweight bridge that converts between MCP transports, letting you run a server on stdio and expose it over Streamable HTTP (SSE) or do the reverse. This enables existing desktop-style MCP servers to be reused by web services and IDEs that prefer HTTP, without modifying the server. The tool can multiplex multiple named STDIO servers behind one proxy instance, simplifying fleet deployments or local development with many tools.

Downloads: 5 This Week

Last Update: 2026-04-13
See Project
22

Every Code

Local AI coding agent CLI with multi-agent orchestration tools

Every Code (often referred to simply as Code) is a fast, local AI-powered coding agent designed to run directly in the terminal environment. It is a community-driven fork of the Codex CLI, with a strong emphasis on improving real-world developer ergonomics and workflows. Every Code enhances the traditional coding assistant model by introducing multi-agent orchestration, allowing multiple AI agents to collaborate, compare solutions, and refine outputs in parallel. It supports integration with...

Downloads: 20 This Week

Last Update: 4 days ago
See Project
23

WhisperLive

A nearly-live implementation of OpenAI's Whisper

WhisperLive is a “nearly live” implementation of OpenAI’s Whisper model focused on real-time transcription. It runs as a server–client system in which the server hosts a Whisper backend and clients stream audio to be transcribed with very low delay. The project supports multiple inference backends, including Faster-Whisper, NVIDIA TensorRT, and OpenVINO, allowing you to target GPUs and different CPU architectures efficiently. It can handle microphone input, pre-recorded audio files, and...

Downloads: 15 This Week

Last Update: 2026-03-17
See Project
24

Browser MCP

Browser MCP is a Model Context Provider (MCP) server

...The server exposes structured tools for navigation, element interaction, and artifact capture (DOM, screenshots, logs), all discoverable via MCP schemas. Because it runs against the user’s primary browser, it’s well-suited to repetitive web tasks, authenticated dashboards, and debugging workflows inside MCP-capable IDEs. A public website and extension streamline installation and connect the local server to clients like Claude, Cursor, VS Code, and Windsurf. ...

Downloads: 1 This Week

Last Update: 2025-10-08
See Project
25

KnowNote

A local-first AI knowledge base & NotebookLM alternative

KnowNote is a local-first, open-source AI knowledge base and notebook application created as an Electron-based alternative to Google NotebookLM that emphasizes privacy, control, and simplicity. It lets users build an intelligent, searchable knowledge base from uploaded documents such as PDFs, Word files, PowerPoints, and web pages, and then interact with that content using LLM-powered chat, summarization, and reasoning tools.

Downloads: 3 This Week

Last Update: 2026-01-30
See Project