Showing 151 open source projects for "api server"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    Antigravity Claude Proxy

    Antigravity Claude Proxy

    Proxy that exposes Antigravity provided claude / gemini models

    Antigravity Claude Proxy is a purpose-built proxy server that enables developers to interface with Claude models through a standardized RESTful API, allowing tools and workflows that expect generic HTTP APIs to operate on Anthropic’s Claude without native support. The project acts as a translation layer, receiving web requests in common formats (such as OpenAI-style endpoints) and forwarding them to Anthropic’s API in the required structure, while converting responses back into a familiar shape. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 2
    ACI.dev

    ACI.dev

    Open platform connecting AI agents to tools via unified MCP server

    ACI is an open source platform designed to enable AI agents to interact with external tools through a unified and structured interface. It focuses on simplifying tool integration by connecting hundreds of pre-built services into agentic environments, allowing developers to avoid building custom API clients and authentication flows for each service. ACI provides intent-aware tool access, meaning agents can dynamically discover and use tools based on context rather than rigid configurations. It supports both direct function calling and a unified Model Context Protocol (MCP) server, offering flexibility in how integrations are exposed to AI systems. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3
    AniList MCP

    AniList MCP

    AniList MCP server for accessing anime and manga data

    An MCP server that interfaces with the AniList API, allowing AI clients to access and interact with anime, manga, character, staff, and user data from AniList. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    BrowserAI

    BrowserAI

    Run local LLMs like llama, deepseek, kokoro etc. inside your browser

    BrowserAI is a cutting-edge platform that allows users to run large language models (LLMs) directly in their web browser without the need for a server. It leverages WebGPU for accelerated performance and supports offline functionality, making it a highly efficient and privacy-conscious solution. The platform provides a developer-friendly SDK with pre-configured popular models, and it allows for seamless switching between MLC and Transformer engines. Additionally, it supports features such as...
    Downloads: 7 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    Fli

    Fli

    Google Flights MCP and Python Library

    ...In addition to its CLI interface, fli includes a Model Context Protocol (MCP) server that allows AI assistants to interact with flight data using structured tools, enabling natural language queries and automation workflows.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    Pinchtab

    Pinchtab

    High-performance browser automation bridge and orchestrator

    Pinchtab is a lightweight browser automation backend built specifically for AI agents that need efficient, programmatic web control. Implemented as a small standalone HTTP server, it allows any agent or script to interact with web pages using simple API calls instead of heavyweight browser frameworks. The tool emphasizes accessibility-first snapshots that dramatically reduce token usage compared to screenshot-based approaches, making it cost-effective for large-scale automation. It launches and manages its own Chrome instance while remaining framework-agnostic, so it can be used with any language or agent system. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 7
    shimmy

    shimmy

    Python-free Rust inference server

    The shimmy project is a lightweight local inference server designed to run large language models with minimal overhead. Written primarily in Rust, the tool provides a small standalone binary that exposes an API compatible with the OpenAI interface, allowing existing applications to interact with local models without significant code changes. This compatibility enables developers to replace remote AI services with locally hosted models while keeping their existing software architecture intact. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    webclaw

    webclaw

    Fast, local-first web content extraction for LLMs

    ...The tool addresses a major inefficiency in AI workflows by removing irrelevant elements like navigation menus, ads, and scripts, significantly reducing token usage when feeding data into language models. It supports multiple modes of operation, including CLI usage, REST API access, and an MCP server for direct integration with agent-based systems. Webclaw also provides advanced capabilities such as recursive crawling, structured JSON extraction, summarization, and content comparison, making it suitable for research and data pipelines. Its local-first architecture ensures privacy and eliminates the need for API keys.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Octelium

    Octelium

    A next-gen FOSS self-hosted unified zero trust secure access platform

    ...The project also highlights self-hosted, no hidden “server-side” locked components, giving organizations greater ownership and control over access, rather than relying on proprietary SaaS.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    Infinity

    Infinity

    Low-latency REST API for serving text-embeddings

    Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting all sentence-transformer models and frameworks. Infinity is developed under MIT License. Infinity powers inference behind Gradient.ai and other Embedding API providers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    mistral.rs

    mistral.rs

    Fast, flexible LLM inference

    mistral.rs is a fast and flexible LLM inference engine implemented in Rust, designed to run and serve modern language models with an emphasis on performance and practical deployment. It provides multiple entry points for developers, including a CLI for running models locally and an HTTP server that exposes an OpenAI-compatible API surface for easy integration with existing clients. The project includes hardware-aware tooling that can benchmark a system and choose sensible quantization and device-mapping strategies, helping users get strong performance without manual tuning. It also supports serving multiple models from the same server process, enabling routing or quick switching between models depending on workload needs. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    MCP Golang

    MCP Golang

    Write Model Context Protocol servers in few lines of go code

    mcp-golang is an unofficial Go implementation of the Model Context Protocol (MCP), allowing developers to write MCP servers and clients with minimal code. It aims to simplify the development process by providing a straightforward API for integrating MCP functionalities into Go applications. Comprehensive documentation is available to assist developers in getting started. ​
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Context7 Platform

    Context7 Platform

    Up-to-date code documentation for LLMs and AI code editors

    Context7 is a system that aims to inject fresh, version-specific documentation and code snippets into language model prompts, thereby avoiding reliance on outdated training data or hallucinated APIs. It’s designed to integrate with tools that support the Model Context Protocol (MCP), such as Cursor, Windsurf, and other LLM clients. When a user writes a prompt and appends something like “use context7,” the system detects the libraries or frameworks being asked about, fetches the latest...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 14
    ort

    ort

    Fast ML inference & training for ONNX models in Rust

    ort is a high-performance Rust library that provides bindings to ONNX Runtime, enabling developers to run machine learning inference and training workflows directly within Rust applications using the standardized ONNX model format. It is designed to bridge the gap between modern machine learning frameworks and systems programming by offering a safe, ergonomic API for executing models originally built in ecosystems like PyTorch, TensorFlow, or scikit-learn. The library emphasizes speed and efficiency, leveraging hardware acceleration across CPUs, GPUs, and specialized accelerators to deliver low-latency inference both on-device and in server environments. One of its key strengths is its flexibility, as it supports multiple backends and allows developers to configure execution providers depending on available hardware. ort also includes advanced capabilities such as model compilation and optimization, reducing startup time and improving runtime performance in production systems.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Model Context Protocol (MCP) Servers

    Model Context Protocol (MCP) Servers

    Model Context Protocol Servers

    The Model Context Protocol (MCP) is an open protocol that enables seamless integration between LLM applications and external data sources and tools. Whether you’re building an AI-powered IDE, enhancing a chat interface, or creating custom AI workflows, MCP provides a standardized way to connect LLMs with the context they need.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Pocket TTS

    Pocket TTS

    A TTS that fits in your CPU (and pocket)

    ...The project focuses on keeping the runtime footprint manageable while still producing natural-sounding speech, which makes it attractive for offline tools, prototypes, and privacy-sensitive workflows. Because it is CPU-oriented, it fits well in server environments where GPU access is limited, in desktop apps, or in edge deployments where simplicity matters more than maximum throughput. It also emphasizes developer ergonomics, providing a straightforward API surface that can be integrated into pipelines, assistants, accessibility tools, or batch generation scripts.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 17
    Grounded Docs

    Grounded Docs

    Open-Source Alternative to Context7, Nia, and Ref.Tools

    Grounded Docs is an open-source implementation of a Model Context Protocol server designed to expose documentation and structured information as tools that AI agents can query. The project allows language models and agent frameworks to retrieve and interact with documentation through standardized MCP interfaces. By acting as an intermediary layer between documentation sources and AI tools, the server enables models to access structured documentation in a consistent and machine-readable...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    stt

    stt

    Voice Recognition to Text Tool

    ...It leverages open-source speech models such as Faster-Whisper to recognize and transcribe human speech into plain text, structured JSON objects, or subtitle files with time codes, making it suitable for both personal and professional transcription tasks. The project is designed to be easy to deploy: you can run a local Python server that exposes an HTTP API for uploading audio/video files and retrieving transcriptions in different formats. It supports GPU acceleration if available, enabling faster processing on compatible hardware but still offers reliable performance on CPUs alone.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    ElatoAI

    ElatoAI

    Realtime AI Voice Agents with SoTA Multimodal AI models on Arduino ESP

    ElatoAI is a real-time AI voice agent platform built around IoT hardware (ESP32) that enables continuous speech-to-speech conversations using state-of-the-art multimodal voice models with minimal latency and global performance via edge computing. The system integrates voice synthesis and recognition by connecting an ESP32 device through secure WebSockets to edge server functions written in Deno, allowing users to speak naturally with AI agents hosted through cloud APIs including OpenAI’s Realtime API, Gemini’s Live API, xAI’s Grok Voice Agent API, and others. It includes a web client (built with Next.js) for managing devices, controlling volume, and viewing conversation transcripts, while the hardware runs optimized firmware to deliver responses in near real time — even supporting >15-minute uninterrupted conversations.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Speech-AI-Forge

    Speech-AI-Forge

    Speech-AI-Forge is a project developed around TTS generation model

    Speech-AI-Forge is a full-stack project built around modern text-to-speech generation models, providing both an API server and a Gradio-based web UI for interactive use. At its core, it acts as a hub that wires together multiple speech-related capabilities, including TTS, speech-to-text and LLM-based control flows, behind a consistent interface. The system is designed to be deployed in several ways: you can try it online via hosted demos, spin it up in a one-click Colab environment, run it in Docker containers, or set it up locally with its environment preparation scripts. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    n8n-MCP

    n8n-MCP

    A MCP for Claude Desktop / Claude Code / Windsurf / Cursor

    n8n-mcp is a Model Context Protocol (MCP) server that turns the n8n workflow platform into a set of first-class, typed tools an AI assistant can understand and operate. It exposes structured knowledge of n8n nodes and operations so an agent can reason about workflows, parameters, and executions without scraping docs or guessing API shapes. The server focuses on making Claude Desktop (and other MCP-capable clients) “n8n-literate,” enabling tasks such as inspecting existing workflows, proposing node chains, and validating configuration before runs. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 22
    DiscordGo

    DiscordGo

    (Golang) Go bindings for Discord

    DiscordGo is a Go package that provides low level bindings to the Discord chat client API. DiscordGo has nearly complete support for all of the Discord API endpoints, websocket interface, and voice interface. If you would like to help the DiscordGo package please use this link to add the official DiscordGo test bot dgo to your server. This provides indispensable help to this project. Construct a new Discord client which can be used to access the variety of Discord API functions and to set callback functions for Discord events. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Ito

    Ito

    Ito, smart dictation in every application

    ito is an open‑source JavaScript library for serverless, browser‑to‑browser communication designed for use on devices with or without user input interfaces, such as IoT devices, mobile devices, tablets, and desktops, enabling peer messaging and data sharing via short passcodes and cloud‑backed pairing without an application server.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 24
    5ire

    5ire

    5ire is a cross-platform desktop AI assistant, MCP client

    5ire is a sleek, cross‑platform desktop AI assistant and MCP client that connects to major service providers, supports a local knowledge base and tool integration via MCP servers, enabling robust RAG and assistant features. These components are required as they constitute the runtime environment for the MCP Server. If you don't anticipate using the tools feature immediately, you may choose to skip this installation step and complete it later when the need arises. MCP is an open protocol that...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 25
    MCP OpenAI

    MCP OpenAI

    Chat with OpenAI models from Claude Desktop

    The MCP OpenAI Server is a Model Context Protocol server that allows seamless interaction with OpenAI's models directly from applications like Claude Desktop. It simplifies the integration of OpenAI's language models into various workflows. ​
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB