Showing 271 open source projects for "deploy"

View related business solutions
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 1
    stt

    stt

    Voice Recognition to Text Tool

    ...It leverages open-source speech models such as Faster-Whisper to recognize and transcribe human speech into plain text, structured JSON objects, or subtitle files with time codes, making it suitable for both personal and professional transcription tasks. The project is designed to be easy to deploy: you can run a local Python server that exposes an HTTP API for uploading audio/video files and retrieving transcriptions in different formats. It supports GPU acceleration if available, enabling faster processing on compatible hardware but still offers reliable performance on CPUs alone.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    OpenMLSys-ZH

    OpenMLSys-ZH

    Machine Learning Systems: Design and Implementation

    ...The repository includes scripts or tooling to keep translation synchronized with upstream changes, versioning, and possibly translation metadata (contributors, timestamp). Users can browse or clone the translated documentation to follow along with the original content, deploy examples, or understand system internals in their preferred language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Portia SDK Python

    Portia SDK Python

    Portia Labs Python SDK for building agentic workflows

    portia‑sdk‑python is an open-source Python SDK by Portia Labs for creating reliable, stateful, authenticated multi-agent AI workflows. It supports tool-backed agents capable of real-world interactions—like web browsing, API access, and human-in-the-loop clarifications—while maintaining transparency and auditability through structured plans and execution hooks. Designed for production environments, the SDK integrates with local or cloud LLMs (e.g. OpenAI, Anthropic, Mistral, Gemini) and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Pruna AI

    Pruna AI

    Pruna is a model optimization framework built for developers

    Pruna is an open-source, self-hostable AI inference engine designed to help teams deploy and manage large language models (LLMs) efficiently across private or hybrid infrastructures. Built with performance and developer ergonomics in mind, Pruna simplifies inference workflows by enabling multi-model orchestration, autoscaling, GPU resource allocation, and compatibility with popular open-source models. It is ideal for companies or teams looking to reduce reliance on external APIs while maintaining speed, cost-efficiency, and full control over their data and AI stack. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    LLM-Pruner

    LLM-Pruner

    On the Structural Pruning of Large Language Models

    LLM-Pruner is an open-source framework designed to compress large language models through structured pruning techniques while maintaining their general capabilities. Large language models often require enormous computational resources, making them expensive to deploy and inefficient for many practical applications. LLM-Pruner addresses this issue by identifying and removing non-essential components within transformer architectures, such as redundant attention heads or feed-forward structures. The framework relies on gradient-based analysis to determine which parameters contribute least to model performance, enabling targeted structural pruning rather than simple weight removal. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Stable Diffusion WebUI Docker

    Stable Diffusion WebUI Docker

    Easy Docker setup for Stable Diffusion with user-friendly UI

    Stable Diffusion WebUI Docker is a Docker-based repository that simplifies running Stable Diffusion with rich user interfaces by packaging multiple popular web UIs into an easy-to-deploy containerized solution. It integrates leading community UIs like AUTOMATIC1111 and ComfyUI into a Docker Compose setup that can be started with a single command, abstracting away dependency installation and environment configuration. Users can choose which UI profile they want to run — for example, full feature AUTOMATIC1111, CPU-only automatic builds, or ComfyUI workflows — and launch them in a consistent, isolated container environment with automatic model and data caching. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    OpenVINO Model Server

    OpenVINO Model Server

    A scalable inference server for models optimized with OpenVINO

    OpenVINO™ Model Server is a high-performance inference serving system designed to host and serve machine learning models that have been optimized with the OpenVINO toolkit. It’s implemented in C++ for scalability and efficiency, making it suitable for both edge and cloud deployments where inference workloads must be reliable and high throughput. The server exposes model inference via standard network protocols like REST and gRPC, allowing any client that speaks those protocols to request...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    VibeSDK

    VibeSDK

    Open source full-stack AI vibe coding platform & web app generator

    ...Users describe the application they want in natural language, and the system generates, previews, and deploys the resulting web app. It uses Cloudflare’s infrastructure (Workers, Containers, sandboxes). It can run untrusted code safely, provide live previews, and deploy apps at scale. VibeSDK gives you the exact methodology, tools, and confidence to turn your ideas into revenue-generating products, faster than you thought possible. Live preview of generated apps in isolated sandbox containers. Support for React + TypeScript + Tailwind generation, backend routing, and modern web stack scaffolding.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    ClearML

    ClearML

    Streamline your ML workflow

    ...The ClearML Server storing experiment, model, and workflow data, and supports the Web UI experiment manager, and ML-Ops automation for reproducibility and tuning. It is available as a hosted service and open source for you to deploy your own ClearML Server. The ClearML Agent for ML-Ops orchestration, experiment and workflow reproducibility, and scalability.
    Downloads: 1 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    Harness

    Harness

    A meta-skill that designs domain-specific agent teams

    ...It enables structured collaboration between agents, allowing them to divide work and operate in coordinated pipelines. Harness also abstracts the complexity of agent design, making it easier for developers to deploy multi-agent systems without extensive configuration. Its approach emphasizes modularity and reuse, allowing generated agents and skills to be applied across different projects. Overall, Harness acts as an automation layer for building and managing complex agent ecosystems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Bindu

    Bindu

    Bindu: Turn any AI agent into a living microservice

    ...Once integrated, the agent gains a decentralized identity, standardized communication capabilities through protocols such as A2A and AP2, and built-in support for authentication and monetization. The system is designed to be framework-agnostic, meaning developers can build agents using tools like LangChain, OpenAI SDK, or custom implementations and still deploy them seamlessly. Bindu also introduces the concept of an “Internet of Agents,” where multiple specialized agents collaborate, discover each other, and exchange services autonomously.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Edgee

    Edgee

    AI gateway with token compression for Claude Code, Codex, and more

    Edgee is an edge-native execution platform designed to run AI-driven logic and data processing directly at the network edge, reducing latency and improving responsiveness for modern applications. It enables developers to deploy functions and workflows closer to users, allowing real-time processing without relying heavily on centralized cloud infrastructure. The platform is built to support event-driven architectures, where actions are triggered by incoming requests, user behavior, or external signals. It integrates AI capabilities into edge environments, making it possible to perform inference, personalization, and decision-making at the point of interaction. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    One API

    One API

    The LLM API management & distribution system

    ...One API also provides a web-based dashboard for managing keys, monitoring usage, and configuring routing rules. Its architecture is designed for scalability, allowing teams to deploy it as a centralized layer in their AI infrastructure. By abstracting the complexity of working with multiple APIs, it simplifies development and reduces vendor lock-in.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Instill Core

    Instill Core

    Instill Core is a full-stack AI infrastructure tool for data

    Instill Core is an open-source, full-stack AI infrastructure platform designed to orchestrate data pipelines, machine learning models, and unstructured data processing into a unified, production-ready system. It provides an end-to-end solution that enables developers to build, deploy, and manage AI-powered applications without needing to manually stitch together multiple tools across the data and model lifecycle. The platform focuses heavily on handling unstructured data such as documents, images, audio, and video, transforming them into AI-ready formats through integrated ETL pipelines and processing workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    ZML

    ZML

    Any model. Any hardware. Zero compromise

    ...The system allows models to be compiled and executed across multiple types of accelerators, including GPUs and TPUs, even when distributed across different machines or locations. One of its key strengths is cross-compilation, enabling developers to build once and deploy across various platforms without rewriting code. zml provides example implementations of models and workflows, demonstrating how to run inference tasks such as image classification or large language models. It is designed to handle complex distributed setups, including scenarios where model components are split across devices connected via networks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Super Magic

    Super Magic

    All-in-one AI productivity platform with agents, workflows, and IM

    Magic is an open source all-in-one AI productivity platform designed to help organizations build, deploy, and scale AI-driven applications efficiently. It is not a single tool but a complete product ecosystem composed of multiple integrated systems that work together to enhance productivity across different business scenarios. Magic centers around a general-purpose AI agent system called Super Magic, which can autonomously understand tasks, plan actions, execute workflows, and perform error correction. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Kiln

    Kiln

    Open source platform for managing, testing, and deploying AI apps

    Kiln is an open source platform designed to help developers build, evaluate, and deploy AI-powered applications with greater structure and reliability. It provides a unified environment for managing prompts, datasets, and evaluation workflows, allowing teams to iterate on AI behavior in a controlled and measurable way. Kiln emphasizes reproducibility, enabling users to track changes to prompts and models while comparing outputs across different configurations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    NVIDIA NeMo Agent Toolkit

    NVIDIA NeMo Agent Toolkit

    Library for efficiently connecting and optimizing teams of AI agents

    ...NeMo Agent Toolkit also supports evaluation systems, prompt optimization, and reinforcement learning techniques to enhance agent behavior over time. By combining instrumentation, workflow orchestration, and performance optimization tools, the platform helps developers deploy scalable and intelligent multi-agent systems.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Cog

    Cog

    Package and deploy machine learning models using Docker containers

    Cog is an open source tool designed to package machine learning models into standardized, production-ready containers. It simplifies the process of deploying models by automatically generating Docker images based on a simple configuration file, eliminating the need to manually write complex Dockerfiles. Developers can define the runtime environment, dependencies, and Python versions required for their models, allowing Cog to build a consistent container environment that follows best...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    browserable

    browserable

    Open source and self-hostable browser automation library for AI agents

    ...Built primarily in JavaScript, the framework offers both a developer-friendly SDK and a REST API that allow integration with AI applications and automation pipelines. It is designed to be self-hostable, which means developers can deploy and run it on their own infrastructure without relying on third-party services. The platform enables the creation of browser-based agents capable of performing complex online workflows such as data collection, research tasks, and automated interactions with web platforms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    handy-ollama

    handy-ollama

    Implement CPU from scratch and play with large model deployments

    handy-ollama is an open-source educational project designed to help developers and AI enthusiasts learn how to deploy and run large language models locally using the Ollama platform. The repository serves as a structured tutorial that explains how to install, configure, and use Ollama to run modern language models on personal hardware without requiring advanced infrastructure. A key focus of the project is enabling users to run large models even without GPUs by leveraging optimized CPU-based inference pipelines. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    AI Engineering Transition Path

    AI Engineering Transition Path

    Research papers and blogs to transition to AI Engineering

    ...The project organizes resources that cover fundamental topics required to understand modern AI systems, including transformers, vector embeddings, tokenization, infrastructure design, and mixture-of-experts architectures. Instead of presenting isolated tutorials, the repository provides a structured pathway that guides engineers through the technical knowledge needed to build and deploy large language model systems. The materials include curated research papers, blog posts, and code examples that explain both theoretical foundations and practical implementation strategies. By consolidating these resources into a single repository, the project helps developers navigate the rapidly expanding AI ecosystem without needing to search through scattered materials.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    AgentScope

    AgentScope

    Build and run agents you can see, understand and trust

    AgentScope is a production-ready agent framework designed to help developers build, deploy, and scale intelligent agentic applications. It provides essential abstractions that evolve with advancing LLM capabilities, emphasizing reasoning, tool use, and flexible orchestration rather than rigid prompt constraints. With built-in support for ReAct agents, memory, planning, human-in-the-loop control, and real-time voice interaction, developers can create powerful agents in minutes. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    DeployStack

    DeployStack

    Centralized credential vault, governance, and token optimization

    DeployStack is an open-source framework that helps developers and teams define and deploy production infrastructure stacks using modular, reusable templates, often with IaC (infrastructure as code) principles. It provides a structured way to compose resources such as cloud networking, compute, and managed services into coherent deployment blueprints that can be versioned and reused across projects. By abstracting common deployment patterns and capturing them as templates, Deploystack reduces duplication of effort that typically occurs when setting up stacks for different applications or environments. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    FastKoko

    FastKoko

    Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model

    FastKoko is a self-hosted text-to-speech server built around the Kokoro-82M model and exposed through a FastAPI backend. It is designed to be easy to deploy via Docker, with separate CPU and GPU images so that users can choose between pure CPU inference and NVIDIA GPU acceleration. The project exposes an OpenAI-compatible speech endpoint, which means existing code that talks to the OpenAI audio API can often be pointed at a Kokoro-FastAPI instance with minimal changes. It supports multiple languages and voicepacks and allows phoneme based generation for more accurate pronunciation and prosody. ...
    Downloads: 1 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB