Showing 246 open source projects for "deploy"

  • 1
    seq2seq-couplet

    Play couplets with a seq2seq model

    ...It also supports serving the trained model through a web service, allowing users to interact with the system after training is complete. In addition to local execution, the project includes Docker files, which make it easier to package and deploy the application in a more reproducible way. The repository also points users to an external dataset source and documents vocabulary formatting requirements for custom datasets, showing that it is meant for both experimentation and extension.
    Downloads: 0 This Week
  • 2
    LLMChat

    Unified interface for AI chat, Agentic workflows and more

    LLMChat is an open-source AI chat platform designed to provide a unified interface for interacting with multiple large language model providers while emphasizing privacy and advanced research capabilities. The system is built as a modern monorepo using technologies such as Next.js and TypeScript, enabling developers to deploy a full-featured web-based chatbot environment. One of its primary goals is to support sophisticated research workflows that combine conversational AI with information retrieval and reasoning tools. The platform includes specialized interaction modes such as deep research analysis and enhanced search capabilities that help users explore complex topics more effectively. ...
    Downloads: 0 This Week
  • 3
    LLM-Pruner

    On the Structural Pruning of Large Language Models

    LLM-Pruner is an open-source framework designed to compress large language models through structured pruning techniques while maintaining their general capabilities. Large language models often require enormous computational resources, making them expensive to deploy and inefficient for many practical applications. LLM-Pruner addresses this issue by identifying and removing non-essential components within transformer architectures, such as redundant attention heads or feed-forward structures. The framework relies on gradient-based analysis to determine which parameters contribute least to model performance, enabling targeted structural pruning rather than simple weight removal. ...
    Downloads: 0 This Week
  • 4
    TaxHacker

    Self-hosted AI accounting app. LLM analyzer for receipts

    ...It integrates large language models to analyze these documents, extract relevant financial information, and categorize expenses or income based on configurable rules. Users can deploy the application on their own infrastructure, ensuring that financial data remains private and under their control rather than being processed by external services. The software provides tools for tracking income streams, monitoring expenses, and organizing financial records in a structured format. Because the system supports customizable prompts and categories, users can adapt the AI analysis to match their accounting workflows or tax requirements.
    Downloads: 0 This Week
  • 5
    SmythOS

    Cloud-native runtime for agentic AI

    ...It provides a foundational infrastructure layer that functions similarly to an operating system for agentic AI systems, managing resources such as language models, storage, vector databases, and caching through a unified interface. Developers can use the runtime to create, deploy, and orchestrate intelligent agents across local machines, cloud environments, or hybrid infrastructures without rewriting their application logic. The platform includes a software development kit and command-line interface that allow developers to define agent workflows, manage execution environments, and automate deployment processes. ...
    Downloads: 0 This Week
  • 6
    Generative AI Use Cases (GenU)

    Application implementation with business use cases

    ...These examples cover tasks such as document analysis, conversational assistants, content generation, and knowledge retrieval systems. The repository is intended to serve as both a learning resource and a starting point for developers who want to deploy generative AI solutions using AWS infrastructure.
    Downloads: 0 This Week
  • 7
    Agent Development Kit (ADK) for Java

    An open-source, code-first Java toolkit

    Google’s Agent Development Kit for Java is an open-source toolkit that helps developers design, evaluate, and deploy advanced AI agents using the Java programming language. The framework follows a code-first approach that treats agent development as a structured software engineering task rather than a collection of prompt scripts. It provides abstractions and tools that allow developers to create agents capable of executing complex workflows, calling tools, and interacting with external services. ...
    Downloads: 0 This Week
  • 8
    Paddler

    Open-source LLM load balancer and serving platform for hosting LLMs

    Paddler is an open-source LLM infrastructure platform designed to deploy, manage, and scale large language models on private infrastructure. The system acts as a specialized load balancer and serving layer for language models, enabling organizations to run inference workloads without relying on external API providers. It supports running models locally through engines such as llama.cpp while distributing requests across multiple compute nodes to improve performance and reliability. ...
    Downloads: 0 This Week
  • 9
    MaxText

    A simple, performant and scalable JAX LLM

    ...The framework focuses on simplicity while still supporting advanced techniques such as model sharding, distributed computation, and high-throughput training pipelines. MaxText includes ready-to-use configurations and reproducible training examples that help developers understand how to deploy large-scale AI workloads with modern machine learning infrastructure.
    Downloads: 0 This Week
  • 10
    Lagent

    A lightweight framework for building LLM-based agents

    Lagent is a lightweight open-source framework designed to help developers build autonomous agents powered by large language models. The framework provides tools and abstractions that allow language models to interact with external tools, execute tasks, and perform multi-step reasoning processes. Instead of using LLMs only for text generation, Lagent enables developers to transform models into agents capable of performing actions such as retrieving data, executing code, or interacting with...
    Downloads: 0 This Week
  • 11
    Agent Chat UI

    Web app for interacting with any LangGraph agent (PY & TS) via a chat

    ...Once connected, the interface enables real-time conversations where messages are sent to the agent and responses are streamed back to the chat interface. The project is designed to serve as a flexible frontend for agent-based AI systems, allowing developers to test and deploy conversational interfaces quickly. It also integrates with tools such as LangSmith for monitoring and debugging agent interactions during development.
    Downloads: 0 This Week
  • 12
    II Agent

    A new open-source framework to build and deploy intelligent agents

    II-Agent is an open-source intelligent assistant framework designed to automate complex workflows across multiple domains using large language models and external tools. The platform allows users to interact with multiple AI models within a single environment while connecting those models to external services and knowledge sources. Through a unified interface, users can switch between models, access specialized tools, and execute tasks that require information retrieval, code execution, or...
    Downloads: 0 This Week
  • 13
    FastDeploy

    High-performance Inference and Deployment Toolkit for LLMs and VLMs

    ...Developed within the PaddlePaddle ecosystem, the toolkit focuses on providing high-performance deployment capabilities for modern AI models including large language models and vision-language systems. The platform enables developers to deploy trained models quickly using optimized inference pipelines that support GPUs, specialized AI accelerators, and other hardware architectures. FastDeploy includes advanced acceleration technologies such as speculative decoding, multi-token prediction, and efficient KV cache management to improve throughput and latency during inference. ...
    Downloads: 0 This Week
  • 14
    Sandstorm

    One API call, full Claude agent, completely sandboxed

    Sandstorm is an open-source project that wraps a powerful Claude-based AI agent within a completely sandboxed, ephemeral API service designed to make agentic AI workflows easy to deploy and scale without infrastructure complexity. The core idea is to provide “one API call” access to a robust Claude agent loop that runs inside a secure sandbox, so you can upload files, connect tools, and run long-running tasks — all managed behind a simple REST-style interface that disappears when the work is done. This approach lowers the friction of building autonomous agents by removing the need to provision servers, orchestrate distributed agents, or manage persistent tooling; agents can be spun up in parallel without manual setup and shut down when complete. ...
    Downloads: 0 This Week
  • 15
    AI Agents Masterclass

    Follow along with my AI Agents Masterclass videos

    AI Agents Masterclass is an educational open-source repository designed to teach developers how to build, train, and deploy intelligent AI agents using modern tooling and workflow patterns. The project includes structured lessons, code examples, and practical exercises that cover foundational concepts like prompt engineering, chaining agents, tool usage, plan execution, evaluation, and safety considerations. It breaks down how autonomous agents interact with external systems, handle iterative reasoning, and integrate with third-party services or APIs to perform real tasks — for example, web search, browsing, scheduling, or coding assistance. ...
    Downloads: 0 This Week
  • 16
    DBHub

    Universal database MCP server connecting to MySQL, PostgreSQL

    DBHub is a universal database gateway that implements the MCP server interface so assistants and IDEs can explore and query databases through typed tools. It supports multiple transports—stdio for desktop clients and HTTP for networked scenarios—making it flexible to embed or deploy. Configuration is environment-variable driven, with a DSN and per-engine settings covering Postgres, MySQL, MariaDB, SQL Server, and SQLite. Operational flags include read-only mode, row limits, and even SSH tunneling options for secure access into private networks. A demo mode ships with an in-memory SQLite “employee” dataset so users can try the tools immediately without provisioning a database. ...
    Downloads: 0 This Week
  • 17
    VibeSDK

    Open source full-stack AI vibe coding platform & web app generator

    ...Users describe the application they want in natural language, and the system generates, previews, and deploys the resulting web app. It runs on Cloudflare’s infrastructure (Workers, Containers, sandboxes), so it can run untrusted code safely, show live previews of generated apps in isolated sandbox containers, and deploy apps at scale. VibeSDK aims to provide the methodology and tools for turning ideas into products quickly. It supports React + TypeScript + Tailwind generation, backend routing, and modern web stack scaffolding.
    Downloads: 0 This Week
  • 18
    Superagent

    Superagent protects your AI applications

    Superagent is an open-source AI safety platform built to protect applications from prompt injections, data leaks, and harmful outputs. It embeds real-time safety directly into AI workflows, helping teams secure models before threats cause damage. Superagent provides guardrails that block jailbreaks, prompt manipulation, and sensitive data exfiltration. It includes redaction tools to remove PII, PHI, and secrets automatically from text. The platform also scans code repositories to detect...
    Downloads: 0 This Week
  • 19
    Argilla

    The open-source data curation platform for LLMs

    Argilla is a production-ready framework for building and improving datasets for NLP projects. Deploy your own Argilla Server on Spaces with a few clicks. Use embeddings to find the most similar records with the UI. This feature uses vector search combined with traditional search (keyword and filter based). Argilla is free, open-source, and 100% compatible with major NLP libraries (Hugging Face transformers, spaCy, Stanford Stanza, Flair, etc.).
    Downloads: 0 This Week
  • 20
    OpenSumi

    A framework that helps you quickly build Cloud or Desktop IDE products

    OpenSumi integrates easily with your coding frameworks and supports container, Electron, and front-end frameworks. It also helps you ship and deploy quickly. It supports VS Code plugins, OpenSumi plugins, and OpenSumi modules to meet various business requirements. The UI can be customized by configuring the built-in UI, developing a UI template, or building your own UI through plugins. The OpenSumi framework aims to solve the problem of redundant IDE development within Alibaba, to extend IDE customization to more vertical scenarios, and to provide a shared underlying layer for Web and local clients, so that IDE development can move from the early "slash-and-burn" era to the "machine-based mass production" era.
    Downloads: 0 This Week
  • 21
    MIVisionX

    Set of comprehensive computer vision & machine intelligence libraries

    The MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions along with a Convolution Neural Net Model Compiler & Optimizer supporting the ONNX and Khronos NNEF™ exchange formats. The toolkit allows for rapid prototyping and deployment of optimized computer vision and machine learning...
    Downloads: 2 This Week
  • 22
    RuoYi AI

    Enterprise AI platform for building, deploying, and managing apps

    RuoYi AI is a full-stack enterprise-oriented AI development platform designed to help developers rapidly build, deploy, and manage intelligent applications using modern large language models and AI ecosystems. It provides a unified framework for integrating multiple AI models from different providers, allowing teams to switch or combine models through a consistent interface without vendor lock-in. RuoYi AI includes built-in support for retrieval-augmented generation, enabling organizations to create secure, private knowledge bases with high-accuracy search and reasoning capabilities. ...
    Downloads: 0 This Week
  • 23
    AWS GenAI LLM Chatbot

    A modular and comprehensive solution to deploy a Multi-LLM

    AWS GenAI LLM Chatbot is an enterprise-ready reference solution for deploying a secure, feature-rich generative AI chatbot on AWS with retrieval-augmented generation capabilities. The project is built as a modular blueprint that helps organizations stand up a production-oriented chat experience rather than a simple demo, combining model access, knowledge retrieval, storage, security, and user interface components into one deployable system. It supports multiple model providers and endpoints,...
    Downloads: 0 This Week
  • 24
    Farfalle

    AI search engine - self-host with local or cloud LLMs

    ...The project integrates large language models with multiple search APIs so that the system can gather information from external sources and synthesize responses into concise answers. It can run either with local language models or with cloud-based providers, allowing developers to deploy it privately or integrate with hosted AI services. The architecture separates the frontend and backend, using modern web technologies such as Next.js and FastAPI to deliver an interactive interface and scalable server logic. Farfalle also includes an agent-based search workflow that plans queries and executes multiple search steps to produce more accurate results than traditional keyword searches. ...
    Downloads: 0 This Week
  • 25
    NVIDIA Generative AI Examples

    Generative AI reference workflows

    ...The repository includes examples covering topics such as retrieval-augmented generation pipelines, agent-based workflows, and multimodal AI applications that combine text, vision, and data processing. Many of the examples show how to deploy AI services using containerized environments, GPU acceleration, and microservices that can scale across modern infrastructure. Developers can explore sample chatbot applications, document question-answering systems, and knowledge-base pipelines that illustrate how generative AI can interact with external data sources.
    Downloads: 0 This Week