Showing 271 open source projects for "deploy"

  • 1
    Better Chatbot

    Just a Better Chatbot. Powered by MCP Client & Workflows

    Better‑chatbot is an AI chatbot framework powered by MCP protocols and workflows, allowing developers to deploy and integrate AI-powered chat systems with ease. Integrates all major LLMs: OpenAI, Anthropic, Google, xAI, Ollama, and more. MCP protocol, web search, JS/Python code execution, data visualization. Custom agents, visual workflows, artifact generation. Realtime voice chat with full MCP tool integration.
    Downloads: 1 This Week
  • 2
    Torch-TensorRT

    PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT

    Torch-TensorRT is a compiler for PyTorch/TorchScript, targeting NVIDIA GPUs via NVIDIA’s TensorRT Deep Learning Optimizer and Runtime. Unlike PyTorch’s Just-In-Time (JIT) compiler, Torch-TensorRT is an Ahead-of-Time (AOT) compiler, meaning that before you deploy your TorchScript code, you go through an explicit compile step to convert a standard TorchScript program into a module targeting a TensorRT engine. Torch-TensorRT operates as a PyTorch extension and compiles modules that integrate into the JIT runtime seamlessly. After compilation, using the optimized graph should feel no different from running a TorchScript module. ...
    Downloads: 2 This Week
  • 3
    DeepCamera

    Open-Source AI Camera. Empower any camera/CCTV

    ...It provides open-source facial recognition-based intrusion detection, fall detection, and parking lot monitoring with the inference engine on your local device. SharpAI-hub is the cloud hosting for AI applications that helps you deploy AI applications with your CCTV camera on your edge device in minutes. SharpAI yolov7_reid is an open-source Python application that leverages AI technologies to detect intruders with traditional surveillance cameras. It uses Yolov7 as a person detector, FastReID for person feature extraction, Milvus as the local vector database for self-supervised learning to identify unseen persons, and Labelstudio to host images locally for further use, such as labeling data and training your own classifier. ...
    Downloads: 3 This Week
  • 4
    CoPaw

    Your Personal AI Assistant; easy to install, deploy on local or cloud

    CoPaw is a personal AI assistant designed to run on your own machine or in the cloud, giving you full control over memory, models, and data. Built by the AgentScope team, it connects to multiple chat platforms—including DingTalk, Feishu, QQ, Discord, iMessage, and more—through a single unified assistant. CoPaw supports both cloud-based LLM providers and fully local models such as llama.cpp, MLX, and Ollama, allowing you to operate without API keys if preferred. It includes a browser-based...
    Downloads: 8 This Week
  • 5
    LitGPT

    20+ high-performance LLMs with recipes to pretrain, finetune at scale

    LitGPT is a collection of over 20 high-performance large language models (LLMs) accompanied by recipes to pretrain, finetune, and deploy them at scale. It provides implementations without abstractions, making it beginner-friendly while offering advanced features like flash attention and support for various precision levels. LitGPT is designed to run efficiently across multiple GPUs or TPUs, catering to both small-scale and large-scale deployments.
    Downloads: 0 This Week
  • 6
    Eliza

    Autonomous agents for everyone

    Build and deploy autonomous AI agents with consistent personalities across Discord, Twitter, and Telegram. Full support for voice, text, and media interactions. Built-in RAG memory system, document processing, media analysis, and autonomous trading capabilities. Supports multiple AI models including Llama, GPT-4, and Claude. Create custom actions, add new platform integrations, and extend functionality through a modular plugin system.
    Downloads: 0 This Week
  • 7
    Prompt flow

    Build high-quality LLM apps

    Prompt flow is a suite of development tools designed to streamline the end-to-end development cycle of LLM-based AI applications, from ideation, prototyping, testing, and evaluation to production deployment and monitoring. It makes prompt engineering much easier and enables you to build LLM apps with production quality.
    Downloads: 0 This Week
  • 8
    Mastra

    The TypeScript AI agent framework

    Mastra is a TypeScript-first framework for building AI-powered applications and agents, designed to take projects from prototype to production on a modern JavaScript/TypeScript stack. It integrates cleanly with React, Next.js, and Node-based backends, but can also run as a standalone server, giving teams flexibility in how they deploy their AI logic. At its core, Mastra provides abstractions for agents, workflows, tools, memory, retrieval, and model routing, so developers can focus on specifying behavior rather than wiring infrastructure from scratch. Model routing lets you connect to dozens of providers (OpenAI, Anthropic, Gemini, and others) through a single standardized interface, while agents orchestrate LLM calls and tools to solve open-ended tasks with internal reasoning loops. ...
    Downloads: 5 This Week
  • 9
    Chitu

    High-performance inference framework for large language models

    Chitu is a high-performance inference engine designed to deploy and run large language models efficiently in production environments. The framework focuses on improving efficiency, flexibility, and scalability for organizations that need to run LLM inference workloads across different hardware platforms. It supports heterogeneous computing environments, including CPUs, GPUs, and various specialized AI accelerators, allowing models to run across a wide range of infrastructure configurations. ...
    Downloads: 2 This Week
  • 10
    GPUStack

    Performance-optimized AI inference on your GPUs

    ...The platform supports GPUs from a wide range of vendors and can run on laptops, workstations, and servers across operating systems such as macOS, Windows, and Linux. It also enables developers to deploy models from common repositories like Hugging Face and access them through APIs similar to cloud-based AI services.
    Downloads: 2 This Week
  • 11
    Jovo Framework

    The React for Voice and Chat, build apps for Alexa, Google Assistant

    ...Our component and plugin architecture makes it possible to make Jovo work for your specific use case, across projects. The Jovo Debugger and Test Suite enable you to build robust experiences that behave exactly how you want them to. The Jovo CLI makes it possible to deploy your project to different environments using stage specific configurations.
    Downloads: 2 This Week
  • 12
    diff2html

    Pretty diff to HTML JavaScript library (diff2html)

    ...Similar lines are paired, allowing for easier change tracking. We work hard to make sure you can have your diffs in a simple and flexible way. Wrapper and helper adding syntax highlight, synchronized scroll, and other nice features. You can use it without syntax highlight or by passing your own implementation with the languages you prefer. Diff2Html can be used in various ways as listed in the distributions section.
    Downloads: 2 This Week
  • 13
    Rill

    Fast SQL-based BI tool for real-time dashboards and analytics

    ...Rill supports local and remote data sources such as CSV, Parquet, S3, and GCS, making it flexible across environments. Its BI-as-code model combines SQL, YAML configuration, Git version control, and CLI tools, allowing teams to build, manage, and deploy analytics workflows efficiently. Automatic data profiling and responsive query updates help users understand datasets instantly. Interactive dashboards come with opinionated defaults, so teams can focus on insights instead of setup, while metrics layers standardize business logic for consistent reporting across dashboards, APIs, and AI systems.
    Downloads: 3 This Week
  • 14
    Open Agents

    An open source template for building cloud agents

    ...It emphasizes openness and interoperability, making it easier to integrate with different models, APIs, and external systems. The project also includes examples and templates that demonstrate how to build and deploy agents for real-world applications. By prioritizing composability, it allows developers to combine simple components into more complex agent systems. Overall, open-agents serves as a playground for building and experimenting with next-generation AI agent architectures.
    Downloads: 1 This Week
  • 15
    Cognita

    Open source RAG framework for building scalable modular AI apps

    Cognita is an open source framework designed to help developers build, organize, and deploy Retrieval-Augmented Generation (RAG) applications in a structured and production-ready way. It addresses the gap between quick experimentation in notebooks and the complexity of deploying scalable AI systems by introducing a modular and API-driven architecture. Cognita provides reusable components such as parsers, data loaders, embedders, retrievers, and query controllers, allowing teams to customize each stage of the RAG pipeline independently. ...
    Downloads: 1 This Week
  • 16
    LangBot

    Production-grade platform for building agentic IM bots

    LangBot is an open source platform designed to build and deploy AI-powered chatbots across multiple instant messaging ecosystems. The system allows developers to integrate large language models into messaging platforms so that bots can perform tasks, answer questions, and automate workflows directly within everyday communication tools. It supports numerous messaging services including Discord, Slack, Telegram, WeChat, and other enterprise communication systems, making it a flexible solution for both personal projects and organizational deployments. ...
    Downloads: 1 This Week
  • 17
    Tribe AI

    Low code tool to rapidly build and coordinate multi-agent teams

    Have you heard the saying, 'Two minds are better than one'? That's true for agents too. Tribe leverages the LangGraph framework to let you customize and coordinate teams of agents easily. By splitting up tough tasks among agents who are good at different things, each one can focus on what it does best. This makes solving problems faster and better.
    Downloads: 1 This Week
  • 18
    NemoClaw

    NVIDIA plugin for secure installation of OpenClaw

    ...The platform integrates with AI models such as NVIDIA Nemotron and supports multiple inference backends including cloud APIs, local NIM deployments, and vLLM. Through its command-line interface, developers can deploy, monitor, and manage AI assistants running inside isolated sandboxes. By combining sandbox orchestration, agent management, and AI model integration, NemoClaw provides a secure foundation for building and operating autonomous AI assistants.
    Downloads: 4 This Week
  • 19
    agentic-stack

    One brain, many harnesses. Portable .agent/ folder

    agentic-stack is a framework or toolkit designed to build, orchestrate, and deploy AI agents in a structured and scalable way. It likely provides components for managing agent workflows, communication, and task execution across different systems. The project emphasizes modularity, enabling developers to assemble custom pipelines using various AI models, tools, and APIs. It may include abstractions for memory, planning, and tool usage, reflecting modern agentic AI design patterns. ...
    Downloads: 0 This Week
  • 20
    Lecca.io

    Lecca.io | AI Agents & Automations

    Lecca.io is an AI platform that allows you to configure and deploy Large Language Models (LLMs) equipped with powerful tools and workflows. Build, customize, and automate your AI agents with ease.
    Downloads: 0 This Week
  • 21
    smolagents

    Agents write Python code to call tools and orchestrate other agents

    ...We provide our definition on this page, where you’ll also find tips for when to use them or not (spoilers: you’ll often be better off without agents). smolagents is a lightweight framework for building AI agents using large language models (LLMs). It simplifies the development of AI-driven applications by providing tools to create, train, and deploy language model-based agents.
    Downloads: 0 This Week
  • 22
    TorchServe

    Serve, optimize and scale PyTorch models in production

    TorchServe is a performant, flexible and easy-to-use tool for serving PyTorch eager mode and TorchScripted models. Multi-model management with optimized worker-to-model allocation. REST and gRPC support for batched inference. Export your model for optimized inference. TorchScript out of the box, ORT, IPEX, TensorRT, FasterTransformer. Performance Guide: built-in support to optimize, benchmark and profile PyTorch and TorchServe performance. Expressive handlers: An expressive handler...
    Downloads: 0 This Week
  • 23
    Triton Inference Server

    The Triton Inference Server provides an optimized cloud

    Triton Inference Server is open-source inference serving software that streamlines AI inferencing. Triton enables teams to deploy any AI model from multiple deep learning and machine learning frameworks, including TensorRT, TensorFlow, PyTorch, ONNX, OpenVINO, Python, RAPIDS FIL, and more. Triton supports inference across cloud, data center, edge, and embedded devices on NVIDIA GPUs, x86 and ARM CPUs, or AWS Inferentia. Triton delivers optimized performance for many query types, including real-time, batched, ensembles, and audio/video streaming. ...
    Downloads: 2 This Week
  • 24
    Scira

    AI-powered search engine that helps you find information

    ...The project combines a modern web interface with retrieval-augmented generation techniques to deliver responses that are both natural language friendly and evidence oriented. It is built for developers who want to deploy their own Perplexity-style or AI search experience without relying on proprietary hosted services. Scira emphasizes speed, clean UI design, and extensibility so teams can customize data sources, models, and ranking logic. The architecture typically supports real-time querying, streaming responses, and modular backend components. ...
    Downloads: 0 This Week
  • 25
    Plano

    Delivery infrastructure for agentic apps

    Plano is an AI-native proxy and data plane designed to simplify the infrastructure required to deploy and operate agentic applications in production environments. It removes repetitive plumbing work from application code by centralizing capabilities such as agent routing, orchestration, guardrails, observability, and model selection. Built on modern proxy technology and compatible with any language or AI framework, Plano enables developers to focus on core agent logic instead of infrastructure complexity. ...
    Downloads: 0 This Week