Showing 246 open source projects for "deploy"

  • 1
    AI Agents Masterclass

    Follow along with my AI Agents Masterclass videos

    AI Agents Masterclass is an educational open-source repository designed to teach developers how to build, train, and deploy intelligent AI agents using modern tooling and workflow patterns. The project includes structured lessons, code examples, and practical exercises that cover foundational concepts like prompt engineering, chaining agents, tool usage, plan execution, evaluation, and safety considerations. It breaks down how autonomous agents interact with external systems, handle iterative reasoning, and integrate with third-party services or APIs to perform real tasks — for example, web search, browsing, scheduling, or coding assistance. ...
    Downloads: 2 This Week
  • 2
    Team9

    Team9 is a collaborative workspace for AI agents

    Team9 is an AI-native collaborative workspace designed to deploy, manage, and coordinate autonomous agents as if they were members of a human team, enabling organizations to automate complex workflows with minimal setup. It builds on agent frameworks like OpenClaw and introduces a managed environment where agents can be assigned roles, share context, and execute tasks collaboratively. The system emphasizes a “local-first” architecture, allowing agents to run on user-controlled infrastructure while maintaining persistent memory and data privacy. ...
    Downloads: 1 This Week
  • 3
    One API

    The LLM API management & distribution system

    ...One API also provides a web-based dashboard for managing keys, monitoring usage, and configuring routing rules. Its architecture is designed for scalability, allowing teams to deploy it as a centralized layer in their AI infrastructure. By abstracting the complexity of working with multiple APIs, it simplifies development and reduces vendor lock-in.
    Downloads: 1 This Week
  • 4
    ZML

    Any model. Any hardware. Zero compromise

    ...The system allows models to be compiled and executed across multiple types of accelerators, including GPUs and TPUs, even when distributed across different machines or locations. One of its key strengths is cross-compilation, enabling developers to build once and deploy across various platforms without rewriting code. zml provides example implementations of models and workflows, demonstrating how to run inference tasks such as image classification or large language models. It is designed to handle complex distributed setups, including scenarios where model components are split across devices connected via networks.
    Downloads: 1 This Week
  • 5
    TensorRT LLM

    TensorRT LLM provides users with an easy-to-use Python API

    TensorRT-LLM is an open-source high-performance inference library specifically designed to optimize and accelerate large language model deployment on NVIDIA GPUs. It provides a Python-based API built on top of PyTorch that allows developers to define, customize, and deploy LLMs efficiently across a variety of hardware configurations, from single GPUs to large multi-node clusters. The library focuses on maximizing throughput and minimizing latency through advanced techniques such as quantization, custom attention kernels, and optimized memory management strategies. It includes support for cutting-edge inference methods like speculative decoding and inflight batching, enabling real-time and large-scale AI applications. ...
    Downloads: 1 This Week
  • 6
    AgentScope

    Build and run agents you can see, understand and trust

    AgentScope is a production-ready agent framework designed to help developers build, deploy, and scale intelligent agentic applications. It provides essential abstractions that evolve with advancing LLM capabilities, emphasizing reasoning, tool use, and flexible orchestration rather than rigid prompt constraints. With built-in support for ReAct agents, memory, planning, human-in-the-loop control, and real-time voice interaction, developers can create powerful agents in minutes. ...
    Downloads: 3 This Week
  • 7
    A.I.G

    Full-stack AI Red Teaming platform

    ...It brings together AI infrastructure vulnerability scanning, MCP server risk analysis, and jailbreak evaluation into a unified workflow so that enterprises and individuals can identify critical security issues without relying on external services. Users can deploy it via Docker or scripts to get a modern web UI that guides them through tasks like scanning third-party frameworks for known CVEs and experimenting with prompt security against attack vectors. The tool provides both a visual interface and a comprehensive API, making integration with internal security systems or CI/CD pipelines practical for ongoing risk management.
    Downloads: 1 This Week
  • 8
    agentic-stack

    One brain, many harnesses. Portable .agent/ folder

    agentic-stack is a framework or toolkit designed to build, orchestrate, and deploy AI agents in a structured and scalable way. It likely provides components for managing agent workflows, communication, and task execution across different systems. The project emphasizes modularity, enabling developers to assemble custom pipelines using various AI models, tools, and APIs. It may include abstractions for memory, planning, and tool usage, reflecting modern agentic AI design patterns. ...
    Downloads: 0 This Week
  • 9
    NemoClaw

    NVIDIA plugin for secure installation of OpenClaw

    ...The platform integrates with AI models such as NVIDIA Nemotron and supports multiple inference backends including cloud APIs, local NIM deployments, and vLLM. Through its command-line interface, developers can deploy, monitor, and manage AI assistants running inside isolated sandboxes. By combining sandbox orchestration, agent management, and AI model integration, NemoClaw provides a secure foundation for building and operating autonomous AI assistants.
    Downloads: 4 This Week
  • 10
    Lecca.io

    Lecca.io | AI Agents & Automations

    Lecca.io is an AI platform that allows you to configure and deploy Large Language Models (LLMs) equipped with powerful tools and workflows. Build, customize, and automate your AI agents with ease.
    Downloads: 0 This Week
  • 11
    smolagents

    Agents write python code to call tools and orchestrate other agents

    ...We provide our definition on this page, where you’ll also find tips on when to use them and when not to (spoiler: you’ll often be better off without agents). smolagents is a lightweight framework for building AI agents using large language models (LLMs). It simplifies the development of AI-driven applications by providing tools to create, train, and deploy language model-based agents.
    Downloads: 0 This Week
  • 12
    DeepPavlov

    A library for deep learning end-to-end dialog systems and chatbots

    DeepPavlov makes it easy for beginners and experts to create dialogue systems. The best place to start is with the user-friendly tutorials, which provide a quick and convenient introduction to using DeepPavlov through complete, end-to-end examples. No installation needed. Guides explain the concepts and components of DeepPavlov. Follow step-by-step instructions to install, configure, and extend the DeepPavlov framework for your use case. DeepPavlov is an open-source framework for chatbots and virtual...
    Downloads: 4 This Week
  • 13
    Rill

    Fast SQL-based BI tool for real-time dashboards and analytics

    ...Rill supports local and remote data sources such as CSV, Parquet, S3, and GCS, making it flexible across environments. Its BI-as-code model combines SQL, YAML configuration, Git version control, and CLI tools, allowing teams to build, manage, and deploy analytics workflows efficiently. Automatic data profiling and responsive query updates help users understand datasets instantly. Interactive dashboards come with opinionated defaults, so teams can focus on insights instead of setup, while metrics layers standardize business logic for consistent reporting across dashboards, APIs, and AI systems.
    Downloads: 2 This Week
  • 14
    Scira

    AI-powered search engine that helps you find information

    ...The project combines a modern web interface with retrieval-augmented generation techniques to deliver responses that read naturally and are backed by evidence. It is built for developers who want to deploy their own Perplexity-style or AI search experience without relying on proprietary hosted services. Scira emphasizes speed, clean UI design, and extensibility so teams can customize data sources, models, and ranking logic. The architecture typically supports real-time querying, streaming responses, and modular backend components. ...
    Downloads: 0 This Week
  • 15
    Plano

    Delivery infrastructure for agentic apps

    Plano is an AI-native proxy and data plane designed to simplify the infrastructure required to deploy and operate agentic applications in production environments. It removes repetitive plumbing work from application code by centralizing capabilities such as agent routing, orchestration, guardrails, observability, and model selection. Built on modern proxy technology and compatible with any language or AI framework, Plano enables developers to focus on core agent logic instead of infrastructure complexity. ...
    Downloads: 0 This Week
  • 16
    FastKoko

    Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model

    FastKoko is a self-hosted text-to-speech server built around the Kokoro-82M model and exposed through a FastAPI backend. It is designed to be easy to deploy via Docker, with separate CPU and GPU images so that users can choose between pure CPU inference and NVIDIA GPU acceleration. The project exposes an OpenAI-compatible speech endpoint, which means existing code that talks to the OpenAI audio API can often be pointed at a Kokoro-FastAPI instance with minimal changes. It supports multiple languages and voicepacks and allows phoneme-based generation for more accurate pronunciation and prosody. ...
    Downloads: 2 This Week
  • 17
    OpenMLSys-ZH

    Machine Learning Systems: Design and Implementation

    ...The repository includes scripts or tooling to keep translation synchronized with upstream changes, versioning, and possibly translation metadata (contributors, timestamp). Users can browse or clone the translated documentation to follow along with the original content, deploy examples, or understand system internals in their preferred language.
    Downloads: 0 This Week
  • 18
    Portia SDK Python

    Portia Labs Python SDK for building agentic workflows

    portia‑sdk‑python is an open-source Python SDK by Portia Labs for creating reliable, stateful, authenticated multi-agent AI workflows. It supports tool-backed agents capable of real-world interactions—like web browsing, API access, and human-in-the-loop clarifications—while maintaining transparency and auditability through structured plans and execution hooks. Designed for production environments, the SDK integrates with local or cloud LLMs (e.g. OpenAI, Anthropic, Mistral, Gemini) and...
    Downloads: 0 This Week
  • 19
    Pruna AI

    Pruna is a model optimization framework built for developers

    Pruna is an open-source, self-hostable AI inference engine designed to help teams deploy and manage large language models (LLMs) efficiently across private or hybrid infrastructures. Built with performance and developer ergonomics in mind, Pruna simplifies inference workflows by enabling multi-model orchestration, autoscaling, GPU resource allocation, and compatibility with popular open-source models. It is ideal for companies or teams looking to reduce reliance on external APIs while maintaining speed, cost-efficiency, and full control over their data and AI stack. ...
    Downloads: 0 This Week
  • 20
    Kubeflow

    Machine Learning Toolkit for Kubernetes

    Kubeflow is an open source Cloud Native machine learning platform based on Google’s internal machine learning pipelines. It seeks to make deployments of machine learning workflows on Kubernetes simple, portable and scalable. With Kubeflow you can deploy best-of-breed open-source systems for ML to diverse infrastructures. You can also take advantage of a number of great features, such as services for managing Jupyter notebooks and support for a TensorFlow Serving container. Wherever you may be running Kubernetes, you can run Kubeflow as well.
    Downloads: 0 This Week
  • 21
    Agent Development Kit (ADK) for Java

    An open-source, code-first Java toolkit

    Google’s Agent Development Kit for Java is an open-source toolkit that helps developers design, evaluate, and deploy advanced AI agents using the Java programming language. The framework follows a code-first approach that treats agent development as a structured software engineering task rather than a collection of prompt scripts. It provides abstractions and tools that allow developers to create agents capable of executing complex workflows, calling tools, and interacting with external services. ...
    Downloads: 1 This Week
  • 22
    OpenVINO Model Server

    A scalable inference server for models optimized with OpenVINO

    OpenVINO™ Model Server is a high-performance inference serving system designed to host and serve machine learning models that have been optimized with the OpenVINO toolkit. It’s implemented in C++ for scalability and efficiency, making it suitable for both edge and cloud deployments where inference workloads must be reliable and high throughput. The server exposes model inference via standard network protocols like REST and gRPC, allowing any client that speaks those protocols to request...
    Downloads: 1 This Week
  • 23
    NVIDIA NeMo Framework

    Scalable generative AI framework built for researchers and developers

    NVIDIA NeMo is a scalable, cloud-native generative AI framework aimed at researchers and PyTorch developers working on large language models, multimodal models, and speech AI (ASR and TTS), with growing support for computer vision. It provides collections of domain-specific modules and reference implementations that make it easier to pre-train, fine-tune, and deploy very large models on multi-GPU and multi-node infrastructure. NeMo 2.0 introduces a Python-based configuration system, replacing YAML with more flexible, programmable configs that can be versioned and composed for different experiments. The framework builds on PyTorch Lightning–style modular abstractions, so training scripts are composed from reusable components for data loading, models, optimizers, and schedulers, which simplifies experimentation and adaptation. ...
    Downloads: 2 This Week
  • 24
    Open Agents

    An open source template for building cloud agents

    ...It emphasizes openness and interoperability, making it easier to integrate with different models, APIs, and external systems. The project also includes examples and templates that demonstrate how to build and deploy agents for real-world applications. By prioritizing composability, it allows developers to combine simple components into more complex agent systems. Overall, open-agents serves as a playground for building and experimenting with next-generation AI agent architectures.
    Downloads: 0 This Week
  • 25
    Bindu

    Bindu: Turn any AI agent into a living microservice

    ...Once integrated, the agent gains a decentralized identity, standardized communication capabilities through protocols such as A2A and AP2, and built-in support for authentication and monetization. The system is designed to be framework-agnostic, meaning developers can build agents using tools like LangChain, OpenAI SDK, or custom implementations and still deploy them seamlessly. Bindu also introduces the concept of an “Internet of Agents,” where multiple specialized agents collaborate, discover each other, and exchange services autonomously.
    Downloads: 0 This Week