Compare the Top Runbook Automation Platforms that integrate with Datadog as of November 2025

This is a list of Runbook Automation platforms that integrate with Datadog. The products that work with Datadog are listed in the table below.

What are Runbook Automation Platforms for Datadog?

Runbook automation platforms are designed to automate repetitive and routine IT operations tasks, improving efficiency and reducing human error. These platforms allow businesses to create, manage, and execute workflows (runbooks) that automate system monitoring, incident response, software deployments, patch management, and other critical operations tasks. By integrating with various IT systems, cloud services, and monitoring tools, runbook automation platforms enable IT teams to respond to events and incidents in real time, following predefined processes to maintain system uptime and compliance. They often include features for error handling, logging, and alerting, so that operations run smoothly and issues are addressed proactively. These platforms help businesses achieve faster response times, improve operational consistency, and enhance scalability. Compare and read user reviews of the best Runbook Automation platforms for Datadog currently available using the table below. This list is updated regularly.

  • 1
    PagerDuty

    PagerDuty

    PagerDuty, Inc. (NYSE:PD) is a leader in digital operations management. In an always-on world, organizations of all sizes trust PagerDuty to help them deliver a perfect digital experience to their customers, every time. Teams use PagerDuty to identify issues and opportunities in real time and bring together the right people to fix problems faster and prevent them in the future. PagerDuty's ecosystem of more than 350 integrations, including Slack, Zoom, ServiceNow, AWS, Microsoft Teams, and Salesforce, enables teams to centralize their technology stack, get a holistic view of their operations, and optimize processes within their toolsets.
  • 2
    Squadcast

    Squadcast

    Squadcast is an incident management tool that’s purpose-built for SRE. Create a blameless culture by reducing the need for physical war rooms, centralize SLO dashboards, unify internal and external SLIs, and automate incident resolution and knowledge base creation with Squadcast Actions. Adopt world-class site reliability practices with a centralized SLO dashboard to view your system health. Anticipate incidents before they occur and respond proactively. The first step toward better incident management is adding enough context to incidents as they are detected. With Squadcast, discover everything you need to take action and achieve best-in-class MTTD with highly configurable features like alert deduplication and tagging.
    Starting Price: Free
  • 3
    StackStorm

    StackStorm

    StackStorm connects all your apps, services, and workflows. From simple if/then rules to complicated workflows, StackStorm lets you automate DevOps your way. There is no need to change your existing processes or workflows; StackStorm connects what you already have. Community is what makes a good product great. StackStorm is used by many people around the world, and you can always count on getting answers to your questions. StackStorm can be used to automate and streamline nearly any part of your business. Some of the most common applications: when failures happen, StackStorm can act as Tier 1 support, troubleshooting, fixing known problems, and escalating to humans when needed. Continuous deployment can get complex, going beyond Jenkins or other specialized, opinionated tools; StackStorm lets you automate advanced CI/CD pipelines your way. ChatOps brings automation and collaboration together, transforming DevOps teams to get things done better, faster, and with style.
  • 4
    Shoreline

    Shoreline.io

    Shoreline is the Cloud Reliability platform, the only platform that lets DevOps engineers build automations in an afternoon and fix issues forever. Shoreline reduces on-call complexity by running across clouds, Kubernetes clusters, and VMs, allowing operators to manage their entire fleet as if it were a single box. Debugging and repairing issues is easy with advanced tooling for your best SREs, automated runbooks for the broader team, and a platform that makes building automations 30X faster. Shoreline does the heavy lifting, setting up monitors and building repair scripts, so that customers only need to configure them for their environment. Shoreline’s modern “Operations at the Edge” architecture runs efficient agents in the background of all monitored hosts. Agents run as a DaemonSet on Kubernetes or as an installed package on VMs (apt, yum). The Shoreline backend is hosted by Shoreline in AWS, or deployed in your AWS virtual private cloud.
  • 5
    Rootly

    Rootly

    Rootly is an AI-native incident management platform built to help modern teams prevent and resolve incidents faster. It streamlines on-call scheduling, incident response, retrospectives, and status updates through intelligent automation and deep integrations with Slack, Teams, Jira, and Zoom. Powered by Rootly AI, the system automates root cause analysis, provides suggested fixes, and compiles incident data into clear summaries for faster recovery. Teams can manage incidents directly within their communication tools, reducing context switching and human error. With automated retrospectives and actionable insights, Rootly enables continuous improvement and reliability across engineering organizations. Trusted by global brands like Figma, Canva, Nvidia, and Webflow, it helps companies maintain uptime, minimize disruption, and create a culture of proactive resilience.
  • 6
    Doctor Droid

    Doctor Droid

    Doctor Droid is an AI-driven platform designed to revolutionize monitoring and troubleshooting for engineering teams. It automates complex investigations, following standard operating procedures to analyze data across multiple integrations, identify root causes, and execute standard runbooks for self-healing. By proactively listening for alerts, Doctor Droid prepares relevant data and insights, reducing on-call time by up to 80% and enabling engineers to respond swiftly. It facilitates rapid onboarding of new engineers by automating the search for documents, learning new tools, and understanding data, allowing them to become primary on-calls from day one. With the capability to perform ad-hoc investigations, such as analyzing Kubernetes clusters or checking recent deployments, Doctor Droid adapts and creates new plans based on suggestions and existing documents. It integrates seamlessly with over 40 tools across the stack.
    Starting Price: $99 per month
  • 7
    Cutover

    Cutover

    The Cutover platform enables enterprises to simplify complexity, streamline work, and increase visibility. Cutover’s AI-powered automated runbooks connect teams, technology, and systems, increasing efficiency and reducing risk in IT disaster and cyber recovery, cloud migration, release management, and technology implementation. As a centralized system of execution, Cutover differentiates itself with scalable and proven dynamic, automated runbook technology that transforms enterprise IT operations with a new way of working. Cutover enables the creation of a template library of comprehensive, executable, and auditable runbooks covering the entire IT infrastructure. Cutover is trusted by world-leading institutions, including the three largest US banks and three of the world’s five largest investment banks.