Open Source Observability Tools - Page 2

Observability Tools

View 154 business solutions
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    tapir

    tapir

    Declarative, type-safe web endpoints library

    Declarative, type-safe web endpoints library. With tapir, you can describe HTTP API endpoints as immutable Scala values. Each endpoint can contain a number of input and output parameters. Compile-time guarantees, develop-time completions, read-time information. Separate the shape of the endpoint (the "what"), from the server logic (the "how"). Generate documentation from endpoint descriptions. Leverage the metadata to report rich metrics and tracing information. Re-use common endpoint definitions, as well as individual inputs/outputs. Library, not a framework, integrates with your stack. Is your company already using tapir? We're continually expanding the "adopters" section in the documentation; the more the merrier! It would be great to feature your company's logo, but in order to do that, we'll need to write permission to avoid any legal misunderstandings.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    BFE

    BFE

    A modern layer 7 load balancer from baidu

    BFE (Beyond Front End) is a modern layer 7 load balancer from baidu. BFE has a builtin plugin framework that makes it possible to develop new features rapidly by writing plugins. BFE is designed to provide every tenant a dedicated share of the instance. Each tenant’s configuration is isolated and remains invisible to other tenants. BFE supports HTTP, HTTPS, SPDY, HTTP2, gRPC, WebSocket, TLS, FastCGI, etc. Future support is planned for HTTP/3. BFE provides an advanced domain-specific language to describe routing rules which are easy to understand and maintain. BFE supports global load balancing and distributed load balancing for zone aware balancing, zone level failure resilience, overload protection etc. BFE provides a rich set of plugins for traffic management, security, observability, etc. BFE includes detailed built-in metrics for all subsystems. BFE writes various logs for trouble shooting, data analysis and visualization. BFE also supports distributed tracing.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Devtron

    Devtron

    Tool integration platform for Kubernetes

    Devtron deeply integrates with products across the lifecycle of microservices,i.e., CI, CD, security, cost, debugging, and observability via an intuitive web interface. Devtron is designed to be modular, and its functionality can be easily extended with the help of integrations. Devtron CI/CD with GitOps integration is used to automate the builds and deployments and enables the software development teams to focus on meeting the business requirements, code quality, and security. Devtron leverages Kubernetes auto-scaling and centralized caching to give you unlimited cost-efficient CI workers. Supports pre-CI and post-CI integrations for code quality monitoring. Provides deployment metrics like; deployment frequency, lead time, change failure rate, and mean-time recovery. Seamlessly integrates with Grafana for continuous application metrics like CPU and memory usage, status code, throughput, and latency on the dashboard.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Envoy

    Envoy

    Cloud-native high-performance edge/middle/service proxy

    Envoy is an open source, high-performance edge/middle/service proxy designed for cloud-native applications. It was built by Lyft to solve the common problem of networking and observability when moving to a distributed architecture. Envoy is a proxy designed for single services and applications. Aside from that it is also a communication bus and “universal data plane” designed for large microservice “service mesh” architectures. It runs right along with every application, and abstracts the network by providing common features in a platform-agnostic manner. With Envoy, visualizing problem areas becomes a lot easier thanks to consistent observability. It also helps with overall performance tuning, and easily adding substrate features in one place.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    KubeSphere

    KubeSphere

    The container platform tailored for Kubernetes multi-cloud, datacenter

    KubeSphere is a distributed operating system for cloud-native application management, using Kubernetes as its kernel. It provides a plug-and-play architecture, allowing third-party applications to be seamlessly integrated into its ecosystem. KubeSphere is also a multi-tenant container platform with full-stack automated IT operation and streamlined DevOps workflows. It provides developer-friendly wizard web UI, helping enterprises to build out a more robust and feature-rich platform, which includes most common functionalities needed for enterprise Kubernetes strategy, see Feature List for details. KubeSphere Lite provides you with free, stable, and out-of-the-box managed cluster service. After registration and login, you can easily create a K8s cluster with KubeSphere installed in only 5 seconds and experience feature-rich KubeSphere.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    KubeVela

    KubeVela

    The Modern Application Platform

    KubeVela is a modern software delivery platform that makes deploying and operating applications across today's hybrid, multi-cloud environments easier, faster and more reliable. KubeVela is infrastructure agnostic, programmable, yet most importantly, application-centric. It allows you to build powerful software, and deliver them anywhere. Declare your deployment plan as workflow, run it automatically with any CI/CD or GitOps system, extend or re-program the workflow steps with CUE. Glue and orchestrate all your infrastructure capabilities as reusable modules and share the large growing community addons. No ad-hoc scripts, no dirty glue code, just deploy. The deployment workflow in KubeVela is powered by Open Application Model.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Kyma

    Kyma

    Kyma is an opinionated set of Kubernetes-based modular building blocks

    Kyma is an opinionated set of Kubernetes-based modular building blocks, including all necessary capabilities to develop and run enterprise-grade cloud-native applications. It is the open path to the SAP ecosystem supporting business scenarios end-to-end. Kyma is an actively maintained open-source project supported by SAP. The Kyma project is also a foundation of SAP BTP, Kyma runtime which is a part of SAP Business Technology Platform (BTP). You can use Kyma modules in your own Kubernetes cluster, or try the managed version from SAP BTP with a ready-to-use Kubernetes cluster powered by Gardener. Kyma is built upon leading cloud-native, open-source projects and open standards, such as Istio, NATS, Cloud Events, and Open Telemetry. We created an opinionated set of modules you can easily enable in your Kubernetes cluster to speed up cloud application development and operations. With Kyma, you save the time to pick the right tools and the effort to keep them secure and up to date.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Loggie

    Loggie

    A lightweight, cloud-native data transfer agent and aggregator

    Loggie is a lightweight, high-performance, cloud-native agent and aggregator based on Golang. Loggie includes LogConfig/ClusterLogConfig/Interceptor/Sink CRDs, allowing for the creation of data collection, transfer, processing, and sending pipelines through simple YAML file creation. Supports deployment as an independent intermediate machine, which can receive aggregated data sent by Loggie Agent and can also be used to consume and process various data sources. Configure Filebeat and Loggie to collect logs, and send them to a Kafka topic without using client compression, with the Kafka topic partition configured as 3. With sufficient resources for the Agent specification, modify the number of files collected, the concurrency of the sending client (configure Filebeat worker and Loggie parallelism), and observe their respective CPU, memory, and pod network card transmission rates.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    Micrometer

    Micrometer

    App observability facade for the most popular observability tools

    Micrometer provides a simple facade over the instrumentation clients for the most popular observability systems, allowing you to instrument your JVM-based application code without vendor lock-in. Think SLF4J, but for observability. Micrometer provides vendor-neutral interfaces for timers, gauges, counters, distribution summaries, and long task timers with a dimensional data model that, when paired with a dimensional monitoring system, allows for efficient access to a particular named metric with the ability to drill down across its dimensions. Out-of-the-box instrumentation of caches, the class loader, garbage collection, processor utilization, thread pools, and more tailored to actionable insight. Micrometer is the instrumentation library powering the delivery of application observability from Spring Boot applications.
    Downloads: 1 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    twemproxy

    twemproxy

    A fast, light-weight proxy for memcached and redis

    twemproxy (pronounced "two-em-proxy"), aka nutcracker is a fast and lightweight proxy for memcached and redis protocol. It was built primarily to reduce the number of connections to the caching servers on the backend. This, together with protocol pipelining and sharding enables you to horizontally scale your distributed caching architecture. Fast and lightweight. Maintains persistent server connections. Keeps connection count on the backend caching servers low. Enables pipelining of requests and responses. Supports proxying to multiple servers. Supports multiple server pools simultaneously. Shard data automatically across multiple servers. Implements the complete memcached ascii and redis protocol. Easy configuration of server pools through a YAML file. Supports multiple hashing modes including consistent hashing and distribution. Can be configured to disable nodes on failures.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    SysGauge

    SysGauge

    Comprehensive system monitor: real‑time performance, network, disk

    SysGauge is a powerful system monitoring and performance analysis tool for Windows that delivers deep insights into your computer’s operation. It provides real‑time visualization of key metrics across multiple domains: CPU and memory usage, disk I/O, network traffic, and active processes. The application features a customizable dashboard with live graphs, gauges, and tables, letting you track trends and spot bottlenecks at a glance. SysGauge supports historical data logging for post‑analysis and capacity planning, and offers alerting for critical conditions (e.g., high CPU load, low disk space). It includes advanced process monitoring with detailed resource breakdowns, service status checks, and network connection tracking.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    LibreDiagnostics

    LibreDiagnostics

    A desktop utility for real-time hardware monitoring and diagnostics.

    Monitoring for your hardware components - CPU - GPU - RAM - Drives - Network - Fans Various customization options, such as - display the app as an app bar (Windows) - autostart after you initially logged in (Windows) - docking location[s] on different screens - adjust application font size & width - tailor update interval to your needs - various color options for you to modify - enable or disable specific monitors, hardware or sensors - and more ! S.M.A.R.T. data of your drives You can show various S.M.A.R.T. attributes of your drives, such as health, temperature, power-on hours and more, by simply clicking on the drives name in the drives monitor. Please check out Github for more details and up-to-date new releases / versions ! https://github.com/Blacktempel/LibreDiagnostics For Linux users: you currently have to build the binaries by yourself and functionality may be limited. There are plans to improve this in future, but not on high priority.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Apache APISIX

    Apache APISIX

    The cloud-native API gateway

    Provides rich traffic management features such as load balancing, dynamic upstream, canary release, circuit breaking, authentication, observability, and more. Based on the Nginx library and etcd. Cloud-native microservices API gateway, delivering the ultimate performance, security, open source and scalable platform for all your APIs and microservices. Apache APISIX is based on Nginx and etcd. Compared with traditional API gateways, APISIX has dynamic routing and plug-in hot loading, which is especially suitable for API management under micro-service system. You can use Apache APISIX as a traffic entrance to process all business data, including dynamic routing, dynamic upstream, dynamic certificates, A/B testing, canary release, blue-green deployment, limit rate, defense against malicious attacks, metrics, monitoring alarms, service observability, service governance, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Apache APISIX for Kubernetes

    Apache APISIX for Kubernetes

    APISIX Ingress Controller for Kubernetes

    Apache APISIX provides rich traffic management features like Load Balancing, Dynamic Upstream, Canary Release, Circuit Breaking, Authentication, Observability, etc. Apache APISIX provides open source API Gateway to help you manage microservices, delivering the ultimate performance, security, and scalable platform for all your APIs and microservices. Apache APISIX is the first open-source API Gateway that includes a built-in low-code Dashboard, which offers a powerful and flexible UI for developers to use. The Apache APISIX Dashboard is designed to make it as easy as possible for users to operate Apache APISIX through a frontend interface. It’s opensource and ever-evolving, feel free to contribute.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Apache SkyWalking Rocketbot UI

    Apache SkyWalking Rocketbot UI

    SkyWalking RocketBot UI

    Application performance monitor tool for distributed systems, specially designed for microservices, cloud-native, and container-based (Kubernetes) architectures. End-to-end distributed tracing. Service topology analysis, service-centric observability and API dashboards. Java, .Net Core, PHP, NodeJS, Golang, LUA, Rust, C++, Client JavaScript and Python agents with active development and maintenance. Rover agent works as a metrics collector and profiler powered by eBPF to diagnose CPU and network performance. 100+ billion telemetry data could be collected and analyzed from one SkyWalking cluster. Metrics, Traces, and Logs from mature ecosystems are supported, e.g. Zipkin, OpenTelemetry, Prometheus, Zabbix, Fluentd. BanyanDB, an observability database, created in 2022, aims to ingest, analyze and store telemetry/observability data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Cilium

    Cilium

    eBPF-based networking, security, and observability

    Cilium is open-source software for providing, securing and observing network connectivity between container workloads, cloud-native, and fueled by the revolutionary Kernel technology eBPF. Kubernetes doesn't come with an implementation of Load Balancing. This is usually left as an exercise for your cloud provider or in private cloud environments an exercise for your networking team. Cilium can attract this traffic with BGP and accelerate leveraging XDP and eBPF. Together these technologies provide a very robust and secure implementation of Load Balancing. Cilium and eBPF operate at the kernel layer. With this level of context we can make intelligent decisions about how to connect different workloads whether on the same node or between clusters. With eBPF and XDP Cilium enables significant improvements in latency and performance and eliminates the need for kube-proxy entirely.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Deckhouse

    Deckhouse

    Kubernetes platform from Flant

    Deckhouse is a Kubernetes platform that allows you to create homogeneous K8s clusters on any infrastructure. It manages clusters comprehensively and “automagically” and provides all necessary modules and add-ons for autoscaling, observability, security, and service mesh implementation. Deckhouse has vanilla Kubernetes under the hood and integrates a balanced set of Open Source tools that have become the industry standard. Out-of-the-box secure configuration of the Kubernetes cluster, least component privileges, pre-configured role model, end-to-end object identity in the audit system, and integration with external directory services. Built-in implementation of Pod Security Standards and a ready-to-use, extensible set of recommended policies. Deckhouse automates many routine deployment, scaling, and infrastructure management operations out of the box. It manages system software on the nodes (kernel, CRI, kubelet), basic Kubernetes components (control plane, etc, certificates, etc.).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    EaseAgent

    EaseAgent

    An agent component for the Java system

    A lightweight & opening Java Agent for Cloud-Native and APM system. EaseAgent is the underlying component that provides non-intrusive extensions to applications of the Java ecosystem. EaseAgent can collect distributed application tracing, metrics, and logs, which could be used in the APM system and improve the observability of a distributed system. for the tracing, EaseAgent follows the Google Dapper paper. EaseAgent also can work with Cloud-Native architecture. For example, it can help Service Mesh (especially for EaseMesh ) to do some control panel work. EaseAgent supports plugin mechanism development, which is easy to extend or add new functionality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Elastic APM Node.js Agent

    Elastic APM Node.js Agent

    Elastic APM Node.js Agent

    This is the official Node.js application performance monitoring (APM) agent for the Elastic Observability solution. It is a Node.js package that runs with your Node.js application to automatically capture errors, tracing data, and performance metrics. APM data is sent to your Elastic Observability deployment -- hosted in Elastic's cloud or in your own on-premises deployment -- where you can monitor your application, create alerts, and quick identify root causes of service issues. First, you will need an Elastic Stack deployment. This is a deployment of APM Server (which receives APM data from the APM agent running in your application), Elasticsearch (the database that stores all APM data), and Kibana (the application that provides the interface to visualize and analyze the data). If you do not already have an Elastic deployment to use, follow this APM Quick Start guide to create a free trial on Elastic's cloud.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Elementary

    Elementary

    Open-source data observability for analytics engineers

    Elementary is an open-source data observability solution for data & analytics engineers. Monitor your dbt project and data in minutes, and be the first to know of data issues. Gain immediate visibility, detect data issues, send actionable alerts, and understand the impact and root cause. Generate a data observability report, host it or share with your team. Monitoring of data quality metrics, freshness, volume and schema changes, including anomaly detection. Elementary data monitors are configured and executed like native tests in dbt your project. Uploading and modeling of dbt artifacts, run and test results to tables as part of your runs. Get informative notifications on data issues, schema changes, models and tests failures. Inspect upstream and downstream dependencies to understand impact and root cause of data issues.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Eye

    Eye

    Process monitoring tool. Inspired from Bluepill and God

    Process monitoring tool. Inspired from Bluepill and God. Requires Ruby(MRI) >= 1.9.3-p194. Uses Celluloid and Celluloid::IO. Eye is an image processing and analysis library for quickly analyzing image patterns and features, often used in computer vision tasks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Fluid

    Fluid

    Fluid, elastic data abstraction and acceleration for BigData/AI apps

    Fluid, elastic data abstraction and acceleration for BigData/AI applications in the cloud. Provide DataSet abstraction for underlying heterogeneous data sources with multidimensional management in a cloud environment. Enable dataset warmup and acceleration for data-intensive applications by using a distributed cache in Kubernetes with observability, portability, and scalability. Taking characteristics of application and data into consideration for cloud application/dataset scheduling to improve the performance.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Go gRPC Middleware

    Go gRPC Middleware

    Golang gRPC Middlewares: interceptor chaining, auth, logging, retries

    gRPC Go has support for "interceptors", i.e. middleware that is executed either on the gRPC Server before the request is passed onto the user's application logic, or on the gRPC client either around the user call. It is a perfect way to implement common patterns: auth, logging, tracing, metrics, validation, retries, rate limiting, and more, which can be great generic building blocks that make it easy to build multiple microservices. Especially for observability signals (logging, tracing, metrics) interceptors offer semi-auto-instrumentation that improves the consistency of your observability and allows great correlation techniques (e.g. exemplars and trace ID in logs). Demo-ed in examples. This repository offers ready-to-use middleware that implements gRPC interceptors with examples. In some cases, dedicated projects offer great interceptors, so this repository skips those, and we link them in the interceptors list.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    GoFrame

    GoFrame

    GoFrame is a modular, powerful, high-performance development framework

    GoFrame is a modular, powerful, high-performance, and enterprise-class application development framework of Golang. Modular, loosely coupled design, rich components, out-of-the-box, automatic codes generating for efficiency, simple and easy to use, detailed documentation, interface designed components, with high scalability, fully supported tracing and error stack feature, specially developed and powerful ORM component. Robust engineering design specifications, convenient development CLI tool provide, OpenTelemetry observability features support, OpenAPIV3 documentation generating, automatically, and much, much more...ready to explore?
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Helicone

    Helicone

    Open source LLM-Observability Platform for Developers

    Open source LLM-Observability Platform for Developers. One-line integration for monitoring, metrics, evals, agent tracing, prompt management, playground, etc. Supports OpenAI SDK, Vercel AI SDK, Anthropic SDK, LiteLLM, LLamaIndex, LangChain, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB