Observability Tools for Linux

View 44 business solutions

Browse free open source Observability tools and projects for Linux below. Use the toggles on the left to filter open source Observability tools by OS, license, language, programming language, and project status.

  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    Grafana

    Grafana

    Leading open-source visualization and observability platform

    Grafana OSS is the leading open-source platform for visualization and observability. It enables teams to query, visualize, alert on, and explore telemetry data from multiple sources in a single interface. With support for 100+ data source plugins—including Prometheus, Loki, Elasticsearch, InfluxDB, SQL/NoSQL databases, and OpenTelemetry—Grafana helps teams correlate metrics, logs, and traces across applications and infrastructure. Users can build interactive dashboards with rich visualizations, template variables, and reusable panels to monitor systems and troubleshoot issues in real time. Grafana includes capabilities such as ad hoc data exploration, alerting, annotations, and flexible query support. Its extensible plugin ecosystem integrates with cloud platforms, databases, and developer tools—allowing teams to build observability workflows without vendor lock-in. The easiest way to get started with Grafana is with Grafana Cloud, our fully managed, full-stack observability platform.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 2
    Conduit

    Conduit

    Conduit streams data between data stores. Kafka Connect replacement

    Conduit is a data streaming tool written in Go. It aims to provide the best user experience for building and running real-time data pipelines. Conduit comes with batteries included, it provides a UI, common connectors, processors and observability data out of the box. Sync data between your production systems using an extensible, event-first experience with minimal dependencies that fit within your existing workflow. Eliminate the multi-step process you go through today. Just download the binary and start building. Conduit connectors give you the ability to pull and push data to any production datastore you need. If a datastore is missing, the simple SDK allows you to extend Conduit where you need it. Conduit pipelines listen for changes to a database, data warehouse, etc., and allows your data applications to act upon those changes in real-time. Run it in a way that works for you; use it as a standalone service or orchestrate it within your infrastructure.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 3
    OneUptime

    OneUptime

    OneUptime is the complete open-source observability platform

    OneUptime is a comprehensive solution for monitoring and managing your online services. Whether you need to check the availability of your website, dashboard, API, or any other online resource, OneUptime can alert your team when downtime happens and keep your customers informed with a status page. OneUptime also helps you handle incidents, set up on-call rotations, run tests, secure your services, analyze logs, track performance, and debug errors.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 4
    Jaeger

    Jaeger

    Monitor and troubleshoot transactions in complex distributed systems

    As on-the-ground microservice practitioners are quickly realizing, the majority of operational problems that arise when moving to a distributed architecture are ultimately grounded in two areas: networking and observability. It is simply an orders of magnitude larger problem to network and debug a set of intertwined distributed services versus a single monolithic application. Jaeger, inspired by Dapper and OpenZipkin, is a distributed tracing system released as open source by Uber Technologies. It is used for monitoring and troubleshooting microservices-based distributed systems. OpenTracing compatible data model and instrumentation libraries include Go, Java, Node, Python, C++ and C#. Jaeger uses consistent upfront sampling with individual per service/endpoint probabilities and it has multiple storage backends: Cassandra, Elasticsearch, memory.
    Downloads: 8 This Week
    Last Update:
    See Project
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • 5
    OpenClaw Opik Observability Plugin

    OpenClaw Opik Observability Plugin

    Official plugin for OpenClaw that exports agent traces to Opik

    OpenClaw Opik Observability Plugin is an open-source plugin designed to add observability and monitoring capabilities to OpenClaw autonomous AI agents by exporting operational traces to the Opik observability platform. The project integrates directly with OpenClaw’s plugin architecture so that developers can capture detailed runtime information about how their agents behave while executing tasks. Each time an AI agent performs an action—such as calling a large language model, invoking a tool, accessing memory, or delegating to a sub-agent—the plugin records the full interaction and sends it to Opik for analysis and visualization. This allows developers to inspect inputs, outputs, token usage, latency, and execution flow across complex multi-step agent workflows. The goal of the project is to provide transparency into the internal reasoning and operational pipeline of agent systems so developers can diagnose failures, control costs, and improve reliability.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    Opik

    Opik

    Debug, evaluate, and monitor your LLMapps, RAG systems, and agentic AI

    Confidently evaluate, test, and monitor LLM applications. Opik is an open-source platform for evaluating, testing, and monitoring LLM applications. Built by Comet. Record, sort, search, and understand each step your LLM app takes to generate a response. Manually annotate, view, and compare LLM responses in a user-friendly table. Log traces during development and in production. Run experiments with different prompts and evaluate against a test set. Choose and run pre-configured evaluation metrics or define your own with our convenient SDK library. Consult built-in LLM judges for complex issues like hallucination detection, factuality, and moderation.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 7
    Coroot

    Coroot

    Open-source observability for microservices

    Collecting metrics, logs, and traces alone doesn't make your applications observable. Coroot turns that data into actionable insights for you. Enable system observability in minutes, no code changes required. Each release is automatically compared with the previous one, so you'll never miss even the slightest performance degradation. With integrated Cost Monitoring, developers can track how each change affects their cloud bill. Understand your cloud costs down to any given application. Doesn't require access to your cloud account or any other configurations. Analyze any unexpected spike in CPU or memory usage down to the precise line of code. Don't make assumptions, know exactly what the resources were spent on. Easily investigate any anomaly by comparing it to the system's baseline behavior.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    Envoy

    Envoy

    Cloud-native high-performance edge/middle/service proxy

    Envoy is an open source, high-performance edge/middle/service proxy designed for cloud-native applications. It was built by Lyft to solve the common problem of networking and observability when moving to a distributed architecture. Envoy is a proxy designed for single services and applications. Aside from that it is also a communication bus and “universal data plane” designed for large microservice “service mesh” architectures. It runs right along with every application, and abstracts the network by providing common features in a platform-agnostic manner. With Envoy, visualizing problem areas becomes a lot easier thanks to consistent observability. It also helps with overall performance tuning, and easily adding substrate features in one place.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    Grafana Pyroscope

    Grafana Pyroscope

    Continuous Profiling Platform. Debug performance issues

    Find and debug your most painful performance issues across code, infrastructure and CI/CD pipelines. Let you tag your data on the dimensions important for your organization. Allows you to store large volumes of high cardinality profiling data cheaply and efficiently. FlameQL enables custom queries to select and aggregate profiles quickly and efficiently for easy analysis. Analyze application performance profiles using our suite of profiling tools. Understand usage of CPU and memory resources at any point in time and identify performance issue before your customer do. Collect, store, and analyze profiles from various external profiling tools in one central location. Link to your Open Telemetry tracing data and get request-specific or span-specific profiles to enhance other observability data like traces and logs.
    Downloads: 5 This Week
    Last Update:
    See Project
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 10
    LangCheck

    LangCheck

    Simple, Pythonic building blocks to evaluate LLM applications

    Simple, Pythonic building blocks to evaluate LLM applications.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    Lunary

    Lunary

    The production toolkit for LLMs. Observability, prompt management

    Lunary helps developers of LLM Chatbots develop and improve them.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 12
    OpenLIT

    OpenLIT

    OpenLIT is an open-source LLM Observability tool

    OpenLIT is an OpenTelemetry-native tool designed to help developers gain insights into the performance of their LLM applications in production. It automatically collects LLM input and output metadata and monitors GPU performance for self-hosted LLMs. OpenLIT makes integrating observability into GenAI projects effortless with just a single line of code. Whether you're working with popular LLM providers such as OpenAI and HuggingFace, or leveraging vector databases like ChromaDB, OpenLIT ensures your applications are monitored seamlessly, providing critical insights including GPU performance stats for self-hosted LLMs to improve performance and reliability. This project proudly follows the Semantic Conventions of the OpenTelemetry community, consistently updating to align with the latest standards in observability.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    Vector

    Vector

    A high-performance observability data pipeline

    Vector is a Rust‑based, high‑performance observability data pipeline tool (agent + aggregator) designed to collect, transform, and route logs and metrics at scale. Created by Datadog, it aims to be the only tool needed from ingestion to vendor output, providing cost-efficient, safe, and flexible telemetry processing.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    Dagster

    Dagster

    An orchestration platform for the development, production

    Dagster is an orchestration platform for the development, production, and observation of data assets. Dagster as a productivity platform: With Dagster, you can focus on running tasks, or you can identify the key assets you need to create using a declarative approach. Embrace CI/CD best practices from the get-go: build reusable components, spot data quality issues, and flag bugs early. Dagster as a robust orchestration engine: Put your pipelines into production with a robust multi-tenant, multi-tool engine that scales technically and organizationally. Dagster as a unified control plane: The ‘single plane of glass’ data teams love to use. Rein in the chaos and maintain control over your data as the complexity scales. Centralize your metadata in one tool with built-in observability, diagnostics, cataloging, and lineage. Spot any issues and identify performance improvement opportunities.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 15
    Devtron

    Devtron

    Tool integration platform for Kubernetes

    Devtron deeply integrates with products across the lifecycle of microservices,i.e., CI, CD, security, cost, debugging, and observability via an intuitive web interface. Devtron is designed to be modular, and its functionality can be easily extended with the help of integrations. Devtron CI/CD with GitOps integration is used to automate the builds and deployments and enables the software development teams to focus on meeting the business requirements, code quality, and security. Devtron leverages Kubernetes auto-scaling and centralized caching to give you unlimited cost-efficient CI workers. Supports pre-CI and post-CI integrations for code quality monitoring. Provides deployment metrics like; deployment frequency, lead time, change failure rate, and mean-time recovery. Seamlessly integrates with Grafana for continuous application metrics like CPU and memory usage, status code, throughput, and latency on the dashboard.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    Fluid

    Fluid

    Fluid, elastic data abstraction and acceleration for BigData/AI apps

    Fluid, elastic data abstraction and acceleration for BigData/AI applications in the cloud. Provide DataSet abstraction for underlying heterogeneous data sources with multidimensional management in a cloud environment. Enable dataset warmup and acceleration for data-intensive applications by using a distributed cache in Kubernetes with observability, portability, and scalability. Taking characteristics of application and data into consideration for cloud application/dataset scheduling to improve the performance.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    OpenTelemetry Collector

    OpenTelemetry Collector

    OpenTelemetry Collector

    The OpenTelemetry Collector offers a vendor-agnostic implementation on how to receive, process, and export telemetry data. In addition, it removes the need to run, operate, and maintain multiple agents/collectors in order to support open-source telemetry data formats (e.g. Jaeger, Prometheus, etc.) to multiple open-source or commercial back-ends.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    qryn

    qryn

    All-in-one Polyglot Observability stack with ClickHouse storage

    All the greatest observability formats and integrations you love, at once - LGTM Drop-in compatible. Let's get Polyglot. qryn independently implements popular observability standards, protocols and query languages. Make sure you have sufficient memory and disk resources allocated for your node service and clickhouse server when dealing with large amounts of data and fingerprints. We suggest 8GB RAM or higher for most setups with 100k-1M fingerprints. Observe your daily and weekly data consumption to forecast your disk usage requirements. Compression codecs and other optimizations can be performed at the ClickHouse level.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    Eye

    Eye

    Process monitoring tool. Inspired from Bluepill and God

    Process monitoring tool. Inspired from Bluepill and God. Requires Ruby(MRI) >= 1.9.3-p194. Uses Celluloid and Celluloid::IO. Eye is an image processing and analysis library for quickly analyzing image patterns and features, often used in computer vision tasks.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    KubeSphere

    KubeSphere

    The container platform tailored for Kubernetes multi-cloud, datacenter

    KubeSphere is a distributed operating system for cloud-native application management, using Kubernetes as its kernel. It provides a plug-and-play architecture, allowing third-party applications to be seamlessly integrated into its ecosystem. KubeSphere is also a multi-tenant container platform with full-stack automated IT operation and streamlined DevOps workflows. It provides developer-friendly wizard web UI, helping enterprises to build out a more robust and feature-rich platform, which includes most common functionalities needed for enterprise Kubernetes strategy, see Feature List for details. KubeSphere Lite provides you with free, stable, and out-of-the-box managed cluster service. After registration and login, you can easily create a K8s cluster with KubeSphere installed in only 5 seconds and experience feature-rich KubeSphere.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    LangKit

    LangKit

    An open-source toolkit for monitoring Language Learning Models (LLMs)

    LangKit is an open-source text metrics toolkit for monitoring language models. It offers an array of methods for extracting relevant signals from the input and/or output text, which are compatible with the open-source data logging library whylogs. Productionizing language models, including LLMs, comes with a range of risks due to the infinite amount of input combinations, which can elicit an infinite amount of outputs. The unstructured nature of text poses a challenge in the ML observability space - a challenge worth solving, since the lack of visibility on the model's behavior can have serious consequences.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    Loggie

    Loggie

    A lightweight, cloud-native data transfer agent and aggregator

    Loggie is a lightweight, high-performance, cloud-native agent and aggregator based on Golang. Loggie includes LogConfig/ClusterLogConfig/Interceptor/Sink CRDs, allowing for the creation of data collection, transfer, processing, and sending pipelines through simple YAML file creation. Supports deployment as an independent intermediate machine, which can receive aggregated data sent by Loggie Agent and can also be used to consume and process various data sources. Configure Filebeat and Loggie to collect logs, and send them to a Kafka topic without using client compression, with the Kafka topic partition configured as 3. With sufficient resources for the Agent specification, modify the number of files collected, the concurrency of the sending client (configure Filebeat worker and Loggie parallelism), and observe their respective CPU, memory, and pod network card transmission rates.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    OTOMI

    OTOMI

    Self-hosted DevOps Platform for Kubernetes

    Otomi is an open source self-hosted PaaS to run on top of any Kubernetes cluster and is placed in the CNCF landscape under the PaaS/Container Service section. A PaaS attempts to connect many of the technologies found in the CNCF landscape in a way to provide direct value. Deploy containerized apps with a few click without writing any K8s YAML manifests. Get access to logs and metrics of deployed apps. Store charts and images in a private registry. Build and run custom CI pipelines. Enable declarative end-to-end app lifecycle management. Configure ingress for apps with a single click. Manage your own secrets. Onboard development teams on shared clusters in a comprehensive multi-tenant setup. Get all the required observability tools in an integrated way. Ensure governance with security policies. Implement zero-trust networking with east-west and north-south network control within K8s. Provide self-service features to development teams.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 24
    OpenObserve

    OpenObserve

    Elasticsearch/Splunk/Datadog alternative for (logs, metrics, traces)

    OpenObserve is a cloud-native observability platform built specifically for logs, metrics, traces, and analytics designed to work at a petabyte scale. It is very simple and easy to operate as opposed to Elasticsearch which requires a couple of dozen knobs to understand and tune which you can get up and running in under 2 minutes. It is a drop-in replacement for Elasticsearch if you are just ingesting data using APIs and searching using Kibana (Kibana is not supported nor required with OpenObserve. OpenObserve provides its own UI which does not require separate installation unlike Kibana). You can reduce your log storage costs by ~140x compared to Elasticsearch by using OpenObserve. Below are the results when we pushed logs from our production Kubernetes cluster to Elasticsearch and OpenObserve using fluent bit. OpenObserve stored data in Amazon s3 and Elasticsearch stored data on Amazon EBS volumes.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    SigNoz

    SigNoz

    SigNoz is an open-source APM. It helps developers monitor their apps

    Monitor your applications and troubleshoot problems in your deployed applications, an open-source alternative to DataDog, New Relic, etc. SigNoz helps developers monitor applications and troubleshoot problems in their deployed applications. SigNoz uses distributed tracing to gain visibility into your software stack. Visualise Metrics, Traces and Logs in a single pane of glass. You can see metrics like p99 latency, error rates for your services, external API calls and individual end points. You can find the root cause of the problem by going to the exact traces which are causing the problem and see detailed flamegraphs of individual request traces. Run aggregates on trace data to get business relevant metrics. Filter and query logs, build dashboards and alerts based on attributes in logs.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next
MongoDB Logo MongoDB