Compare the Top Data Lineage Tools that integrate with Looker as of May 2026

This a list of Data Lineage tools that integrate with Looker. Use the filters on the left to add additional filters for products that have integrations with Looker. View the products that work with Looker in the table below.

What are Data Lineage Tools for Looker?

Data lineage tools are software solutions designed to track and visualize the flow of data through various stages of its lifecycle, from origin to destination. These tools help organizations understand the data's journey, transformations, and dependencies across different systems and processes. They offer features such as data mapping, impact analysis, and auditing to ensure data accuracy, compliance, and governance. By providing detailed insights into data movement and transformations, data lineage tools enable better decision-making, troubleshooting, and optimization of data workflows. They are essential for maintaining data integrity and transparency in complex data environments. Compare and read user reviews of the best Data Lineage tools for Looker currently available using the table below. This list is updated regularly.

  • 1
    Sifflet

    Sifflet

    Sifflet

    Automatically cover thousands of tables with ML-based anomaly detection and 50+ custom metrics. Comprehensive data and metadata monitoring. Exhaustive mapping of all dependencies between assets, from ingestion to BI. Enhanced productivity and collaboration between data engineers and data consumers. Sifflet seamlessly integrates into your data sources and preferred tools and can run on AWS, Google Cloud Platform, and Microsoft Azure. Keep an eye on the health of your data and alert the team when quality criteria aren’t met. Set up in a few clicks the fundamental coverage of all your tables. Configure the frequency of runs, their criticality, and even customized notifications at the same time. Leverage ML-based rules to detect any anomaly in your data. No need for an initial configuration. A unique model for each rule learns from historical data and from user feedback. Complement the automated rules with a library of 50+ templates that can be applied to any asset.
  • 2
    Datameer

    Datameer

    Datameer

    Datameer revolutionizes data transformation with a low-code approach, trusted by top global enterprises. Craft, transform, and publish data seamlessly with no code and SQL, simplifying complex data engineering tasks. Empower your data teams to make informed decisions confidently while saving costs and ensuring responsible self-service analytics. Speed up your analytics workflow by transforming datasets to answer ad-hoc questions and support operational dashboards. Empower everyone on your team with our SQL or Drag-and-Drop to transform your data in an intuitive and collaborative workspace. And best of all, everything happens in Snowflake. Datameer is designed and optimized for Snowflake to reduce data movement and increase platform adoption. Some of the problems Datameer solves: - Analytics is not accessible - Drowning in backlog - Long development
  • 3
    Decube

    Decube

    Decube

    Decube is a data management platform that helps organizations manage their data observability, data catalog, and data governance needs. It provides end-to-end visibility into data and ensures its accuracy, consistency, and trustworthiness. Decube's platform includes data observability, a data catalog, and data governance components that work together to provide a comprehensive solution. The data observability tools enable real-time monitoring and detection of data incidents, while the data catalog provides a centralized repository for data assets, making it easier to manage and govern data usage and access. The data governance tools provide robust access controls, audit reports, and data lineage tracking to demonstrate compliance with regulatory requirements. Decube's platform is customizable and scalable, making it easy for organizations to tailor it to meet their specific data management needs and manage data across different systems, data sources, and departments.
  • 4
    Masthead

    Masthead

    Masthead

    See the impact of data issues without running SQL. We analyze your logs and metadata to identify freshness and volume anomalies, schema changes in tables, pipeline errors, and their blast radius effects on your business. Masthead observes every table, process, script, and dashboard in the data warehouse and connected BI tools for anomalies, alerting data teams in real time if any data failures occur. Masthead shows the origin and implications of data anomalies and pipeline errors on data consumers. Masthead maps data issues on lineage, so you can troubleshoot within minutes, not hours. We get a comprehensive view of all processes in GCP without giving access to our data was a game-changer for us. It saved us both time and money. Gain visibility into the cost of each pipeline running in your cloud, regardless of ETL. Masthead also has AI-powered recommendations to help you optimize your models and queries. It takes 15 min to connect Masthead to all assets in your data warehouse.
    Starting Price: $899 per month
  • 5
    Secoda

    Secoda

    Secoda

    With Secoda AI on top of your metadata, you can now get contextual search results from across your tables, columns, dashboards, metrics, and queries. Secoda AI can also help you generate documentation and queries from your metadata, saving your team hundreds of hours of mundane work and redundant data requests. Easily search across all columns, tables, dashboards, events, and metrics. AI-powered search lets you ask any question to your data and get a contextual answer, fast. Get answers to questions. Integrate data discovery into your workflow without disrupting it with our API. Perform bulk updates, tag PII data, manage tech debt, build custom integrations, identify the least used resources, and more. Eliminate manual error and have total trust in your knowledge repository.
    Starting Price: $50 per user per month
  • 6
    Y42

    Y42

    Datos-Intelligence GmbH

    Y42 is the first fully managed Modern DataOps Cloud. It is purpose-built to help companies easily design production-ready data pipelines on top of their Google BigQuery or Snowflake cloud data warehouse. Y42 provides native integration of best-of-breed open-source data tools, comprehensive data governance, and better collaboration for data teams. With Y42, organizations enjoy increased accessibility to data and can make data-driven decisions quickly and efficiently.
  • 7
    Select Star

    Select Star

    Select Star

    Set up your automated data catalog in just 15 minutes, and receive column-level lineage, Entity Relationship (ER) diagram, and auto-populated documentation within 24 hours. Easily find, tag, and add documentation to your data so everyone can find the right dataset for their use case. Select Star automatically detects and displays your column-level data lineage. You can now trust the data, knowing where it came from. Select Star automatically surfaces how your company uses data. That means you can identify relevant data fields without needing to ask someone else. Select Star treats your data with AICPA SOC 2 Security, Confidentiality, and Availability standards, making sure your data is always safe and sound.
    Starting Price: $270 per month
  • 8
    Metaplane

    Metaplane

    Metaplane

    Monitor your entire warehouse in 30 minutes. Identify downstream impact with automated warehouse-to-BI lineage. Trust takes seconds to lose and months to regain. Gain peace of mind with observability built for the modern data era. Code-based tests take hours to write and maintain, so it's hard to achieve the coverage you need. In Metaplane, you can add hundreds of tests within minutes. We support foundational tests (e.g. row counts, freshness, and schema drift), more complex tests (distribution drift, nullness shifts, enum changes), custom SQL, and everything in between. Manual thresholds take a long time to set and quickly go stale as your data changes. Our anomaly detection models learn from historical metadata to automatically detect outliers. Monitor what matters, all while accounting for seasonality, trends, and feedback from your team to minimize alert fatigue. Of course, you can override with manual thresholds, too.
    Starting Price: $825 per month
  • 9
    Blindata

    Blindata

    Blindata

    Blindata covers all the functions of a Data Governance program: Business Glossary, Data Catalog & Data Lineage build an integrated and complete view on your Data. Data Classification module gives a semantic meaning to the data while the Data Quality, Issue Management & Data Stewardship modules improve the reliability and trust on data. Moreover, privacy compliance can leverage specific features: registry of processing activities, centralized privacy note management, consent registry with Blockchain integrated notarization. Blindata Agent can connect to different data sources, collecting metadata such data structures (Tables, Views, Fields, …), data quality metrics, reverse lineage, etc. Blindata has a modular and entirely API based architecture allowing systematic integration with the most critical business systems (DBMS, Active Directory, e-commerce, Data Platforms). Blindata is available as SaaS, can be installed “on Premise” or purchased on AWS Marketplace.
    Starting Price: $1000/year/user
  • 10
    Foundational

    Foundational

    Foundational

    Identify code and optimization issues in real-time, prevent data incidents pre-deploy, and govern data-impacting code changes end to end—from the operational database to the user-facing dashboard. Automated, column-level data lineage, from the operational database all the way to the reporting layer, ensures every dependency is analyzed. Foundational automates data contract enforcement by analyzing every repository from upstream to downstream, directly from source code. Use Foundational to proactively identify code and data issues, find and prevent issues, and create controls and guardrails. Foundational can be set up in minutes with no code changes required.
  • 11
    Trifacta

    Trifacta

    Trifacta

    The fastest way to prep data and build data pipelines in the cloud. Trifacta provides visual and intelligent guidance to accelerate data preparation so you can get to insights faster. Poor data quality can sink any analytics project. Trifacta helps you understand your data so you can quickly and accurately clean it up. All the power with none of the code. Trifacta provides visual and intelligent guidance so you can get to insights faster. Manual, repetitive data preparation processes don’t scale. Trifacta helps you build, deploy and manage self-service data pipelines in minutes not months.
  • 12
    DataGalaxy

    DataGalaxy

    DataGalaxy

    DataGalaxy is a next-generation data governance and intelligence platform designed to help organizations manage, understand, and maximize the value of their data. Built around a unified interface, it empowers everyone—from executives to data consumers—to collaborate seamlessly across data assets, strategies, and analytics. The platform’s automated data catalog, governance hub, and AI co-pilot reduce manual work while ensuring compliance and data quality across systems. With over 70+ integrations, including Snowflake, Databricks, Power BI, and AWS, DataGalaxy connects your data ecosystem into a single source of truth. Its value tracking center and strategy cockpit align data initiatives with business goals, driving measurable outcomes and enterprise-wide visibility. Loved by users, DataGalaxy turns governance into a strategic advantage for the modern enterprise.
  • 13
    Dremio

    Dremio

    Dremio

    Dremio delivers lightning-fast queries and a self-service semantic layer directly on your data lake storage. No moving data to proprietary data warehouses, no cubes, no aggregation tables or extracts. Just flexibility and control for data architects, and self-service for data consumers. Dremio technologies like Data Reflections, Columnar Cloud Cache (C3) and Predictive Pipelining work alongside Apache Arrow to make queries on your data lake storage very, very fast. An abstraction layer enables IT to apply security and business meaning, while enabling analysts and data scientists to explore data and derive new virtual datasets. Dremio’s semantic layer is an integrated, searchable catalog that indexes all of your metadata, so business users can easily make sense of your data. Virtual datasets and spaces make up the semantic layer, and are all indexed and searchable.
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB