Compare the Top Data Lineage Tools that integrate with Cloudera as of December 2025

This a list of Data Lineage tools that integrate with Cloudera. Use the filters on the left to add additional filters for products that have integrations with Cloudera. View the products that work with Cloudera in the table below.

What are Data Lineage Tools for Cloudera?

Data lineage tools are software solutions designed to track and visualize the flow of data through various stages of its lifecycle, from origin to destination. These tools help organizations understand the data's journey, transformations, and dependencies across different systems and processes. They offer features such as data mapping, impact analysis, and auditing to ensure data accuracy, compliance, and governance. By providing detailed insights into data movement and transformations, data lineage tools enable better decision-making, troubleshooting, and optimization of data workflows. They are essential for maintaining data integrity and transparency in complex data environments. Compare and read user reviews of the best Data Lineage tools for Cloudera currently available using the table below. This list is updated regularly.

  • 1
    OvalEdge

    OvalEdge

    OvalEdge

    OvalEdge is a cost-effective data catalog designed for end-to-end data governance, privacy compliance, and fast, trustworthy analytics. OvalEdge crawls your organizations’ databases, BI platforms, ETL tools, and data lakes to create an easy-to-access, smart inventory of your data assets. Using OvalEdge, analysts can discover data and deliver powerful insights quickly. OvalEdge’s comprehensive functionality enables users to establish and improve data access, data literacy, and data quality.
    Starting Price: $1,300/month
  • 2
    Alation

    Alation

    Alation

    The Alation Agentic Data Intelligence Platform enables organizations to scale and accelerate their AI and data initiatives. By unifying search, cataloging, governance, lineage, and analytics, it transforms metadata into a strategic asset for decision-making. The platform’s AI-powered agents—including Documentation, Data Quality, and Data Products Builder—automate complex data management tasks. With active metadata, workflow automation, and more than 120 pre-built connectors, Alation integrates seamlessly into modern enterprise environments. It helps organizations build trusted AI models by ensuring data quality, transparency, and compliance across the business. Trusted by 40% of the Fortune 100, Alation empowers teams to make faster, more confident decisions with trusted data.
  • 3
    Immuta

    Immuta

    Immuta

    Immuta is the market leader in secure Data Access, providing data teams one universal platform to control access to analytical data sets in the cloud. Only Immuta can automate access to data by discovering, securing, and monitoring data. Data-driven organizations around the world trust Immuta to speed time to data, safely share more data with more users, and mitigate the risk of data leaks and breaches. Founded in 2015, Immuta is headquartered in Boston, MA. Immuta is the fastest way for algorithm-driven enterprises to accelerate the development and control of machine learning and advanced analytics. The company's hyperscale data management platform provides data scientists with rapid, personalized data access to dramatically improve the creation, deployment and auditability of machine learning and AI.
  • 4
    Ataccama ONE
    Ataccama reinvents the way data is managed to create value on an enterprise scale. Unifying Data Governance, Data Quality, and Master Data Management into a single, AI-powered fabric across hybrid and Cloud environments, Ataccama gives your business and data teams the ability to innovate with unprecedented speed while maintaining trust, security, and governance of your data.
  • 5
    PHEMI Health DataLab
    The PHEMI Trustworthy Health DataLab is a unique, cloud-based, integrated big data management system that allows healthcare organizations to enhance innovation and generate value from healthcare data by simplifying the ingestion and de-identification of data with NSA/military-grade governance, privacy, and security built-in. Conventional products simply lock down data, PHEMI goes further, solving privacy and security challenges and addressing the urgent need to secure, govern, curate, and control access to privacy-sensitive personal healthcare information (PHI). This improves data sharing and collaboration inside and outside of an enterprise—without compromising the privacy of sensitive information or increasing administrative burden. PHEMI Trustworthy Health DataLab can scale to any size of organization, is easy to deploy and manage, connects to hundreds of data sources, and integrates with popular data science and business analysis tools.
  • 6
    Talend Data Catalog
    Talend Data Catalog gives your organization a single, secure point of control for your data. With robust tools for search and discovery, and connectors to extract metadata from virtually any data source, Data Catalog makes it easy to protect your data, govern your analytics, manage data pipelines, and accelerate your ETL processes. Data Catalog automatically crawls, profiles, organizes, links, and enriches all your metadata. Up to 80% of the information associated with the data is documented automatically and kept up-to-date through smart relationships and machine learning, continually delivering the most current data to the user. Make data governance a team sport with a secure single point of control where you can collaborate to improve data accessibility, accuracy, and business relevance. Support data privacy and regulatory compliance with intelligent data lineage tracing and compliance tracking.
  • Previous
  • You're on page 1
  • Next