Compare the Top Data Lineage Tools in the UK as of November 2024

What are Data Lineage Tools in the UK?

Data lineage tools are software solutions designed to track and visualize the flow of data through various stages of its lifecycle, from origin to destination. These tools help organizations understand the data's journey, transformations, and dependencies across different systems and processes. They offer features such as data mapping, impact analysis, and auditing to ensure data accuracy, compliance, and governance. By providing detailed insights into data movement and transformations, data lineage tools enable better decision-making, troubleshooting, and optimization of data workflows. They are essential for maintaining data integrity and transparency in complex data environments. Compare and read user reviews of the best Data Lineage tools in the UK currently available using the table below. This list is updated regularly.
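To make the ideas of data mapping and impact analysis concrete, here is a minimal sketch that treats lineage as a directed graph of datasets. The dataset names, the `EDGES` list, and the helper functions are all hypothetical illustrations, not the API of any tool listed on this page.

```python
from collections import defaultdict, deque

# Hypothetical lineage edges: (source dataset, dataset derived from it).
EDGES = [
    ("crm.customers", "staging.customers"),
    ("erp.orders", "staging.orders"),
    ("staging.customers", "warehouse.customer_orders"),
    ("staging.orders", "warehouse.customer_orders"),
    ("warehouse.customer_orders", "bi.revenue_dashboard"),
]


def build_graph(edges, reverse=False):
    """Build an adjacency map; reverse=True points from derived data back to sources."""
    graph = defaultdict(set)
    for src, dst in edges:
        if reverse:
            graph[dst].add(src)
        else:
            graph[src].add(dst)
    return graph


def reachable(graph, start):
    """Breadth-first search returning every node reachable from start (excluding start)."""
    seen = set()
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in graph.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


def impact_analysis(edges, dataset):
    """Downstream datasets that could be affected if `dataset` changes."""
    return reachable(build_graph(edges), dataset)


def trace_origins(edges, dataset):
    """Upstream datasets that `dataset` ultimately depends on."""
    return reachable(build_graph(edges, reverse=True), dataset)


if __name__ == "__main__":
    print(sorted(impact_analysis(EDGES, "erp.orders")))
    print(sorted(trace_origins(EDGES, "bi.revenue_dashboard")))
```

Commercial tools typically derive such edges automatically (for example by scanning SQL, ETL jobs, or query logs) and often track lineage at column level; the sketch only shows the graph traversal that underlies impact and root-cause analysis.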

  • 1
    DataBuck

    FirstEigen

    “I don’t have confidence and trust in our data. We keep discovering hidden risks.” (Bank CFO) Since 70% of data initiatives fail due to unreliable data (Gartner research), are you risking your reputation by trusting the accuracy of the data you share with your business stakeholders and partners? Data Trust Scores must be measured in data lakes, warehouses, and throughout the pipeline to ensure the data is trustworthy and fit for use. It typically takes 4-6 weeks of manual effort just to set up a file or table for validation. Then the rules have to be constantly updated as the data evolves. The only scalable option is to automate the discovery and maintenance of data validation rules. DataBuck is an autonomous, self-learning Data Observability, Quality, Trustability and Data Matching tool. It reduces effort by 90% and errors by 70%. "What took my team of 10 engineers 2 years to do, DataBuck could complete in less than 8 hours." (VP, Enterprise Data Office, a US bank)
  • 2
    MANTA

    Manta

    Manta is a world-class automated approach to visualizing, optimizing, and modernizing how data moves through your organization, based on code-level lineage. By automatically scanning your data environment with 50+ out-of-the-box scanners, Manta builds a map of all data pipelines to drive efficiency and productivity. Visit manta.io to learn more. With the Manta platform, you can make your data a truly enterprise-wide asset, bridge the understanding gap, enable self-service, and easily:
    • Increase productivity
    • Accelerate development
    • Shorten time-to-market
    • Reduce costs and manual effort
    • Run instant and accurate root cause and impact analyses
    • Scope and perform effective cloud migrations
    • Improve data governance and regulatory compliance (GDPR, CCPA, HIPAA, and more)
    • Increase data quality
    • Enhance data privacy and data security
  • 3
    CloverDX

    CloverDX

    Design, debug, run and troubleshoot data transformations and jobflows in a developer-friendly visual designer. Orchestrate data workloads that require tasks to be carried out in the right sequence, and coordinate multiple systems with the transparency of visual workflows. Deploy data workloads easily into a robust enterprise runtime environment, in the cloud or on-premises. Make data available to people, applications and storage, and manage your data workloads and related processes together, in a single unified platform. No task is too complex. We’ve built CloverDX on years of experience with large enterprise projects. A developer-friendly open architecture and flexibility let you package and hide the complexity for non-technical users. Manage the entire lifecycle of a data pipeline, from design and deployment to evolution and testing. Get things done fast with the help of our in-house customer success teams.
    Starting Price: $5000.00/one-time
  • 4
    OvalEdge

    OvalEdge

    OvalEdge is a cost-effective data catalog designed for end-to-end data governance, privacy compliance, and fast, trustworthy analytics. OvalEdge crawls your organization’s databases, BI platforms, ETL tools, and data lakes to create an easy-to-access, smart inventory of your data assets. Using OvalEdge, analysts can discover data and deliver powerful insights quickly. OvalEdge’s comprehensive functionality enables users to establish and improve data access, data literacy, and data quality.
    Starting Price: $1,300/month
  • 5
    Alation

    Alation

    Alation is the first company to bring a data catalog to market. It radically improves how people find, understand, trust, use, and reuse data. Alation pioneered active, non-invasive data governance, which supports both data democratization and compliance at scale, so people have the data they need alongside guidance on how to use it correctly. By combining human insight with AI and machine learning, Alation tackles the toughest challenges in data today. More than 350 enterprises use Alation to make confident, data-driven decisions. American Family Insurance, Exelon, Munich Re, and Pfizer are all proud customers.
  • 6
    Microsoft Purview
    Microsoft Purview is a unified data governance service that helps you manage and govern your on-premises, multicloud, and software-as-a-service (SaaS) data. Easily create a holistic, up-to-date map of your data landscape with automated data discovery, sensitive data classification, and end-to-end data lineage. Empower data consumers to find valuable, trustworthy data. Automated data discovery, lineage identification, and data classification across on-premises, multicloud, and SaaS sources. Unified map of your data assets and their relationships for more effective governance. Semantic search enables data discovery using business or technical terms. Insight into the location and movement of sensitive data across your hybrid data landscape. Establish the foundation for effective data usage and governance with Purview Data Map. Automate and manage metadata from hybrid sources. Classify data using built-in and custom classifiers and Microsoft Information Protection sensitivity labels.
    Starting Price: $0.342
  • 7
    Datameer

    Datameer

    Datameer revolutionizes data transformation with a low-code approach, trusted by top global enterprises. Craft, transform, and publish data seamlessly with no-code and SQL, simplifying complex data engineering tasks. Empower your data teams to make informed decisions confidently while saving costs and ensuring responsible self-service analytics. Speed up your analytics workflow by transforming datasets to answer ad-hoc questions and support operational dashboards. Empower everyone on your team to transform data with SQL or drag-and-drop in an intuitive, collaborative workspace. And best of all, everything happens in Snowflake. Datameer is designed and optimized for Snowflake to reduce data movement and increase platform adoption. Some of the problems Datameer solves: - Analytics is not accessible - Drowning in backlog - Long development
  • 8
    Immuta

    Immuta

    Immuta is the market leader in secure Data Access, providing data teams one universal platform to control access to analytical data sets in the cloud. Only Immuta can automate access to data by discovering, securing, and monitoring data. Data-driven organizations around the world trust Immuta to speed time to data, safely share more data with more users, and mitigate the risk of data leaks and breaches. Founded in 2015, Immuta is headquartered in Boston, MA. Immuta is the fastest way for algorithm-driven enterprises to accelerate the development and control of machine learning and advanced analytics. The company's hyperscale data management platform provides data scientists with rapid, personalized data access to dramatically improve the creation, deployment and auditability of machine learning and AI.
  • 9
    Dataedo

    Dataedo

    Discover, document and manage your metadata. Dataedo is equipped with multiple automated metadata scanners that connect to various database technologies, extract data structures and metadata, and load them into the metadata repository. With a few clicks, build a catalog of your data and describe each element. Decrypt table and column names with business-friendly aliases, and convey the meaning and purpose of data assets with descriptions and user-defined custom fields. Use sample data to learn what data is stored in your data assets. Understand the data better before using it and make sure that the data is of good quality. Ensure high data quality with data profiling. Democratize access to knowledge about data. Build data literacy, democratize data and empower everyone in your organization to make better use of your data with a lightweight on-premises data catalog. Boost data literacy through a data catalog.
    Starting Price: $49 per month
  • 10
    Decube

    Decube

    Decube is a data management platform that helps organizations manage their data observability, data catalog, and data governance needs. It provides end-to-end visibility into data and ensures its accuracy, consistency, and trustworthiness. Decube's platform includes data observability, a data catalog, and data governance components that work together to provide a comprehensive solution. The data observability tools enable real-time monitoring and detection of data incidents, while the data catalog provides a centralized repository for data assets, making it easier to manage and govern data usage and access. The data governance tools provide robust access controls, audit reports, and data lineage tracking to demonstrate compliance with regulatory requirements. Decube's platform is customizable and scalable, making it easy for organizations to tailor it to meet their specific data management needs and manage data across different systems, data sources, and departments.
  • 11
    Castor

    Castor

    Castor is a data catalog designed for mass adoption across the whole company. Get an overview of your entire data environment. Search for data instantly thanks to our powerful search engine. Onboard to a new data infrastructure and access data in a breeze. Go beyond your traditional data catalog. Modern data teams now have numerous data sources; build one source of truth. With its delightful and automated documentation experience, Castor makes it dead simple to trust data. Get column-level, cross-system data lineage in minutes. Get a bird’s-eye view of your data pipelines to build trust in your data. Troubleshoot data issues, perform impact analyses, and comply with GDPR in one tool. Optimize performance, cost, compliance, and security for your data. Keep your data stack healthy with our automated infrastructure monitoring system.
    Starting Price: $699 per month
  • 12
    Weld

    Weld

    Create, edit and organize your data models. No need to get yet another data tool for your data models. Create and manage them in Weld. Packed with features that will make creating your data models a breeze: smart autocomplete, code folding, error highlighting, audit logs, version control and collaboration. Plus, we use the same text editor as VS Code – it's fast, powerful and easy on the eye. Your queries are organized in an easily searchable and accessible library. Audit logs also let you see when a query was last updated, and by whom. Weld Model supports materializing models as tables, incremental tables, views, or a custom materialization of your design. Run all your data operations in one simple platform – with help from a dedicated team of data analysts.
    Starting Price: €750 per month
  • 13
    Ataccama ONE
    Ataccama reinvents the way data is managed to create value on an enterprise scale. Unifying Data Governance, Data Quality, and Master Data Management into a single, AI-powered fabric across hybrid and Cloud environments, Ataccama gives your business and data teams the ability to innovate with unprecedented speed while maintaining trust, security, and governance of your data.
  • 14
    Securiti

    Securiti

    Securiti is the pioneer of the Data Command Center, a centralized platform that enables the safe use of data and GenAI. It provides unified data intelligence, controls and orchestration across hybrid multicloud environments. Large global enterprises rely on Securiti's Data Command Center for data security, privacy, governance, and compliance. Securiti has been recognized with numerous industry and analyst awards, including "Most Innovative Startup" by RSA, "Top 25 Machine Learning Startups" by Forbes, "Most Innovative AI Companies" by CB Insights, "Cool Vendor in Data Security" by Gartner, and "Privacy Management Wave Leader" by Forrester. For more information, please follow us on LinkedIn and visit Securiti.ai.
  • 15
    SQLFlow

    Gudu Software

    SQLFlow provides a visual representation of the overall flow of data. It automates SQL data lineage analysis across databases, ETL, business intelligence, cloud and Hadoop environments by parsing SQL scripts and stored procedures, and depicts all data movement graphically. It supports more than 20 major databases, and the list is still growing. Lineage is built automatically no matter where the SQL resides: databases, file systems, GitHub, Bitbucket, etc. Data flows are shown in a way that is user-friendly, clear, and understandable. Get full visibility into your BI environment. Discovering the root cause of reporting errors creates invaluable business confidence. Simplify regulatory compliance: the visualization of data lineage provides greater transparency and auditability. Enable impact analysis at a granular level and drill down into table-, column-, and query-level lineage. Add powerful data lineage analysis capability to your product instantly.
    Starting Price: $49.99 per month
  • 16
    erwin Data Intelligence
    erwin Data Intelligence (erwin DI) combines data catalog and data literacy capabilities for greater awareness of and access to available data assets, guidance on their use, and guardrails to ensure data policies and best practices are followed. Automatically harvest, transform and feed metadata from a wide array of data sources, operational processes, business applications and data models into a central catalog. Then make it accessible and understandable via role-based, contextual views so stakeholders can make strategic decisions based on accurate insights. erwin DI supports enterprise data governance, digital transformation and any effort that relies on data for favorable outcomes. Schedule ongoing scans of metadata from the widest array of data sources. Easily map data elements from source to target, including data in motion, and harmonize data integration across platforms. Enable data consumers to define and discover data relevant to their roles.
    Starting Price: $299 per month
  • 17
    Y42

    Datos-Intelligence GmbH

    Y42 is the first fully managed Modern DataOps Cloud. It is purpose-built to help companies easily design production-ready data pipelines on top of their Google BigQuery or Snowflake cloud data warehouse. Y42 provides native integration of best-of-breed open-source data tools, comprehensive data governance, and better collaboration for data teams. With Y42, organizations enjoy increased accessibility to data and can make data-driven decisions quickly and efficiently.
  • 18
    PHEMI Health DataLab
    The PHEMI Trustworthy Health DataLab is a unique, cloud-based, integrated big data management system that allows healthcare organizations to enhance innovation and generate value from healthcare data by simplifying the ingestion and de-identification of data, with NSA/military-grade governance, privacy, and security built in. Conventional products simply lock down data. PHEMI goes further, solving privacy and security challenges and addressing the urgent need to secure, govern, curate, and control access to privacy-sensitive personal healthcare information (PHI). This improves data sharing and collaboration inside and outside of an enterprise without compromising the privacy of sensitive information or increasing administrative burden. PHEMI Trustworthy Health DataLab can scale to any size of organization, is easy to deploy and manage, connects to hundreds of data sources, and integrates with popular data science and business analysis tools.
  • 19
    Mozart Data

    Mozart Data

    Mozart Data is the all-in-one modern data platform that makes it easy to consolidate, organize, and analyze data. Start making data-driven decisions by setting up a modern data stack in an hour - no engineering required.
  • 20
    Select Star

    Select Star

    Set up your automated data catalog in just 15 minutes, and receive column-level lineage, Entity Relationship (ER) diagram, and auto-populated documentation within 24 hours. Easily find, tag, and add documentation to your data so everyone can find the right dataset for their use case. Select Star automatically detects and displays your column-level data lineage. You can now trust the data, knowing where it came from. Select Star automatically surfaces how your company uses data. That means you can identify relevant data fields without needing to ask someone else. Select Star treats your data with AICPA SOC 2 Security, Confidentiality, and Availability standards, making sure your data is always safe and sound.
    Starting Price: $270 per month
  • 21
    Metaplane

    Metaplane

    Monitor your entire warehouse in 30 minutes. Identify downstream impact with automated warehouse-to-BI lineage. Trust takes seconds to lose and months to regain. Gain peace of mind with observability built for the modern data era. Code-based tests take hours to write and maintain, so it's hard to achieve the coverage you need. In Metaplane, you can add hundreds of tests within minutes. We support foundational tests (e.g. row counts, freshness, and schema drift), more complex tests (distribution drift, nullness shifts, enum changes), custom SQL, and everything in between. Manual thresholds take a long time to set and quickly go stale as your data changes. Our anomaly detection models learn from historical metadata to automatically detect outliers. Monitor what matters, all while accounting for seasonality, trends, and feedback from your team to minimize alert fatigue. Of course, you can override with manual thresholds, too.
    Starting Price: $825 per month
  • 22
    Masthead

    Masthead

    See the impact of data issues without running SQL. We analyze your logs and metadata to identify freshness and volume anomalies, schema changes in tables, pipeline errors, and their blast radius effects on your business. Masthead observes every table, process, script, and dashboard in the data warehouse and connected BI tools for anomalies, alerting data teams in real time if any data failures occur. Masthead shows the origin and implications of data anomalies and pipeline errors on data consumers. Masthead maps data issues on lineage, so you can troubleshoot within minutes, not hours. Getting a comprehensive view of all processes in GCP without giving access to our data was a game-changer for us. It saved us both time and money. Gain visibility into the cost of each pipeline running in your cloud, regardless of ETL. Masthead also has AI-powered recommendations to help you optimize your models and queries. It takes 15 minutes to connect Masthead to all assets in your data warehouse.
    Starting Price: $899 per month
  • 23
    Secoda

    Secoda

    With Secoda AI on top of your metadata, you can now get contextual search results from across your tables, columns, dashboards, metrics, and queries. Secoda AI can also help you generate documentation and queries from your metadata, saving your team hundreds of hours of mundane work and redundant data requests. Easily search across all columns, tables, dashboards, events, and metrics. AI-powered search lets you ask any question of your data and get a contextual answer, fast. Integrate data discovery into your workflow without disrupting it with our API. Perform bulk updates, tag PII data, manage tech debt, build custom integrations, identify the least used resources, and more. Eliminate manual error and have total trust in your knowledge repository.
    Starting Price: $50 per user per month
  • 24
    Octopai

    Octopai

    Harness the power of data lineage, discovery and a data catalog to achieve full control of your data and instantly navigate through the most complex data landscapes. Gain access to the most comprehensive automated data lineage, discovery and data catalog, providing unprecedented visibility into and trust in the most complex data environments. Octopai extracts metadata from your entire data environment. With a quick, secure and simple process, Octopai can analyze the metadata instantly. In one centralized platform, Octopai gives you automatic access to data lineage, data discovery and a data catalog. Trace any data end-to-end through your entire data landscape, in seconds. Automatically find the data you need anywhere in your data landscape. Create company-wide consistency with a self-creating, self-updating data catalog.
  • 25
    Databricks Data Intelligence Platform
    The Databricks Data Intelligence Platform allows your entire organization to use data and AI. It’s built on a lakehouse to provide an open, unified foundation for all data and governance, and is powered by a Data Intelligence Engine that understands the uniqueness of your data. The winners in every industry will be data and AI companies. From ETL to data warehousing to generative AI, Databricks helps you simplify and accelerate your data and AI goals. Databricks combines generative AI with the unification benefits of a lakehouse to power a Data Intelligence Engine that understands the unique semantics of your data. This allows the Databricks Platform to automatically optimize performance and manage infrastructure in ways unique to your business. The Data Intelligence Engine understands your organization’s language, so search and discovery of new data is as easy as asking a question like you would to a coworker.
  • 26
    Jaspersoft

    TIBCO Software

    Jaspersoft is the customizable reporting and analytics platform you need to succeed. We’ve spent over two decades perfecting our platform so you can deliver the data visualizations and analytics your customers want, from high volumes of pixel-perfect reports to self-service ad hoc reports and more. Jaspersoft helps you deliver the reporting and analytics your customers want, without burdening your development team.
  • 27
    Atlan

    Atlan

    The modern data workspace. Make all your data assets, from data tables to BI reports, instantly discoverable. Our powerful search algorithms, combined with an easy browsing experience, make finding the right asset a breeze. Atlan auto-generates data quality profiles, which make detecting bad data dead easy. From automatic variable type detection and frequency distribution to missing values and outlier detection, we’ve got you covered. Atlan takes the pain away from governing and managing your data ecosystem! Atlan’s bots parse through SQL query history to auto-construct data lineage and auto-detect PII data, allowing you to create dynamic access policies and best-in-class governance. Even non-technical users can directly query across multiple data lakes, warehouses and DBs using our Excel-like query builder. Native integrations with tools like Tableau and Jupyter make data collaboration come alive.
  • 28
    SolarWinds Database Mapper
    Want an easier way to automatically generate documentation from multiple data sources? Wish you had a better understanding of the origin of your data—and who's handled that data? SolarWinds Database Mapper (formerly SentryOne Document) delivers powerful documentation and data lineage analysis capabilities in a cloud or software solution. With SolarWinds Database Mapper, you can easily maintain up-to-date documentation and ensure compliance with business rules and data privacy regulations by accurately tracking data lineage. SolarWinds Database Mapper gives you powerful tools for ensuring your databases are continuously and accurately documented. Plus, the data lineage analysis capabilities help you ensure compliance by providing a visual representation of your data's origin. Track data lineage with a visual display that clearly shows data dependencies across your environment. Easily manage documentation tasks and view logs with an easy-to-access cloud or software solution.
  • 29
    Axon Data Governance
    Your teams need consistent, trusted data to support data-driven decision making. Make sure they have it with integrated, automated, intelligent data governance at scale. Axon Data Governance is the collaboration hub and data marketplace for successful, scalable data governance programs. Easily identify stakeholders and facilitate knowledge transfer across communities so teams can learn from each other. Ensure that teams can quickly find, access, and understand the data they need to uncover analytics insights with a carefully curated data marketplace. Use governed data to fuel key initiatives (such as improving customer experience) and deliver consistent, trusted results across your organization. Build governance and data privacy into your processes and projects from the start to support compliance with regulations like GDPR and CCPA. Develop a common data dictionary to provide a consistent source of business context across multiple tools.
  • 30
    Privacera

    Privacera

    At the intersection of data governance, privacy, and security, Privacera’s unified data access governance platform maximizes the value of data by providing secure data access control and governance across hybrid- and multi-cloud environments. The hybrid platform centralizes access and natively enforces policies across multiple cloud services—AWS, Azure, Google Cloud, Databricks, Snowflake, Starburst and more—to democratize trusted data enterprise-wide without compromising compliance with regulations such as GDPR, CCPA, LGPD, or HIPAA. Trusted by Fortune 500 customers across finance, insurance, retail, healthcare, media, public and the federal sector, Privacera is the industry’s leading data access governance platform that delivers unmatched scalability, elasticity, and performance. Headquartered in Fremont, California, Privacera was founded in 2016 to manage cloud data privacy and security by the creators of Apache Ranger™ and Apache Atlas™.