Alternatives to Validio

Compare Validio alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to Validio in 2024. Compare features, ratings, user reviews, pricing, and more from Validio competitors and alternatives in order to make an informed decision for your business.

  • 1
    DataBuck

    DataBuck

    FirstEigen

    (Bank CFO) “I don’t have confidence and trust in our data. We keep discovering hidden risks”. Since 70% of data initiatives fail due to unreliable data (Gartner research), are you risking your reputation by trusting the accuracy of your data that you share with your business stakeholders and partners? Data Trust Scores must be measured in Data Lakes, warehouses, and throughout the pipeline, to ensure the data is trustworthy and fit for use. It typically takes 4-6 weeks of manual effort just to set a file or table for validation. Then, the rules have to be constantly updated as the data evolves. The only scalable option is to automate data validation rules discovery and rules maintenance. DataBuck is an autonomous, self-learning, Data Observability, Quality, Trustability and Data Matching tool. It reduces effort by 90% and errors by 70%. "What took my team of 10 Engineers 2 years to do, DataBuck could complete it in less than 8 hours." (VP, Enterprise Data Office, a US bank)
    Compare vs. Validio View Software
    Visit Website
  • 2
    MANTA

    MANTA

    Manta

    Manta is the world-class automated approach to visualize, optimize, and modernize how data moves through your organization through code-level lineage. By automatically scanning your data environment with the power of 50+ out-of-the-box scanners, Manta builds a powerful map of all data pipelines to drive efficiency and productivity. Visit manta.io to learn more. With Manta platform, you can make your data a truly enterprise-wide asset, bridge the understanding gap, enable self-service, and easily: • Increase productivity • Accelerate development • Shorten time-to-market • Reduce costs and manual effort • Run instant and accurate root cause and impact analyses • Scope and perform effective cloud migrations • Improve data governance and regulatory compliance (GDPR, CCPA, HIPAA, and more) • Increase data quality • Enhance data privacy and data security
  • 3
    Immuta

    Immuta

    Immuta

    Immuta is the market leader in secure Data Access, providing data teams one universal platform to control access to analytical data sets in the cloud. Only Immuta can automate access to data by discovering, securing, and monitoring data. Data-driven organizations around the world trust Immuta to speed time to data, safely share more data with more users, and mitigate the risk of data leaks and breaches. Founded in 2015, Immuta is headquartered in Boston, MA. Immuta is the fastest way for algorithm-driven enterprises to accelerate the development and control of machine learning and advanced analytics. The company's hyperscale data management platform provides data scientists with rapid, personalized data access to dramatically improve the creation, deployment and auditability of machine learning and AI.
  • 4
    Sifflet

    Sifflet

    Sifflet

    Automatically cover thousands of tables with ML-based anomaly detection and 50+ custom metrics. Comprehensive data and metadata monitoring. Exhaustive mapping of all dependencies between assets, from ingestion to BI. Enhanced productivity and collaboration between data engineers and data consumers. Sifflet seamlessly integrates into your data sources and preferred tools and can run on AWS, Google Cloud Platform, and Microsoft Azure. Keep an eye on the health of your data and alert the team when quality criteria aren’t met. Set up in a few clicks the fundamental coverage of all your tables. Configure the frequency of runs, their criticality, and even customized notifications at the same time. Leverage ML-based rules to detect any anomaly in your data. No need for an initial configuration. A unique model for each rule learns from historical data and from user feedback. Complement the automated rules with a library of 50+ templates that can be applied to any asset.
  • 5
    OvalEdge

    OvalEdge

    OvalEdge

    OvalEdge is a cost-effective data catalog designed for end-to-end data governance, privacy compliance, and fast, trustworthy analytics. OvalEdge crawls your organizations’ databases, BI platforms, ETL tools, and data lakes to create an easy-to-access, smart inventory of your data assets. Using OvalEdge, analysts can discover data and deliver powerful insights quickly. OvalEdge’s comprehensive functionality enables users to establish and improve data access, data literacy, and data quality.
    Starting Price: $1,300/month
  • 6
    Metaplane

    Metaplane

    Metaplane

    Monitor your entire warehouse in 30 minutes. Identify downstream impact with automated warehouse-to-BI lineage. Trust takes seconds to lose and months to regain. Gain peace of mind with observability built for the modern data era. Code-based tests take hours to write and maintain, so it's hard to achieve the coverage you need. In Metaplane, you can add hundreds of tests within minutes. We support foundational tests (e.g. row counts, freshness, and schema drift), more complex tests (distribution drift, nullness shifts, enum changes), custom SQL, and everything in between. Manual thresholds take a long time to set and quickly go stale as your data changes. Our anomaly detection models learn from historical metadata to automatically detect outliers. Monitor what matters, all while accounting for seasonality, trends, and feedback from your team to minimize alert fatigue. Of course, you can override with manual thresholds, too.
    Starting Price: $825 per month
  • 7
    Aggua

    Aggua

    Aggua

    Aggua is a data fabric augmented AI platform that enables data and business teams Access to their data, creating Trust and giving practical Data Insights, for a more holistic, data-centric decision-making. Instead of wondering what is going on underneath the hood of your organization's data stack, become immediately informed with a few clicks. Get access to data cost insights, data lineage and documentation without needing to take time out of your data engineer's workday. Instead of spending a lot of time tracing what a data type change will break in your data pipelines, tables and infrastructure, with automated lineage, your data architects and engineers can spend less time manually going through logs and DAGs and more time actually making the changes to infrastructure.
  • 8
    Decube

    Decube

    Decube

    Decube is a data management platform that helps organizations manage their data observability, data catalog, and data governance needs. It provides end-to-end visibility into data and ensures its accuracy, consistency, and trustworthiness. Decube's platform includes data observability, a data catalog, and data governance components that work together to provide a comprehensive solution. The data observability tools enable real-time monitoring and detection of data incidents, while the data catalog provides a centralized repository for data assets, making it easier to manage and govern data usage and access. The data governance tools provide robust access controls, audit reports, and data lineage tracking to demonstrate compliance with regulatory requirements. Decube's platform is customizable and scalable, making it easy for organizations to tailor it to meet their specific data management needs and manage data across different systems, data sources, and departments.
  • 9
    Dataedo

    Dataedo

    Dataedo

    Discover, document and manage your metadata. Dataedo is equipped with multiple automated metadata scanners that connect to various database technologies, extract data structures and metadata, and load them into the metadata repository. With a few clicks, build a catalog of your data and describe each element. Decrypt table and column names with business-friendly aliases, provide meaning and purpose of data assets with descriptions and user-defined custom fields. Use sample data to learn what data is stored in your data assets. Understand the data better before using it and make sure that the data is good quality. Ensure high data quality with data profiling. Democratize access to knowledge about data. Build data literacy, democratize data and empower everyone in your organization to make better use of your data with a lightweight on-premises data catalog. Boost data literacy through a data catalog.
    Starting Price: $49 per month
  • 10
    Data360 Govern

    Data360 Govern

    Precisely

    Your organization knows the value of data and the need to get it into the hands of business users for maximum impact, but without enterprise data governance, that data might be hard to find, understand, and trust. Data360 Govern is an enterprise data governance, catalog, and metadata management solution that gives you confidence in the quality, value, and trustworthiness of your data. It automates governance and stewardship tasks to help you answer essential questions about your data’s source, use, meaning, ownership, and quality. With Data360 Govern, you can make faster decisions on data usage and management, build collaboration across your entire organization, and allow users to get the answers they need – when they need them. Transparency into your organization’s data landscape gives you the power to track the critical data aligned with your most important business outcomes.
  • 11
    Talend Data Catalog
    Talend Data Catalog gives your organization a single, secure point of control for your data. With robust tools for search and discovery, and connectors to extract metadata from virtually any data source, Data Catalog makes it easy to protect your data, govern your analytics, manage data pipelines, and accelerate your ETL processes. Data Catalog automatically crawls, profiles, organizes, links, and enriches all your metadata. Up to 80% of the information associated with the data is documented automatically and kept up-to-date through smart relationships and machine learning, continually delivering the most current data to the user. Make data governance a team sport with a secure single point of control where you can collaborate to improve data accessibility, accuracy, and business relevance. Support data privacy and regulatory compliance with intelligent data lineage tracing and compliance tracking.
  • 12
    Atlan

    Atlan

    Atlan

    The modern data workspace. Make all your data assets from data tables to BI reports, instantly discoverable. Our powerful search algorithms combined with easy browsing experience, make finding the right asset, a breeze. Atlan auto-generates data quality profiles which make detecting bad data, dead easy. From automatic variable type detection & frequency distribution to missing values and outlier detection, we’ve got you covered. Atlan takes the pain away from governing and managing your data ecosystem! Atlan’s bots parse through SQL query history to auto construct data lineage and auto-detect PII data, allowing you to create dynamic access policies & best in class governance. Even non-technical users can directly query across multiple data lakes, warehouses & DBs using our excel-like query builder. Native integrations with tools like Tableau and Jupyter makes data collaboration come alive.
  • 13
    Collibra

    Collibra

    Collibra

    With a best-in-class catalog, flexible governance, continuous quality, and built-in privacy, the Collibra Data Intelligence Cloud is your single system of engagement for data. Support your users with a best-in-class data catalog that includes embedded governance, privacy and quality. Raise the grade, by ensuring teams can quickly find, understand and access data across sources, business applications, BI and data science tools in one central location. Give your data some much-needed privacy. Centralize, automate and guide workflows to encourage collaboration, operationalize privacy and address global regulatory requirements. Get the full story around your data with Collibra Data Lineage. Automatically map relationships between systems, applications and reports to provide a context-rich view across the enterprise. Hone in on the data you care about most and trust that it is relevant, complete and trustworthy.
  • 14
    erwin Data Intelligence
    erwin Data Intelligence (erwin DI) combines data catalog and data literacy capabilities for greater awareness of and access to available data assets, guidance on their use, and guardrails to ensure data policies and best practices are followed. Automatically harvest, transform and feed metadata from a wide array of data sources, operational processes, business applications and data models into a central catalog. Then make it accessible and understandable via role-based, contextual views so stakeholders can make strategic decisions based on accurate insights. erwin DI supports enterprise data governance, digital transformation and any effort that relies on data for favorable outcomes. Schedule ongoing scans of metadata from the widest array of data sources. Easily map data elements from source to target, including data in motion, and harmonize data integration across platforms. Enable data consumers to define and discover data relevant to their roles.
    Starting Price: $299 per month
  • 15
    Blindata

    Blindata

    Blindata

    Blindata covers all the functions of a Data Governance program: Business Glossary, Data Catalog & Data Lineage build an integrated and complete view on your Data. Data Classification module gives a semantic meaning to the data while the Data Quality, Issue Management & Data Stewardship modules improve the reliability and trust on data. Moreover, privacy compliance can leverage specific features: registry of processing activities, centralized privacy note management, consent registry with Blockchain integrated notarization. Blindata Agent can connect to different data sources, collecting metadata such data structures (Tables, Views, Fields, …), data quality metrics, reverse lineage, etc. Blindata has a modular and entirely API based architecture allowing systematic integration with the most critical business systems (DBMS, Active Directory, e-commerce, Data Platforms). Blindata is available as SaaS, can be installed “on Premise” or purchased on AWS Marketplace.
    Starting Price: $2000/year/user
  • 16
    DataGalaxy

    DataGalaxy

    DataGalaxy

    DataGalaxy’s all-in-one data catalog offers out-of-the-box actionability with fully-customizable attributes, visualization tools, and AI integration to give business teams the ability to document, link, and track all their metadata assets. The Data Catalog 360°’s user-centric platform is dedicated to metadata mapping, management, and knowledge sharing to help your organization manage data your way. A data catalog enables employees from all teams to collaborate using centralized, homogeneous data sets. Our data catalog provides clarity on data definitions, synonyms, and essential business attributes with a semantic layer so all users can understand and leverage their data as an asset. When you need answers about specific metadata, turn to the data catalog that identifies a topic’s 360° data experts, owners, and stewards empowering your team through streamlined collaboration.
  • 17
    Acryl Data

    Acryl Data

    Acryl Data

    No more data catalog ghost towns. Acryl Cloud drives fast time-to-value via Shift Left practices for data producers and an intuitive UI for data consumers. Continuously detect data quality incidents in real-time, automate anomaly detection to prevent breakages, and drive fast resolution when they do occur. Acryl Cloud supports both push-based and pull-based metadata ingestion for easy maintenance, ensuring information is trustworthy, up-to-date, and definitive. Data should be operational. Go beyond simple visibility and use automated Metadata Tests to continuously expose data insights and surface new areas for improvement. Reduce confusion and accelerate resolution with clear asset ownership, automatic detection, streamlined alerts, and time-based lineage for tracing root causes.
  • 18
    Zaloni Arena
    End-to-end DataOps built on an agile platform that improves and safeguards your data assets. Arena is the premier augmented data management platform. Our active data catalog enables self-service data enrichment and consumption to quickly control complex data environments. Customizable workflows that increase the accuracy and reliability of every data set. Use machine-learning to identify and align master data assets for better data decisioning. Complete lineage with detailed visualizations alongside masking and tokenization for superior security. We make data management easy. Arena catalogs your data, wherever it is and our extensible connections enable analytics to happen across your preferred tools. Conquer data sprawl challenges: Our software drives business and analytics success while providing the controls and extensibility needed across today’s decentralized, multi-cloud data complexity.
  • 19
    Secuvy AI

    Secuvy AI

    Secuvy AI

    Secuvy is a next-generation cloud platform to automate data security, privacy compliance and governance via AI-driven workflows. Best in class data intelligence especially for unstructured data. Secuvy is a next-generation cloud platform to automate data security, privacy compliance and governance via ai-driven workflows. Best in class data intelligence especially for unstructured data. Automated data discovery, customizable subject access requests, user validations, data maps & workflows for privacy regulations such as ccpa, gdpr, lgpd, pipeda and other global privacy laws. Data intelligence to find sensitive and privacy information across multiple data stores at rest and in motion. In a world where data is growing exponentially, our mission is to help organizations to protect their brand, automate processes, and improve trust with customers. With ever-expanding data sprawls we wish to reduce human efforts, costs & errors for handling Sensitive Data.
  • 20
    Apache Atlas

    Apache Atlas

    Apache Software Foundation

    Atlas is a scalable and extensible set of core foundational governance services – enabling enterprises to effectively and efficiently meet their compliance requirements within Hadoop and allows integration with the whole enterprise data ecosystem. Apache Atlas provides open metadata management and governance capabilities for organizations to build a catalog of their data assets, classify and govern these assets and provide collaboration capabilities around these data assets for data scientists, analysts and the data governance team. Pre-defined types for various Hadoop and non-Hadoop metadata. Ability to define new types for the metadata to be managed. Types can have primitive attributes, complex attributes, object references; can inherit from other types. Instances of types, called entities, capture metadata object details and their relationships. REST APIs to work with types and instances allow easier integration.
  • 21
    IBM Manta Data Lineage
    IBM Manta Data Lineage is a data lineage platform that increases data pipeline transparency so businesses can determine data accuracy throughout their models and systems. As businesses integrate AI into their workflows and data becomes more complex, data quality, provenance, and lineage are increasingly important. In fact, IBM’s 2023 CEO study found the number one barrier to generative AI adoption is concerns about the lineage of data.  IBM offers an automated data lineage platform that automatically scans your applications to build a powerful map of all data flows. The platform then delivers the info through a native user interface (UI) and other channels to both technical and nontechnical users. With IBM Manta Data Lineage, data operations teams get comprehensive visibility and control of their data pipeline. By improving your understanding and use of dynamic metadata, you can ensure that data is managed efficiently and accurately across complex systems.
  • 22
    Alation

    Alation

    Alation

    Alation is the first company to bring a data catalog to market. It radically improves how people find, understand, trust, use, and reuse data. Alation pioneered active, non-invasive data governance, which supports both data democratization and compliance at scale, so people have the data they need alongside guidance on how to use it correctly. By combining human insight with AI and machine learning, Alation tackles the toughest challenges in data today. More than 350 enterprises use Alation to make confident, data-driven decisions. American Family Insurance, Exelon, Munich Re, and Pfizer are all proud customers.
  • 23
    Accurity

    Accurity

    Accurity

    With Accurity, the all-in-one data intelligence platform, you get a company-wide understanding and complete trust in your data — speed up business-critical decision making, increase your revenue, reduce your costs, and ensure your company’s data compliance. Equipped with timely, relevant, and accurate data, you can successfully satisfy and engage with your customers, elevating your brand awareness and driving sales conversions. With everything accessible from a single interface, automated quality checks, and data quality issue workflows, you can lower personnel and infrastructure costs, and spend time utilizing your data rather than just managing it. Discover real value in your data by revealing and removing inefficiencies, improving your decision-making processes, and finding valuable product and customer information to boost your company’s innovation.
  • 24
    Masthead

    Masthead

    Masthead

    See the impact of data issues without running SQL. We analyze your logs and metadata to identify freshness and volume anomalies, schema changes in tables, pipeline errors, and their blast radius effects on your business. Masthead observes every table, process, script, and dashboard in the data warehouse and connected BI tools for anomalies, alerting data teams in real time if any data failures occur. Masthead shows the origin and implications of data anomalies and pipeline errors on data consumers. Masthead maps data issues on lineage, so you can troubleshoot within minutes, not hours. We get a comprehensive view of all processes in GCP without giving access to our data was a game-changer for us. It saved us both time and money. Gain visibility into the cost of each pipeline running in your cloud, regardless of ETL. Masthead also has AI-powered recommendations to help you optimize your models and queries. It takes 15 min to connect Masthead to all assets in your data warehouse.
    Starting Price: $899 per month
  • 25
    TIBCO Cloud Metadata
    One challenge in metadata management is the lack of connection between silos of metadata used in IT, operations, analytics, and compliance. TIBCO Cloud™ Metadata software is a single solution that spans all your metadata: data dictionaries, business glossaries, and data catalogs. Built-in artificial intelligence (AI) and machine learning (ML) algorithms facilitate metadata classification and data lineages (horizontal, vertical, regulatory). Deliver the data context, coherency, and control you need to achieve the highest efficiency, best performance, and smartest decision-making across all your teams and departments. Effective execution, analysis, and governance require accurate and consistent metadata about your operations, analytics, and compliance efforts. Instead of multiple silos, use a single solution. Discover, harvest, and manage metadata from all the applications, databases, data lakes, enterprise data warehouses, APIs, social media, and streaming sources you use.
  • 26
    erwin Data Catalog
    erwin Data Catalog by Quest is metadata management software that helps organizations learn what data they have and where it’s located, including data at rest and in motion. It tells you the data and metadata available for a certain topic so those particular sources and assets can be found quickly for analysis and decision-making. erwin Data Catalog automates the processes involved in harvesting, integrating, activating and governing enterprise data according to business requirements. This automation results in greater accuracy and faster time to value for data governance and digital transformation efforts, including data warehouse, data lake, data vault and other Big Data deployments, cloud migrations, etc. Metadata management is key to sustainable data governance and any other organizational effort for which data is key to the outcome. erwin Data Catalog automates enterprise metadata management, data mapping, data cataloging, code generation, data profiling and data lineage.
  • 27
    Kylo

    Kylo

    Teradata

    Kylo is an open source enterprise-ready data lake management software platform for self-service data ingest and data preparation with integrated metadata management, governance, security and best practices inspired by Think Big's 150+ big data implementation projects. Self-service data ingest with data cleansing, validation, and automatic profiling. Wrangle data with visual sql and an interactive transform through a simple user interface. Search and explore data and metadata, view lineage, and profile statistics. Monitor health of feeds and services in the data lake. Track SLAs and troubleshoot performance. Design batch or streaming pipeline templates in Apache NiFi and register with Kylo to enable user self-service. Organizations can expend significant engineering effort moving data into Hadoop yet struggle to maintain governance and data quality. Kylo dramatically simplifies data ingest by shifting ingest to data owners through a simple guided UI.
  • 28
    Informatica Enterprise Data Catalog
    Scan and index metadata, discover and profile data, and provide detailed lineage across tens of millions of data sets. Classify and organize data assets across any environment to maximize data value and reuse. Automatically scan across multi-cloud platforms, BI tools, ETL, and third-party metadata catalogs; and data types. Leverage AI-powered domain discovery, data similarity, business term associations, and recommendations. Track data movement, from high-level system views to granular column-level lineage, and get detailed impact analysis. Use the Data Asset Analytics dashboard to understand asset usage, enrichment, and collaboration. View data quality rules, scorecards, metric groups, and profiling stats in context. Tap into shared data knowledge with certifications, ratings and reviews, a Q&A platform, and change notifications. Our broad and deep lineup of enterprise-grade data management solutions sets Informatica apart from the crowd.
  • 29
    Castor

    Castor

    Castor

    Castor is a data catalog designed for mass adoption across the whole company. Have an overview of all your data environment. Search for data instantly thanks to our powerful search engine. Onboard to a new data infrastructure and access data in a breeze. Go beyond your traditional data catalog. Modern data teams now have numerous data sources, build one truth. With its delightful and automated documentation experience, Castor makes it dead simple to trust data. Column-level, cross-system data lineage in minutes. Get a bird’s eye view of your data pipelines to build trust in your data. Troubleshoot data issues, perform impact analyses, comply with GDPR in one tool. Optimize performance, cost, compliance, and security for your data. Keep your data stack healthy with our automated infrastructure monitoring system.
    Starting Price: $699 per month
  • 30
    SAP Information Steward
    SAP Information Steward software supports data profiling and monitoring and information policy management. As the information governance layer of SAP Business Technology Platform, it can help you anticipate risk and drive better business outcomes. Combine data profiling, data lineage, and metadata management to gain continuous insight into the integrity of your enterprise data model. Gain a better understanding of data quality across your data management landscape,while accessing and analyzing metrics with intuitive dashboards and scorecards. Improve enterprise information management initiatives by supporting analysts, data stewards, and IT experts with consistent validation rules and guidelines. Discover, assess, define, monitor, and improve the quality of your enterprise data assets with data profiling and metadata management – all with one solution. Forecast the potential savings of improved data quality by running what-if analyses.
  • 31
    Secoda

    Secoda

    Secoda

    With Secoda AI on top of your metadata, you can now get contextual search results from across your tables, columns, dashboards, metrics, and queries. Secoda AI can also help you generate documentation and queries from your metadata, saving your team hundreds of hours of mundane work and redundant data requests. Easily search across all columns, tables, dashboards, events, and metrics. AI-powered search lets you ask any question to your data and get a contextual answer, fast. Get answers to questions. Integrate data discovery into your workflow without disrupting it with our API. Perform bulk updates, tag PII data, manage tech debt, build custom integrations, identify the least used resources, and more. Eliminate manual error and have total trust in your knowledge repository.
    Starting Price: $50 per user per month
  • 32
    Databricks Data Intelligence Platform
    The Databricks Data Intelligence Platform allows your entire organization to use data and AI. It’s built on a lakehouse to provide an open, unified foundation for all data and governance, and is powered by a Data Intelligence Engine that understands the uniqueness of your data. The winners in every industry will be data and AI companies. From ETL to data warehousing to generative AI, Databricks helps you simplify and accelerate your data and AI goals. Databricks combines generative AI with the unification benefits of a lakehouse to power a Data Intelligence Engine that understands the unique semantics of your data. This allows the Databricks Platform to automatically optimize performance and manage infrastructure in ways unique to your business. The Data Intelligence Engine understands your organization’s language, so search and discovery of new data is as easy as asking a question like you would to a coworker.
  • 33
    Qlik Catalog
    When you empower your business with on-demand access to analytics-ready data, you accelerate discovery and people get answers faster. Qlik Catalog is an enterprise data catalog that simplifies and accelerates the profiling, organization, preparation, and delivery of trustworthy, actionable data in days, not months. Qlik Catalog builds a secure, enterprise-scale catalog of all the data your organization has available for analytics, no matter where it is. Powerful, automated data preparation and metadata tools streamline the transformation of raw data into analytics-ready information assets. Business users get a single, go-to data catalog to find, understand, and use any enterprise data source to gain insights. Automatically profile and document the exact content, structure, and quality of your data using built-in data loaders to simplify and accelerate the process. Build a Smart Data Catalog that documents every aspect of your data.
    Starting Price: $30 per user per month
  • 34
    Tree Schema Data Catalog
    The essential tool for metadata management. Automatically populate your entire catalog in under 5 minutes! Data Discovery. Find the data you need anywhere within your data ecosystem from the database all the way down to the specific values for each field. Automatically document your data from existing data stores. First-class support for tabular and unstructured data. Automated data governance actions. Data Lineage. Explore your data lineage and understand where your data comes from and where it is going. View impact analysis of changes Find all up and downstream impacts. Visualize relationships and connections. API AccessNew. Manage your data lineage as code and keep your catalog up to date with the Tree Schema API. Integrate Data Lineage into CICD pipelines Capture values & descriptions within your code Analyze impact for breaking changes. Data Dictionary. Know the key terms and lingo that drive your business. Define the context and scope for keywords
    Starting Price: $99 per month
  • 35
    Foundational

    Foundational

    Foundational

    Identify code and optimization issues in real-time, prevent data incidents pre-deploy, and govern data-impacting code changes end to end—from the operational database to the user-facing dashboard. Automated, column-level data lineage, from the operational database all the way to the reporting layer, ensures every dependency is analyzed. Foundational automates data contract enforcement by analyzing every repository from upstream to downstream, directly from source code. Use Foundational to proactively identify code and data issues, find and prevent issues, and create controls and guardrails. Foundational can be set up in minutes with no code changes required.
  • 36
    Datafold

    Datafold

    Datafold

    Prevent data outages by identifying and fixing data quality issues before they get into production. Go from 0 to 100% test coverage of your data pipelines in a day. Know the impact of each code change with automatic regression testing across billions of rows. Automate change management, improve data literacy, achieve compliance, and reduce incident response time. Don’t let data incidents take you by surprise. Be the first one to know with automated anomaly detection. Datafold’s easily adjustable ML model adapts to seasonality and trend patterns in your data to construct dynamic thresholds. Save hours spent on trying to understand data. Use the Data Catalog to find relevant datasets, fields, and explore distributions easily with an intuitive UI. Get interactive full-text search, data profiling, and consolidation of metadata in one place.
  • 37
    Ataccama ONE

    Ataccama ONE

    Ataccama

    Ataccama reinvents the way data is managed to create value on an enterprise scale. Unifying Data Governance, Data Quality, and Master Data Management into a single, AI-powered fabric across hybrid and Cloud environments, Ataccama gives your business and data teams the ability to innovate with unprecedented speed while maintaining trust, security, and governance of your data.
  • 38
    IBM Databand
    Monitor your data health and pipeline performance. Gain unified visibility for pipelines running on cloud-native tools like Apache Airflow, Apache Spark, Snowflake, BigQuery, and Kubernetes. An observability platform purpose built for Data Engineers. Data engineering is only getting more challenging as demands from business stakeholders grow. Databand can help you catch up. More pipelines, more complexity. Data engineers are working with more complex infrastructure than ever and pushing higher speeds of release. It’s harder to understand why a process has failed, why it’s running late, and how changes affect the quality of data outputs. Data consumers are frustrated with inconsistent results, model performance, and delays in data delivery. Not knowing exactly what data is being delivered, or precisely where failures are coming from, leads to persistent lack of trust. Pipeline logs, errors, and data quality metrics are captured and stored in independent, isolated systems.
  • 39
    ThinkData Works

    ThinkData Works

    ThinkData Works

    Data is the backbone of effective decision-making. However, employees spend more time managing it than using it. ThinkData Works provides a robust catalog platform for discovering, managing, and sharing data from both internal and external sources. Enrichment solutions combine partner data with your existing datasets to produce uniquely valuable assets that can be shared across your entire organization. Unlock the value of your data investment by making data teams more efficient, improving project outcomes, replacing multiple existing tech solutions, and providing you with a competitive advantage.
  • 40
    Microsoft Purview
    Microsoft Purview is a unified data governance service that helps you manage and govern your on-premises, multicloud, and software-as-a-service (SaaS) data. Easily create a holistic, up-to-date map of your data landscape with automated data discovery, sensitive data classification, and end-to-end data lineage. Empower data consumers to find valuable, trustworthy data. Automated data discovery, lineage identification, and data classification across on-premises, multicloud, and SaaS sources. Unified map of your data assets and their relationships for more effective governance. Semantic search enables data discovery using business or technical terms. Insight into the location and movement of sensitive data across your hybrid data landscape. Establish the foundation for effective data usage and governance with Purview Data Map. Automate and manage metadata from hybrid sources. Classify data using built-in and custom classifiers and Microsoft Information Protection sensitivity labels.
    Starting Price: $0.342
  • 41
    rudol

    rudol

    rudol

    Unify your data catalog, reduce communication overhead and enable quality control to any member of your company, all without deploying or installing anything. rudol is a data quality platform that helps companies understand all their data sources, no matter where they come from; reduces excessive communication in reporting processes or urgencies; and enables data quality diagnosing and issue prevention to all the company, through easy steps With rudol, each organization is able to add data sources from a growing list of providers and BI tools with a standardized structure, including MySQL, PostgreSQL, Airflow, Redshift, Snowflake, Kafka, S3*, BigQuery*, MongoDB*, Tableau*, PowerBI*, Looker* (* in development). So, regardless of where it’s coming from, people can understand where and how the data is stored, read and collaborate with its documentation, or easily contact data owners using our integrations.
    Starting Price: $0
  • 42
    Datakin

    Datakin

    Datakin

    Instantly reveal the order hidden within your complex data world, and always know exactly where to look for answers. Datakin automatically traces data lineage, showing your entire data ecosystem in a rich visual graph. It clearly illustrates the upstream and downstream relationships for each dataset. The Duration tab summarizes a job’s performance in a Gantt-style chart along with its upstream dependencies, making it easy to find bottlenecks. When you need to pinpoint the exact moment of a breaking change, the Compare tab shows how your jobs and datasets have changed between runs. Sometimes jobs that run successfully produce bad output. The Quality tab surfaces critical data quality metrics, showing how they change over time so anomalies become obvious. Datakin helps you find the root cause of issues quickly – and prevent new ones from occurring.
    Starting Price: $2 per month
  • 43
    Zeenea Data Catalog
    With our cloud-based Data Catalog, provide a next-generation workspace for your data teams to find, understand and collaborate on your data assets and usages. Empower your data teams by creating an understandable & reliable data asset landscape at the enterprise level. Simple to set up. Easy to use. Ideal for scaling. Easily organize and share your data knowledge and usages across your team and beyond. Get all the insights from your data in real time by connecting Zeenea to your sources. Increase the productivity of your data teams by simplifying and reducing the time spent searching for enterprise assets. Create a library for all your data assets and usages available at the enterprise-level by using the full potential of their metadata in just a few clicks. And ultimately improve any data discovery path. So your data teams can focus on what’s important: delivering their data projects on time.
  • 44
    Tokern

    Tokern

    Tokern

    Open source data governance suite for databases and data lakes. Tokern is a simple to use toolkit to collect, organize and analyze data lake's metadata. Run as a command-line app for quick tasks. Run as a service for continuous collection of metadata. Analyze lineage, access control and PII datasets using reporting dashboards or programmatically in Jupyter notebooks. Tokern is an open source data governance suite for databases and data lakes. Improve ROI of your data, comply with regulations like HIPAA, CCPA and GDPR and protect critical data from insider threats with confidence. Centralized metadata management of users, datasets and jobs. Powers other data governance features. Track Column Level Data Lineage for Snowflake, AWS Redshift and BigQuery. Build lineage from query history or ETL scripts. Explore lineage using interactive graphs or programmatically using APIs or SDKs.
  • 45
    SAP Data Intelligence
    Turn data chaos into data value with data intelligence. Connect, discover, enrich, and orchestrate disjointed data assets into actionable business insights at enterprise scale. SAP Data Intelligence is a comprehensive data management solution. As the data orchestration layer of SAP’s Business Technology Platform, it transforms distributed data sprawls into vital data insights, delivering innovation at scale. Provide your users with intelligent, relevant, and contextual insights with integration across the IT landscape. Integrate and orchestrate massive data volumes and streams at scale. Streamline, operationalize, and govern innovation driven by machine learning. Optimize governance and minimize compliance risk with comprehensive metadata management rules. Connect, discover, enrich, and orchestrate disjointed data assets into actionable business insights at enterprise scale.
    Starting Price: $1.22 per month
  • 46
    Octopai

    Octopai

    Octopai

    Harness the power of data lineage, discovery and a data catalog to achieve full control of your data. that can instantly navigate through the most complex data landscapes. Gain access to the most comprehensive automated data lineage, discovery and data catalog. Providing unprecedented visibility and trust into the most complex data environments. Octopai extracts metadata from your entire data environment. With a quick, secure and simple process, Octopai will instantly be able to analyze the metadata. In one centralized platform Octopai allows you to access data lineage, data discovery and a data catalog, automatically. Trace any data end-to-end through your entire data landscape, in seconds. Automatically find the data you need anywhere in your data landscape. Create company-wide consistency with a self-creating, self-updating data catalog.
  • 47
    Y42

    Y42

    Datos-Intelligence GmbH

    Y42 is the first fully managed Modern DataOps Cloud. It is purpose-built to help companies easily design production-ready data pipelines on top of their Google BigQuery or Snowflake cloud data warehouse. Y42 provides native integration of best-of-breed open-source data tools, comprehensive data governance, and better collaboration for data teams. With Y42, organizations enjoy increased accessibility to data and can make data-driven decisions quickly and efficiently.
  • 48
    Datalogz

    Datalogz

    Datalogz

    Data knowledge management platform that enables teams to streamline data discovery and understanding with the ultimate goal of being able to trust their data. Prevent misreporting analytics and costly mistakes today!
  • 49
    Datameer

    Datameer

    Datameer

    Datameer revolutionizes data transformation with a low-code approach, trusted by top global enterprises. Craft, transform, and publish data seamlessly with no code and SQL, simplifying complex data engineering tasks. Empower your data teams to make informed decisions confidently while saving costs and ensuring responsible self-service analytics. Speed up your analytics workflow by transforming datasets to answer ad-hoc questions and support operational dashboards. Empower everyone on your team with our SQL or Drag-and-Drop to transform your data in an intuitive and collaborative workspace. And best of all, everything happens in Snowflake. Datameer is designed and optimized for Snowflake to reduce data movement and increase platform adoption. Some of the problems Datameer solves: - Analytics is not accessible - Drowning in backlog - Long development
  • 50
    IBM InfoSphere Information Analyzer
    Understanding the quality, content and structure of your data is an important first step when making critical business decisions. IBM® InfoSphere® Information Analyzer, a component of IBM InfoSphere Information Server, evaluates data quality and structure within and across heterogeneous systems. It utilizes a reusable rules library and supports multi-level evaluations by rule record and pattern. It also facilitates the management of exceptions to established rules to help identify data inconsistencies, redundancies, and anomalies, and make inferences about the best choices for structure.