Best Data Engineering Tools for OpenMetadata

Compare the Top Data Engineering Tools that integrate with OpenMetadata as of July 2025

This a list of Data Engineering tools that integrate with OpenMetadata. Use the filters on the left to add additional filters for products that have integrations with OpenMetadata. View the products that work with OpenMetadata in the table below.

What are Data Engineering Tools for OpenMetadata?

Data engineering tools are designed to facilitate the process of preparing and managing large datasets for analysis. These tools support tasks like data extraction, transformation, and loading (ETL), allowing engineers to build efficient data pipelines that move and process data from various sources into storage systems. They help ensure data integrity and quality by providing features for validation, cleansing, and monitoring. Data engineering tools also often include capabilities for automation, scalability, and integration with big data platforms. By streamlining complex workflows, they enable organizations to handle large-scale data operations more efficiently and support advanced analytics and machine learning initiatives. Compare and read user reviews of the best Data Engineering tools for OpenMetadata currently available using the table below. This list is updated regularly.

  • 1
    Google Cloud BigQuery
    BigQuery is an essential tool for data engineers, allowing them to streamline the process of data ingestion, transformation, and analysis. With its scalable infrastructure and robust suite of data engineering features, users can efficiently build data pipelines and automate workflows. BigQuery integrates easily with other Google Cloud tools, making it a versatile solution for data engineering tasks. New customers can take advantage of $300 in free credits to explore BigQuery’s features, enabling them to build and refine their data workflows for maximum efficiency and effectiveness. This allows engineers to focus more on innovation and less on managing the underlying infrastructure.
    Starting Price: Free ($300 in free credits)
    View Tool
    Visit Website
  • 2
    Fivetran

    Fivetran

    Fivetran

    Fivetran is a leading data integration platform that centralizes an organization’s data from various sources to enable modern data infrastructure and drive innovation. It offers over 700 fully managed connectors to move data automatically, reliably, and securely from SaaS applications, databases, ERPs, and files to data warehouses and lakes. The platform supports real-time data syncs and scalable pipelines that fit evolving business needs. Trusted by global enterprises like Dropbox, JetBlue, and Pfizer, Fivetran helps accelerate analytics, AI workflows, and cloud migrations. It features robust security certifications including SOC 1 & 2, GDPR, HIPAA, and ISO 27001. Fivetran provides an easy-to-use, customizable platform that reduces engineering time and enables faster insights.
    View Tool
    Visit Website
  • 3
    Domo

    Domo

    Domo

    Domo puts data to work for everyone so they can multiply their impact on the business. Our cloud-native data experience platform goes beyond traditional business intelligence and analytics, making data visible and actionable with user-friendly dashboards and apps. Underpinned by a secure data foundation that connects with existing cloud and legacy systems, Domo helps companies optimize critical business processes at scale and in record time to spark the bold curiosity that powers exponential business results.
  • 4
    Looker

    Looker

    Google

    Looker, Google Cloud’s business intelligence platform, enables you to chat with your data. Organizations turn to Looker for self-service and governed BI, to build custom applications with trusted metrics, or to bring Looker modeling to their existing environment. The result is improved data engineering efficiency and true business transformation. Looker is reinventing business intelligence for the modern company. Looker works the way the web does: browser-based, its unique modeling language lets any employee leverage the work of your best data analysts. Operating 100% in-database, Looker capitalizes on the newest, fastest analytic databases—to get real results, in real time.
  • 5
    Teradata VantageCloud
    Teradata VantageCloud is a comprehensive cloud-based analytics and data platform that allows businesses to unlock the full potential of their data with unparalleled speed, scalability, and operational flexibility. Engineered for enterprise-grade performance, VantageCloud supports seamless AI and machine learning integration, enabling organizations to generate real-time insights and make informed decisions faster. It offers deployment flexibility across public clouds, hybrid environments, or on-premise setups, making it highly adaptable to existing infrastructures. With features like unified data architecture, intelligent governance, and optimized cost-efficiency, VantageCloud helps businesses reduce complexity, drive innovation, and maintain a competitive edge in today’s data-driven world.
  • 6
    dbt

    dbt

    dbt Labs

    Version control, quality assurance, documentation and modularity allow data teams to collaborate like software engineering teams. Analytics errors should be treated with the same level of urgency as bugs in a production product. Much of an analytic workflow is manual. We believe workflows should be built to execute with a single command. Data teams use dbt to codify business logic and make it accessible to the entire organization—for use in reporting, ML modeling, and operational workflows. Built-in CI/CD ensures that changes to data models move appropriately through development, staging, and production environments. dbt Cloud also provides guaranteed uptime and custom SLAs.
    Starting Price: $50 per user per month
  • 7
    Databricks Data Intelligence Platform
    The Databricks Data Intelligence Platform allows your entire organization to use data and AI. It’s built on a lakehouse to provide an open, unified foundation for all data and governance, and is powered by a Data Intelligence Engine that understands the uniqueness of your data. The winners in every industry will be data and AI companies. From ETL to data warehousing to generative AI, Databricks helps you simplify and accelerate your data and AI goals. Databricks combines generative AI with the unification benefits of a lakehouse to power a Data Intelligence Engine that understands the unique semantics of your data. This allows the Databricks Platform to automatically optimize performance and manage infrastructure in ways unique to your business. The Data Intelligence Engine understands your organization’s language, so search and discovery of new data is as easy as asking a question like you would to a coworker.
  • 8
    Delta Lake

    Delta Lake

    Delta Lake

    Delta Lake is an open-source storage layer that brings ACID transactions to Apache Spark™ and big data workloads. Data lakes typically have multiple data pipelines reading and writing data concurrently, and data engineers have to go through a tedious process to ensure data integrity, due to the lack of transactions. Delta Lake brings ACID transactions to your data lakes. It provides serializability, the strongest level of isolation level. Learn more at Diving into Delta Lake: Unpacking the Transaction Log. In big data, even the metadata itself can be "big data". Delta Lake treats metadata just like data, leveraging Spark's distributed processing power to handle all its metadata. As a result, Delta Lake can handle petabyte-scale tables with billions of partitions and files at ease. Delta Lake provides snapshots of data enabling developers to access and revert to earlier versions of data for audits, rollbacks or to reproduce experiments.
  • Previous
  • You're on page 1
  • Next