Best Columnar Databases for Apache Airflow

Compare the Top Columnar Databases that integrate with Apache Airflow as of June 2025

This is a list of columnar databases that integrate with Apache Airflow. Use the filters on the left to narrow the list to products with specific Apache Airflow integrations. The products that work with Apache Airflow appear in the table below.

What are Columnar Databases for Apache Airflow?

Columnar databases, also known as column-oriented databases or column-store databases, are a type of database that stores data by column instead of by row. Because a query reads only the columns it references, and values of the same type compress well together, columnar databases are typically faster and more storage-efficient than traditional row-oriented databases for analytical workloads. Compare and read user reviews of the best Columnar Databases for Apache Airflow currently available using the table below. This list is updated regularly.
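As a rough illustration of the row-versus-column distinction described above, the following Python sketch pivots a few records into per-column lists and computes the same average from both layouts. The sample data, field names, and function names are hypothetical and chosen only for illustration. Real columnar engines add compression, indexing, and on-disk formats that this sketch does not attempt to model.

```python
# Hypothetical sample data; the field names are illustrative only.
rows = [
    {"ts": "2025-06-01", "country": "DE", "latency_ms": 120},
    {"ts": "2025-06-01", "country": "US", "latency_ms": 95},
    {"ts": "2025-06-02", "country": "DE", "latency_ms": 110},
]


def to_columns(records):
    """Pivot a list of row dicts into one list of values per column."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


def average_row_store(records, field):
    # Row layout: every whole record is visited just to read one field.
    return sum(record[field] for record in records) / len(records)


def average_column_store(columns, field):
    # Column layout: only the requested column is touched.
    values = columns[field]
    return sum(values) / len(values)


columns = to_columns(rows)
```

Both functions return the same result; the difference is how much data each layout has to read to produce it, which is where column stores tend to gain on analytical queries.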

  • 1
    Apache Druid
    Apache Druid is an open source distributed data store. Druid’s core design combines ideas from data warehouses, timeseries databases, and search systems to create a high-performance real-time analytics database for a broad range of use cases. Druid merges key characteristics of each of the three systems into its ingestion layer, storage format, querying layer, and core architecture. Druid stores and compresses each column individually and reads only the columns needed for a particular query, which supports fast scans, rankings, and groupBys. Druid creates inverted indexes for string values to speed up search and filtering. It provides out-of-the-box connectors for Apache Kafka, HDFS, AWS S3, stream processors, and more. Druid partitions data by time, so time-based queries are significantly faster than in traditional databases. Scale up or down by adding or removing servers, and Druid rebalances automatically. Its fault-tolerant architecture routes around server failures.
  • 2
    CrateDB

    The enterprise database for time series, documents, and vectors. Store any type of data and combine the simplicity of SQL with the scalability of NoSQL. CrateDB is an open source distributed database running queries in milliseconds, whatever the complexity, volume and velocity of data.