Best Real-Time Data Streaming Tools for Apache Hive

Compare the Top Real-Time Data Streaming Tools that integrate with Apache Hive as of December 2025

This a list of Real-Time Data Streaming tools that integrate with Apache Hive. Use the filters on the left to add additional filters for products that have integrations with Apache Hive. View the products that work with Apache Hive in the table below.

What are Real-Time Data Streaming Tools for Apache Hive?

Real-time data streaming tools enable organizations, big data and machine learning professionals, and data scientists to stream data in real time, and build data models when new data is created or ingested. Compare and read user reviews of the best Real-Time Data Streaming tools for Apache Hive currently available using the table below. This list is updated regularly.

  • 1
    Apache Doris

    Apache Doris

    The Apache Software Foundation

    Apache Doris is a modern data warehouse for real-time analytics. It delivers lightning-fast analytics on real-time data at scale. Push-based micro-batch and pull-based streaming data ingestion within a second. Storage engine with real-time upsert, append and pre-aggregation. Optimize for high-concurrency and high-throughput queries with columnar storage engine, MPP architecture, cost based query optimizer, vectorized execution engine. Federated querying of data lakes such as Hive, Iceberg and Hudi, and databases such as MySQL and PostgreSQL. Compound data types such as Array, Map and JSON. Variant data type to support auto data type inference of JSON data. NGram bloomfilter and inverted index for text searches. Distributed design for linear scalability. Workload isolation and tiered storage for efficient resource management. Supports shared-nothing clusters as well as separation of storage and compute.
    Starting Price: Free
  • 2
    TapData

    TapData

    TapData

    CDC-based live data platform for heterogeneous database replication, real-time data integration, or building a real-time data warehouse. By using CDC to sync production line data stored in DB2 and Oracle to the modern database, TapData enabled an AI-augmented real-time dispatch software to optimize the semiconductor production line process. The real-time data made instant decision-making in the RTD software a possibility, leading to faster turnaround times and improved yield. As one of the largest telcos, customer has many regional systems that cater to the local customers. By syncing and aggregating data from various sources and locations into a centralized data store, customers were able to build an order center where the collective orders from many applications can now be aggregated. TapData seamlessly integrates inventory data from 500+ stores, providing real-time insights into stock levels and customer preferences, enhancing supply chain efficiency.
  • Previous
  • You're on page 1
  • Next