Apache Hive

Apache Hive

Apache Software Foundation
Apache Spark

Apache Spark

Apache Software Foundation
+
+

Related Products

  • Google Cloud BigQuery
    1,861 Ratings
    Visit Website
  • StarTree
    26 Ratings
    Visit Website
  • AnalyticsCreator
    46 Ratings
    Visit Website
  • ActiveBatch Workload Automation
    353 Ratings
    Visit Website
  • Semarchy xDM
    63 Ratings
    Visit Website
  • Google Cloud Run
    270 Ratings
    Visit Website
  • Twilio
    1,301 Ratings
    Visit Website
  • Quick Consols
    49 Ratings
    Visit Website
  • PeerGFS
    22 Ratings
    Visit Website
  • JOpt.TourOptimizer
    8 Ratings
    Visit Website

About

The Apache Hive data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Structure can be projected onto data already in storage. A command line tool and JDBC driver are provided to connect users to Hive. Apache Hive is an open source project run by volunteers at the Apache Software Foundation. Previously it was a subproject of Apache® Hadoop®, but has now graduated to become a top-level project of its own. We encourage you to learn about the project and contribute your expertise. Traditional SQL queries must be implemented in the MapReduce Java API to execute SQL applications and queries over distributed data. Hive provides the necessary SQL abstraction to integrate SQL-like queries (HiveQL) into the underlying Java without the need to implement queries in the low-level Java API.

About

Apache Spark™ is a unified analytics engine for large-scale data processing. Apache Spark achieves high performance for both batch and streaming data, using a state-of-the-art DAG scheduler, a query optimizer, and a physical execution engine. Spark offers over 80 high-level operators that make it easy to build parallel apps. And you can use it interactively from the Scala, Python, R, and SQL shells. Spark powers a stack of libraries including SQL and DataFrames, MLlib for machine learning, GraphX, and Spark Streaming. You can combine these libraries seamlessly in the same application. Spark runs on Hadoop, Apache Mesos, Kubernetes, standalone, or in the cloud. It can access diverse data sources. You can run Spark using its standalone cluster mode, on EC2, on Hadoop YARN, on Mesos, or on Kubernetes. Access data in HDFS, Alluxio, Apache Cassandra, Apache HBase, Apache Hive, and hundreds of other data sources.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Developers and anyone looking for a data warehouse software that facilitates reading, writing, and managing large datasets using SQL

Audience

Organizations that want a unified analytics engine for large-scale data processing

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

No information available.
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 5.0 / 5
ease 5.0 / 5
features 4.0 / 5
design 5.0 / 5
support 5.0 / 5

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Apache Software Foundation
Founded: 1999
United States
hive.apache.org

Company Information

Apache Software Foundation
Founded: 1999
United States
spark.apache.org

Alternatives

Apache Drill

Apache Drill

The Apache Software Foundation

Alternatives

dbt

dbt

dbt Labs
Apache HBase

Apache HBase

The Apache Software Foundation
AWS Glue

AWS Glue

Amazon
Apache Hudi

Apache Hudi

Apache Corporation
Apache Spark

Apache Spark

Apache Software Foundation

Categories

Categories

Streaming Analytics Features

Data Enrichment
Data Wrangling / Data Prep
Multiple Data Source Support
Process Automation
Real-time Analysis / Reporting
Visualization Dashboards

Integrations

Alteryx
Amazon EMR
Amundsen
Apache Hudi
Apache Iceberg
Apache Kylin
Apache Zeppelin
DataHub
Dataiku
Flyte
IBM watsonx.data
Inferyx
MLlib
Mage Dynamic Data Masking
Okera
PHEMI Health DataLab
StarRocks
Timbr.ai
Union Cloud
lakeFS

Integrations

Alteryx
Amazon EMR
Amundsen
Apache Hudi
Apache Iceberg
Apache Kylin
Apache Zeppelin
DataHub
Dataiku
Flyte
IBM watsonx.data
Inferyx
MLlib
Mage Dynamic Data Masking
Okera
PHEMI Health DataLab
StarRocks
Timbr.ai
Union Cloud
lakeFS
Claim Apache Hive and update features and information
Claim Apache Hive and update features and information
Claim Apache Spark and update features and information
Claim Apache Spark and update features and information