Hadoop

Apache Software Foundation

About Delta Lake

Delta Lake is an open-source storage layer that brings ACID transactions to Apache Spark™ and big data workloads. Data lakes typically have multiple pipelines reading and writing data concurrently, and without transactions, data engineers must go through a tedious process to ensure data integrity. Delta Lake provides serializability, the strongest isolation level. For details, see "Diving into Delta Lake: Unpacking the Transaction Log". In big data, even the metadata can itself be "big data", so Delta Lake treats metadata just like data and uses Spark's distributed processing power to handle it. As a result, Delta Lake can handle petabyte-scale tables with billions of partitions and files with ease. Delta Lake also provides snapshots of data, enabling developers to access and revert to earlier versions for audits, rollbacks, or reproducing experiments.
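
To make the transaction-log and snapshot ideas above more concrete, here is a minimal single-machine sketch in Python. It is not Delta Lake's API or on-disk format: the `ToyTable` class, the `_toy_log` directory name, and the file names are all hypothetical, and the sketch assumes a local filesystem where exclusive file creation is atomic. Each commit is a numbered JSON file, a writer wins a version only if it creates that file first, and any earlier version can be rebuilt by replaying the log.

```python
import json
import os
import tempfile


class ToyTable:
    """Toy append-only commit log. Illustrative only, not Delta Lake's API."""

    def __init__(self, root):
        # "_toy_log" is a made-up directory name for this sketch.
        self.log_dir = os.path.join(root, "_toy_log")
        os.makedirs(self.log_dir, exist_ok=True)

    def _commit_path(self, version):
        return os.path.join(self.log_dir, f"{version:020d}.json")

    def latest_version(self):
        names = [n for n in os.listdir(self.log_dir) if n.endswith(".json")]
        return max((int(n.split(".")[0]) for n in names), default=-1)

    def commit(self, read_version, add=(), remove=()):
        """Try to write version read_version + 1; return False if it already exists."""
        path = self._commit_path(read_version + 1)
        try:
            # Mode "x" fails if the file exists, so only one writer wins a version.
            with open(path, "x") as f:
                json.dump({"add": list(add), "remove": list(remove)}, f)
        except FileExistsError:
            return False
        return True

    def snapshot(self, version=None):
        """Replay commits 0..version to get the set of live data files."""
        if version is None:
            version = self.latest_version()
        files = set()
        for v in range(version + 1):
            with open(self._commit_path(v)) as f:
                entry = json.load(f)
            files |= set(entry["add"])
            files -= set(entry["remove"])
        return files


table = ToyTable(tempfile.mkdtemp())
table.commit(-1, add=["part-0.parquet"])   # creates version 0
table.commit(0, add=["part-1.parquet"])    # creates version 1

# Two writers both read version 1 and race to create version 2.
first = table.commit(1, add=["part-2.parquet"], remove=["part-0.parquet"])
second = table.commit(1, add=["part-3.parquet"])

print(first, second)              # the second writer loses the race
print(sorted(table.snapshot()))   # current state of the table
print(sorted(table.snapshot(1)))  # the table as of version 1
```

In this sketch the losing writer simply gets `False` back; a real system would typically re-read the latest version and decide whether to retry. Reading an older snapshot never modifies the log, which is what makes audits and rollbacks cheap in this design.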

About Hadoop

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high availability, the library itself is designed to detect and handle failures at the application layer, thereby delivering a highly available service on top of a cluster of computers, each of which may be prone to failure. A wide variety of companies and organizations use Hadoop for both research and production. Users are encouraged to add themselves to the Hadoop PoweredBy wiki page. Apache Hadoop 3.3.4 incorporates a number of significant enhancements over the previous major release line (hadoop-3.2).
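
One of the "simple programming models" Hadoop is known for is MapReduce. The sketch below imitates its three stages (map, shuffle, reduce) for a word count in plain, single-process Python. It is only an illustration of the model: Hadoop's own MapReduce API is Java-based and runs these stages across many machines, and the function names here (`map_phase`, `shuffle`, `reduce_phase`, `run_job`) are hypothetical.

```python
from collections import defaultdict


def map_phase(record):
    """Emit a (word, 1) pair for every word in one input record."""
    for word in record.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group all emitted values by key, as the framework would between stages."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Combine all values for one key into a single result."""
    return key, sum(values)


def run_job(records):
    """Run map, shuffle, and reduce in sequence on an in-memory list of records."""
    pairs = (pair for record in records for pair in map_phase(record))
    groups = shuffle(pairs)
    return dict(reduce_phase(k, v) for k, v in sorted(groups.items()))


counts = run_job(["big data big clusters", "data locality"])
print(counts)
```

Because each call to `map_phase` depends only on its own record and each call to `reduce_phase` depends only on one key's values, both stages can be spread across machines, which is the property that lets the model scale from one server to a large cluster.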

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Delta Lake: Companies looking for a storage layer software solution for big data workloads
Hadoop: Data scientists and anyone looking for a platform to manage the distributed processing of large data sets

Support

Phone Support
24/7 Live Support
Online

API

Offers API

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

Training

Documentation
Webinars
Live Online
In Person

Company Information

Delta Lake
Founded: 2019
United States
delta.io

Company Information

Apache Software Foundation
Founded: 1999
United States
hadoop.apache.org

Alternatives to Delta Lake

Apache Hudi (Apache Software Foundation)

Alternatives to Hadoop

Apache Iceberg (Apache Software Foundation)
Apache Kudu (Apache Software Foundation)
Amazon EMR (Amazon)
E-MapReduce (Alibaba)
Apache Sentry (Apache Software Foundation)

Integrations

Apache Spark
IBM StreamSets
Kyvos Semantic Layer
Okera
StarTree
Talend Data Fabric
lakeFS
Acxiom Real Identity
Apache Hudi
Apache Kudu
Apache Parquet
Apache Trafodion
Ascend
Flex83
Inferyx
Mage Platform
MySQL
PuppyGraph
Trino
Value Innovation Labs Marketing Automation Platform
