Audience

Data warehouse solution that gives companies streaming primitives over Hadoop-compatible storage

About Apache Hudi

Hudi is a platform for building streaming data lakes with incremental data pipelines on a self-managing database layer, optimized for both lake engines and regular batch processing. Hudi maintains a timeline of all actions performed on the table at different instants of time, which provides instantaneous views of the table and efficiently supports retrieving data in order of arrival. A Hudi instant consists of an instant action, an instant time, and a state. Hudi provides efficient upserts by consistently mapping a given hoodie key to a file id via an indexing mechanism. This mapping between record key and file group/file id never changes once the first version of a record has been written to a file. In short, the mapped file group contains all versions of a group of records.
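The timeline and the key-to-file-group mapping described above can be illustrated with a small toy model. This is only a sketch under stated assumptions, not Hudi's implementation or API: the class ToyTable, its methods, the "fg-N" file ids, and the keys_per_group setting are all hypothetical names invented for illustration, and instants are assumed to be applied in increasing time order.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Version = Tuple[str, str, dict]   # (instant time, record key, payload)
Instant = Tuple[str, str, str]    # (instant time, action, state)


@dataclass
class ToyTable:
    """Toy model of a timeline plus a record-key index (hypothetical, not Hudi code)."""
    keys_per_group: int = 2
    index: Dict[str, str] = field(default_factory=dict)              # record key -> file id
    file_groups: Dict[str, List[Version]] = field(default_factory=dict)
    timeline: List[Instant] = field(default_factory=list)

    def upsert(self, instant_time: str, records: Dict[str, dict]) -> None:
        for key, payload in records.items():
            if key not in self.index:
                # The file id is assigned once, on the first write of a key,
                # and is never changed afterwards.
                self.index[key] = f"fg-{len(self.index) // self.keys_per_group}"
            file_id = self.index[key]
            # Every version of a record lands in the same file group.
            self.file_groups.setdefault(file_id, []).append((instant_time, key, payload))
        # Record the action on the timeline once all records are written.
        self.timeline.append((instant_time, "commit", "COMPLETED"))

    def snapshot(self) -> Dict[str, dict]:
        """Latest payload per key (relies on upserts arriving in time order)."""
        latest: Dict[str, dict] = {}
        for versions in self.file_groups.values():
            for _, key, payload in versions:
                latest[key] = payload
        return latest

    def changes_since(self, instant_time: str) -> List[Version]:
        """Versions written after the given instant, in order of arrival."""
        changed = [
            version
            for versions in self.file_groups.values()
            for version in versions
            if version[0] > instant_time
        ]
        return sorted(changed, key=lambda v: (v[0], v[1]))


table = ToyTable()
table.upsert("001", {"a": {"v": 1}, "b": {"v": 1}, "c": {"v": 1}})
table.upsert("002", {"a": {"v": 2}})  # update: key "a" stays in its original file group
print(table.index)
print(table.changes_since("001"))
```

In this sketch an update never moves a record: the second write of key "a" is appended to the file group chosen at its first write, so a reader can find every version of a record in one place, and the timeline makes it possible to ask only for what changed after a given instant.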

Company Information

Apache Software Foundation
Founded: 1999
United States
hudi.apache.org

Product Details

Platforms Supported: Cloud
Training: Documentation
Support: Online

Apache Hudi Frequently Asked Questions

Q: What kinds of users and organization types does Apache Hudi work with?
Q: What languages does Apache Hudi support in its product?
Q: What kind of support options does Apache Hudi offer?
Q: What other applications or services does Apache Hudi integrate with?
Q: What type of training does Apache Hudi provide?

Apache Hudi Product Features

Data Warehouse

In-Memory Processing
Match & Merge
ETL - Extract / Transform / Load
Data Migration
Ad hoc Query
Data Quality Control
Analytics
Data Integration