Audience

Companies that need a data warehouse solution with streaming primitives over Hadoop-compatible storage

About Apache Hudi

Hudi is a platform for building streaming data lakes with incremental data pipelines on a self-managing database layer, while remaining optimized for lake engines and regular batch processing. Hudi maintains a timeline of all actions performed on the table at different instants of time. The timeline helps provide instantaneous views of the table and also supports efficient retrieval of data in order of arrival. A Hudi instant consists of an instant action, an instant time, and a state. Hudi provides efficient upserts by consistently mapping a given hoodie key (record key plus partition path) to a file id through an indexing mechanism. This mapping between record key and file group/file id never changes once the first version of a record has been written to a file. In short, the mapped file group contains all versions of a group of records.
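The timeline and the stable key-to-file-group mapping described above can be illustrated with a small in-memory sketch. This is not Hudi code and does not use any Hudi API: the names ToyTable, Instant, upsert, versions, and changes_since, the file id naming scheme, and the "commit"/"completed" labels are all assumptions made for illustration only.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Instant:
    time: str    # sortable instant time (hypothetical format)
    action: str  # kind of action, here always "commit"
    state: str   # here always "completed"


@dataclass
class ToyTable:
    # (record key, partition path) -> file id; assigned once, never changed
    index: Dict[Tuple[str, str], str] = field(default_factory=dict)
    # file id -> every version written to that file group
    file_groups: Dict[str, List[Tuple[str, str, dict]]] = field(default_factory=dict)
    # ordered log of actions performed on the table
    timeline: List[Instant] = field(default_factory=list)

    def upsert(self, instant_time: str, partition: str, records: Dict[str, dict]) -> None:
        # Keys seen for the first time in this call share one new file group.
        new_file_id = f"{partition}-fg{len(self.file_groups)}"
        for record_key, value in records.items():
            hoodie_key = (record_key, partition)
            # setdefault only assigns a file id on the first write of a key,
            # so later versions always land in the same file group.
            file_id = self.index.setdefault(hoodie_key, new_file_id)
            self.file_groups.setdefault(file_id, []).append(
                (instant_time, record_key, value)
            )
        self.timeline.append(Instant(instant_time, "commit", "completed"))

    def versions(self, record_key: str, partition: str) -> List[Tuple[str, dict]]:
        # All versions of a record live in its single mapped file group.
        file_id = self.index[(record_key, partition)]
        return [(t, v) for t, k, v in self.file_groups[file_id] if k == record_key]

    def changes_since(self, instant_time: str) -> List[Tuple[str, str, dict]]:
        # Incremental read: versions written after the given instant,
        # returned in order of arrival.
        changed = [
            row
            for rows in self.file_groups.values()
            for row in rows
            if row[0] > instant_time
        ]
        return sorted(changed, key=lambda row: (row[0], row[1]))


table = ToyTable()
table.upsert("001", "2024/01", {"a": {"v": 1}, "b": {"v": 1}})
table.upsert("002", "2024/01", {"a": {"v": 2}, "c": {"v": 1}})
print(table.versions("a", "2024/01"))
print(table.changes_since("001"))
```

In this sketch, the second upsert of key "a" is routed to the file group chosen at its first write, while the new key "c" starts a new one. A real index and timeline are persisted with the table and handle concurrency, which this toy model ignores.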

Company Information

The Apache Software Foundation
Founded: 1999
United States
hudi.apache.org

Product Details

Platforms Supported: Cloud
Training: Documentation
Support: Online

Apache Hudi Product Features

Data Warehouse

In-Memory Processing
Match & Merge
ETL - Extract / Transform / Load
Data Migration
Ad hoc Query
Data Quality Control
Analytics
Data Integration