Apache Parquet

The Apache Software Foundation

About

We created Parquet to make the advantages of compressed, efficient columnar data representation available to any project in the Hadoop ecosystem. Parquet is built from the ground up with complex nested data structures in mind, and uses the record shredding and assembly algorithm described in the Dremel paper. We believe this approach is superior to simple flattening of nested namespaces. Parquet is built to support very efficient compression and encoding schemes. Multiple projects have demonstrated the performance impact of applying the right compression and encoding scheme to the data. Parquet allows compression schemes to be specified on a per-column level, and is future-proofed to allow adding more encodings as they are invented and implemented. Parquet is built to be used by anyone. The Hadoop ecosystem is rich with data processing frameworks, and we are not interested in playing favorites.
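As a rough illustration of the per-column idea described above, the following pure-Python sketch pivots rows into columns and applies a separately chosen codec to each column. It is a toy model, not Parquet's file format or API: the sample records, the `CODECS` table, and the helper names are all hypothetical, and JSON plus `zlib` merely stand in for real encodings.

```python
import json
import zlib

# Hypothetical sample records, invented for this sketch.
rows = [
    {"id": i, "country": "NL" if i % 2 == 0 else "US", "score": i * 0.5}
    for i in range(1000)
]


def to_columns(records):
    """Pivot a list of row dicts into a dict of column lists."""
    return {name: [r[name] for r in records] for name in records[0]}


# Codec labels are assumptions of this sketch, not Parquet codec names.
# Each entry is a (compress, decompress) pair operating on bytes.
CODECS = {
    "none": (lambda b: b, lambda b: b),
    "zlib": (zlib.compress, zlib.decompress),
}

# The codec is chosen per column rather than once for the whole file.
column_codec = {"id": "zlib", "country": "zlib", "score": "none"}


def encode_columns(columns, codec_by_column):
    """Serialize and compress every column independently."""
    encoded = {}
    for name, values in columns.items():
        compress, _ = CODECS[codec_by_column[name]]
        encoded[name] = compress(json.dumps(values).encode("utf-8"))
    return encoded


def decode_column(encoded, name, codec_by_column):
    """Read back a single column without touching the others."""
    _, decompress = CODECS[codec_by_column[name]]
    return json.loads(decompress(encoded[name]).decode("utf-8"))


columns = to_columns(rows)
encoded = encode_columns(columns, column_codec)
countries = decode_column(encoded, "country", column_codec)
```

Because each column is stored on its own, a reader that only needs `country` decodes just that column, and a repetitive column can use a codec that would be wasted on a less compressible one.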

About

DuckDB is a relational database management system (RDBMS). That means it is a system for managing data stored in relations. A relation is essentially a mathematical term for a table. Each table is a named collection of rows. Each row of a given table has the same set of named columns, and each column is of a specific data type. Tables themselves are stored inside schemas, and a collection of schemas constitutes the entire database that you can access. DuckDB is suited to processing and storing tabular datasets, e.g. from CSV or Parquet files, and to transferring large result sets to the client. It is not intended for large client/server installations for centralized enterprise data warehousing, or for writing to a single database from multiple concurrent processes.
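The relational terms used above (database, schema, table, row, typed column) can be pictured with a small pure-Python model. This is only a conceptual sketch under assumed names: the `Table` class, the `main` schema, and the `ducks` table are hypothetical, and none of this is DuckDB's API.

```python
from dataclasses import dataclass, field


@dataclass
class Table:
    """A named collection of rows that all share the same typed columns."""
    name: str
    columns: dict  # column name -> Python type standing in for a data type
    rows: list = field(default_factory=list)

    def insert(self, **values):
        # Every row must supply exactly the table's named columns.
        if set(values) != set(self.columns):
            raise ValueError("row must supply exactly the table's columns")
        # Every value must match the data type declared for its column.
        for col, expected in self.columns.items():
            if not isinstance(values[col], expected):
                raise TypeError(f"column {col!r} expects {expected.__name__}")
        self.rows.append(values)


# A database is a collection of schemas; each schema holds named tables.
database = {"main": {}}
database["main"]["ducks"] = Table("ducks", {"id": int, "name": str})
database["main"]["ducks"].insert(id=1, name="Mallard")
```

Inserting a row with a missing column or a value of the wrong type raises an error here, which mirrors the rule that every row of a table has the same named, typed columns.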

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Individuals requiring a columnar storage solution available to any project in the Hadoop ecosystem

Audience

IT teams in need of a database management system

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Pricing

No information available.
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
Ease 0.0 / 5
Features 0.0 / 5
Design 0.0 / 5
Support 0.0 / 5

Reviews/Ratings

Overall 0.0 / 5
Ease 0.0 / 5
Features 0.0 / 5
Design 0.0 / 5
Support 0.0 / 5

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

The Apache Software Foundation
Founded: 1999
United States
parquet.apache.org

Company Information

DuckDB
duckdb.org

Alternatives

Apache Iceberg (The Apache Software Foundation)

Alternatives

Apache DataFusion (The Apache Software Foundation)
Apache Drill (The Apache Software Foundation)
Apache Iceberg (The Apache Software Foundation)
Apache HBase (The Apache Software Foundation)
Apache Kudu (The Apache Software Foundation)

Integrations

Flyte
PuppyGraph
QStudio
Streamkap
Tad
Amazon SageMaker Data Wrangler
AnalyticsCreator
Data Sentinel
Databricks Data Intelligence Platform
DbGate
DbVisualizer
GribStream
Kestra
MLJAR Studio
MotherDuck
Observable
PI.EXCHANGE
SkySQL
Timeplus
Warp 10

Integrations

Flyte
PuppyGraph
QStudio
Streamkap
Tad
Amazon SageMaker Data Wrangler
AnalyticsCreator
Data Sentinel
Databricks Data Intelligence Platform
DbGate
DbVisualizer
GribStream
Kestra
MLJAR Studio
MotherDuck
Observable
PI.EXCHANGE
SkySQL
Timeplus
Warp 10