Apache Parquet

Apache Parquet

The Apache Software Foundation
+
+

Related Products

  • Google Cloud BigQuery
    1,871 Ratings
    Visit Website
  • Teradata VantageCloud
    972 Ratings
    Visit Website
  • AnalyticsCreator
    46 Ratings
    Visit Website
  • Google Cloud SQL
    541 Ratings
    Visit Website
  • icCube
    30 Ratings
    Visit Website
  • RaimaDB
    5 Ratings
    Visit Website
  • Amazon Web Services (AWS)
    4,300 Ratings
    Visit Website
  • Hightouch
    437 Ratings
    Visit Website
  • Docket
    53 Ratings
    Visit Website
  • CartonCloud
    78 Ratings

About

More customers pick Amazon Redshift than any other cloud data warehouse. Redshift powers analytical workloads for Fortune 500 companies, startups, and everything in between. Companies like Lyft have grown with Redshift from startups to multi-billion dollar enterprises. No other data warehouse makes it as easy to gain new insights from all your data. With Redshift you can query petabytes of structured and semi-structured data across your data warehouse, operational database, and your data lake using standard SQL. Redshift lets you easily save the results of your queries back to your S3 data lake using open formats like Apache Parquet to further analyze from other analytics services like Amazon EMR, Amazon Athena, and Amazon SageMaker. Redshift is the world’s fastest cloud data warehouse and gets faster every year. For performance intensive workloads you can use the new RA3 instances to get up to 3x the performance of any cloud data warehouse.

About

We created Parquet to make the advantages of compressed, efficient columnar data representation available to any project in the Hadoop ecosystem. Parquet is built from the ground up with complex nested data structures in mind, and uses the record shredding and assembly algorithm described in the Dremel paper. We believe this approach is superior to simple flattening of nested namespaces. Parquet is built to support very efficient compression and encoding schemes. Multiple projects have demonstrated the performance impact of applying the right compression and encoding scheme to the data. Parquet allows compression schemes to be specified on a per-column level, and is future-proofed to allow adding more encodings as they are invented and implemented. Parquet is built to be used by anyone. The Hadoop ecosystem is rich with data processing frameworks, and we are not interested in playing favorites.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Companies that need a powerful and fast cloud data warehouse solution

Audience

Individuals requiring a columnar storage solution available to any project in the Hadoop ecosystem

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

$0.25 per hour
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Amazon
Founded: 1994
United States
aws.amazon.com/redshift/

Company Information

The Apache Software Foundation
Founded: 1999
United States
parquet.apache.org

Alternatives

Alternatives

Apache Iceberg

Apache Iceberg

Apache Software Foundation
Amazon S3

Amazon S3

Amazon
Apache HBase

Apache HBase

The Apache Software Foundation
Apache Kudu

Apache Kudu

The Apache Software Foundation

Categories

Categories

Integrations

Amazon Data Firehose
Amazon SageMaker Data Wrangler
Blotout
Gravity Data
Indexima Data Hub
Mage Platform
Mage Sensitive Data Discovery
Meltano
PuppyGraph
SDF
SSIS Integration Toolkit
StarfishETL
Streamkap
Timbr.ai
Tonic Ephemeral
DataOps DataFlow
RATH
Sequelize
Sopact Impact Cloud
Style Intelligence

Integrations

Amazon Data Firehose
Amazon SageMaker Data Wrangler
Blotout
Gravity Data
Indexima Data Hub
Mage Platform
Mage Sensitive Data Discovery
Meltano
PuppyGraph
SDF
SSIS Integration Toolkit
StarfishETL
Streamkap
Timbr.ai
Tonic Ephemeral
DataOps DataFlow
RATH
Sequelize
Sopact Impact Cloud
Style Intelligence
Claim Amazon Redshift and update features and information
Claim Amazon Redshift and update features and information
Claim Apache Parquet and update features and information
Claim Apache Parquet and update features and information