Apache DataFusionApache Software Foundation
|
Google Cloud DatalabGoogle
|
|||||
Related Products
|
||||||
About
Apache DataFusion is an extensible, high-performance query engine written in Rust that utilizes Apache Arrow as its in-memory format. Designed for developers building data-centric systems such as databases, data frames, machine learning, and streaming applications, DataFusion offers SQL and DataFrame APIs, a vectorized, multi-threaded, streaming execution engine, and support for partitioned data sources. It natively supports formats like CSV, Parquet, JSON, and Avro, and allows for seamless integration with object stores including AWS S3, Azure Blob Storage, and Google Cloud Storage. The engine features a comprehensive query planner, a state-of-the-art optimizer with capabilities like expression coercion and simplification, projection and filter pushdown, sort and distribution-aware optimizations, and automatic join reordering. DataFusion is highly customizable, enabling the addition of user-defined scalar, aggregate, and window functions, custom data sources, query languages, etc.
|
About
An easy-to-use interactive tool for data exploration, analysis, visualization, and machine learning. Cloud Datalab is a powerful interactive tool created to explore, analyze, transform, and visualize data and build machine learning models on Google Cloud Platform. It runs on Compute Engine and connects to multiple cloud services easily so you can focus on your data science tasks. Cloud Datalab is built on Jupyter (formerly IPython), which boasts a thriving ecosystem of modules and a robust knowledge base. Cloud Datalab enables analysis of your data on BigQuery, AI Platform, Compute Engine, and Cloud Storage using Python, SQL, and JavaScript (for BigQuery user-defined functions). Whether you're analyzing megabytes or terabytes, Cloud Datalab has you covered. Query terabytes of data in BigQuery, run local analysis on sampled data, and run training jobs on terabytes of data in AI Platform seamlessly.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
Professional developers and data engineers seeking a solution for building data-centric systems
|
Audience
Anyone who needs an interactive tool for data exploration, analysis, visualization, and machine learning
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
Free
Free Version
Free Trial
|
Pricing
No information available.
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationApache Software Foundation
Founded: 2019
United States
datafusion.apache.org
|
Company InformationGoogle
Founded: 1998
United States
cloud.google.com/datalab
|
|||||
Alternatives |
Alternatives |
|||||
|
||||||
|
|
|||||
|
|
|||||
Categories |
Categories |
|||||
Integrations
Amazon S3
Apache Arrow
Apache Avro
Apache Parquet
Azure Blob Storage
C
DataLab
Google Cloud Platform
Google Cloud Storage
Google Sheets
|
Integrations
Amazon S3
Apache Arrow
Apache Avro
Apache Parquet
Azure Blob Storage
C
DataLab
Google Cloud Platform
Google Cloud Storage
Google Sheets
|
|||||
|
|