Apache DataFusion

Apache DataFusion

Apache Software Foundation
+
+

Related Products

  • StarTree
    25 Ratings
    Visit Website
  • Azore CFD
    14 Ratings
    Visit Website
  • SureSync
    13 Ratings
    Visit Website
  • RaimaDB
    5 Ratings
    Visit Website
  • Epsilon3
    259 Ratings
    Visit Website
  • FusionAuth
    119 Ratings
    Visit Website
  • Google Cloud BigQuery
    1,731 Ratings
    Visit Website
  • Vertex AI
    713 Ratings
    Visit Website
  • Kubit
    33 Ratings
    Visit Website
  • QuantaStor
    6 Ratings
    Visit Website

About

Apache DataFusion is an extensible, high-performance query engine written in Rust that utilizes Apache Arrow as its in-memory format. Designed for developers building data-centric systems such as databases, data frames, machine learning, and streaming applications, DataFusion offers SQL and DataFrame APIs, a vectorized, multi-threaded, streaming execution engine, and support for partitioned data sources. It natively supports formats like CSV, Parquet, JSON, and Avro, and allows for seamless integration with object stores including AWS S3, Azure Blob Storage, and Google Cloud Storage. The engine features a comprehensive query planner, a state-of-the-art optimizer with capabilities like expression coercion and simplification, projection and filter pushdown, sort and distribution-aware optimizations, and automatic join reordering. DataFusion is highly customizable, enabling the addition of user-defined scalar, aggregate, and window functions, custom data sources, query languages, etc.

About

Infinite retention for Apache Kafka® with Confluent. Be infrastructure-enabled, not infrastructure-restricted Legacy technologies require you to choose between being real-time or highly-scalable. Event streaming enables you to innovate and win - by being both real-time and highly-scalable. Ever wonder how your rideshare app analyzes massive amounts of data from multiple sources to calculate real-time ETA? Ever wonder how your credit card company analyzes millions of credit card transactions across the globe and sends fraud notifications in real-time? The answer is event streaming. Move to microservices. Enable your hybrid strategy through a persistent bridge to cloud. Break down silos to demonstrate compliance. Gain real-time, persistent event transport. The list is endless.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Professional developers and data engineers seeking a solution for building data-centric systems

Audience

Companies that want to create real-time data pipelines

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

Free
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Apache Software Foundation
Founded: 2019
United States
datafusion.apache.org

Company Information

Confluent
Founded: 2015
United States
www.confluent.io

Alternatives

AnySQL Maestro

AnySQL Maestro

SQL Maestro Group

Alternatives

Amazon MSK

Amazon MSK

Amazon
HyperSQL DataBase

HyperSQL DataBase

The hsql Development Group

Categories

Categories

Streaming Analytics Features

Data Enrichment
Data Wrangling / Data Prep
Multiple Data Source Support
Process Automation
Real-time Analysis / Reporting
Visualization Dashboards

Integrations

APERIO DataWise
Ably
Amazon S3
Amazon Web Services (AWS)
Apache Arrow
Arroyo
Azure Blob Storage
Borneo
Diamanti
Google Cloud Pub/Sub
Google Cloud Storage
InsightFinder
Knoldus
Observo AI
Onehouse
Parasoft
SAP Business Data Cloud
SQL
Scuba
Theom

Integrations

APERIO DataWise
Ably
Amazon S3
Amazon Web Services (AWS)
Apache Arrow
Arroyo
Azure Blob Storage
Borneo
Diamanti
Google Cloud Pub/Sub
Google Cloud Storage
InsightFinder
Knoldus
Observo AI
Onehouse
Parasoft
SAP Business Data Cloud
SQL
Scuba
Theom
Claim Apache DataFusion and update features and information
Claim Apache DataFusion and update features and information
Claim Confluent and update features and information
Claim Confluent and update features and information