Apache DataFusionApache Software Foundation
|
tapDigital Society
|
|||||
Related Products
|
||||||
About
Apache DataFusion is an extensible, high-performance query engine written in Rust that utilizes Apache Arrow as its in-memory format. Designed for developers building data-centric systems such as databases, data frames, machine learning, and streaming applications, DataFusion offers SQL and DataFrame APIs, a vectorized, multi-threaded, streaming execution engine, and support for partitioned data sources. It natively supports formats like CSV, Parquet, JSON, and Avro, and allows for seamless integration with object stores including AWS S3, Azure Blob Storage, and Google Cloud Storage. The engine features a comprehensive query planner, a state-of-the-art optimizer with capabilities like expression coercion and simplification, projection and filter pushdown, sort and distribution-aware optimizations, and automatic join reordering. DataFusion is highly customizable, enabling the addition of user-defined scalar, aggregate, and window functions, custom data sources, query languages, etc.
|
About
Turn spreadsheets and data files into production-ready APIs without writing backend code. Upload CSV, JSONL, Parquet and other formats, clean and join them with familiar SQL, and expose secure, documented endpoints instantly. Built-in features include auto-generated OpenAPI docs, API key security, geospatial filters with H3 indexing, usage monitoring, and high-performance queries. You can also download transformed datasets anytime to avoid vendor lock-in. Works for single files, combined datasets, or public data portals with minimal setup.
Key features
- Create secure, documented APIs directly from CSV, JSONL, and Parquet.
- Run familiar SQL queries to clean, join, and enrich data.
- No backend setup or servers to configure or maintain.
- Auto-generated OpenAPI documentation for every endpoint you create.
- Secure endpoints with API keys and isolated storage for safety.
- Geospatial filters, H3 indexing, and fast, optimised queries at scale.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
Professional developers and data engineers seeking a solution for building data-centric systems
|
Audience
Developers needing to quickly operationalise data or data teams wanting to share data
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
Free
Free Version
Free Trial
|
Pricing
$10/month
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationApache Software Foundation
Founded: 2019
United States
datafusion.apache.org
|
Company InformationDigital Society
Founded: 2023
United Kingdom
tapintodata.com
|
|||||
Alternatives |
Alternatives |
|||||
Categories |
Categories |
|||||
Integrations
Amazon S3
Apache Arrow
Apache Avro
Apache Parquet
Azure Blob Storage
C
Google Cloud Storage
Google Sheets
JSON
Microsoft Excel
|
Integrations
Amazon S3
Apache Arrow
Apache Avro
Apache Parquet
Azure Blob Storage
C
Google Cloud Storage
Google Sheets
JSON
Microsoft Excel
|
|||||
|
|
|