Apache DataFusionApache Software Foundation
|
Rons Data StreamRons Place Software
|
|||||
Related Products
|
||||||
About
Apache DataFusion is an extensible, high-performance query engine written in Rust that utilizes Apache Arrow as its in-memory format. Designed for developers building data-centric systems such as databases, data frames, machine learning, and streaming applications, DataFusion offers SQL and DataFrame APIs, a vectorized, multi-threaded, streaming execution engine, and support for partitioned data sources. It natively supports formats like CSV, Parquet, JSON, and Avro, and allows for seamless integration with object stores including AWS S3, Azure Blob Storage, and Google Cloud Storage. The engine features a comprehensive query planner, a state-of-the-art optimizer with capabilities like expression coercion and simplification, projection and filter pushdown, sort and distribution-aware optimizations, and automatic join reordering. DataFusion is highly customizable, enabling the addition of user-defined scalar, aggregate, and window functions, custom data sources, query languages, etc.
|
About
Rons Data Stream is a windows application designed to clean, or update, multiple data sources within seconds, whatever the size of the files, through the use of Cleaners.
"Cleaners" are made up of a list of operations that are selected from a broad list of Column, Row and Cell processing rules. They can be built, saved and applied to as many data sources as required, and re-used with as many Jobs as needed. The Preview window displays both the original data and a preview of the processed data. The result of each rule is therefore very clear and comprehensible.
"Jobs" contain all the detail needed for batch processing allowing 100's of data files to be processed in one go, making cleaning a whole directory an easy task.
Rons Data Stream handles tabular text formats (CSV, HMTL, XML files and tokenized formats), SQL and Parquet, from loading to converting. It can work individually or hand in hand with Rons Data Edit, adding power to both applications.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
Professional developers and data engineers seeking a solution for building data-centric systems
|
Audience
Users that need a batch CSV processor to clean (multiple) data files automatically
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
Free
Free Version
Free Trial
|
Pricing
$35
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationApache Software Foundation
Founded: 2019
United States
datafusion.apache.org
|
Company InformationRons Place Software
Founded: 2013
Canada
www.ronsplace.ca/products/ronsdatastream
|
|||||
Alternatives |
Alternatives |
|||||
|
|
||||||
|
|
||||||
Categories |
Categories |
|||||
Integrations
Amazon S3
Apache Arrow
Apache Avro
Apache Parquet
Azure Blob Storage
C
Google Cloud Storage
Google Sheets
JSON
Microsoft Excel
|
Integrations
Amazon S3
Apache Arrow
Apache Avro
Apache Parquet
Azure Blob Storage
C
Google Cloud Storage
Google Sheets
JSON
Microsoft Excel
|
|||||
|
|
|