Python module that helps you build complex pipelines of batch jobs
AI-data warehouse to enrich, transform and analyze unstructured data
A Scala API for Apache Beam and Google Cloud Dataflow
Julia DataFrames serialization format
Great Expectations Airflow operator
Python ETL framework for stream processing, real-time analytics, LLM
Massive parallel data platform for analytics, machine learning and AI
A ranked list of awesome Python open-source libraries
Build, run, and manage data pipelines for integrating data
A native Julia code for lattice QCD with dynamical fermions in 4D
Official Julia implementation of Apache Arrow
Open source framework for processing, monitoring, and alerting
Distributed stream processing engine in Rust
EEGLAB is an open source signal processing environment
Integrate multiple high-dimensional datasets with fuzzy k-means
Distributed messaging and streaming platform with low latency
Upserts, Deletes And Incremental Processing on Big Data
Apache InLong - a one-stop integration framework for massive data
Efficiently diff rows across two different databases
A graph database that supports more than 100+ billion data
Docker image used to run data processing workloads
General Mission Analysis Tool
show and edit eulumdat files