Python module that helps you build complex pipelines of batch jobs
AI-data warehouse to enrich, transform and analyze unstructured data
A Scala API for Apache Beam and Google Cloud Dataflow
Julia DataFrames serialization format
Great Expectations Airflow operator
Python ETL framework for stream processing, real-time analytics, LLM
Clean network diagrams. One-time setup, zero upkeep
EEGLAB is an open source signal processing environment
Build, run, and manage data pipelines for integrating data
Massive parallel data platform for analytics, machine learning and AI
A ranked list of awesome Python open-source libraries
A native Julia code for lattice QCD with dynamical fermions in 4D
Official Julia implementation of Apache Arrow
Open source framework for processing, monitoring, and alerting
Distributed stream processing engine in Rust
Upserts, Deletes And Incremental Processing on Big Data
A graph database that supports more than 100 billion data entries
Integrate multiple high-dimensional datasets with fuzzy k-means
Distributed messaging and streaming platform with low latency
Apache InLong - a one-stop integration framework for massive data
Efficiently diff rows across two different databases
Docker image used to run data processing workloads
General Mission Analysis Tool
Show and edit eulumdat files