A toolkit to run Ray applications on Kubernetes
Gradle plugin that adds a 'taskTree' task that prints the task dependency tree
A data visualization and analytics component
Distributed stream processing engine in Rust
Fluid, elastic data abstraction and acceleration for BigData/AI apps
Boosted trees in Julia
ETL framework to index data for AI, such as RAG
Kestra is an infinitely scalable orchestration and scheduling platform
Open data: more than 50 financial datasets
Valkey & Redis Java client. Real-Time Data Platform
A reactive notebook for Python
Metadata/data identification Java library
Python module that helps you build complex pipelines of batch jobs
R Interface to Python
Lightweight + fast physical quantities in Julia
Distributed Big Data Orchestration Service
Open Source Data Orchestration for the Cloud
Python Stream Processing
Backstage is an open platform for building developer portals
WebGL-based viewer for volumetric data
lakeFS - Git-like capabilities for your object storage
The open big data serving engine
DBMS supporting graph, document, full-text and geospatial models
Metadata and data identification tool and Python library
pprof is a tool for visualization and analysis of profiling data