Blue Whale smart cloud configuration platform
Project structure for doing and sharing data science work
Automatic extraction of relevant features from time series
Distributed scheduled job framework
Graph theory library for visualization and analysis
Apache Polaris, the interoperable, open source catalog
Fast and streamable Excalidraw MCP App
A simple and composable way to validate data in JavaScript
Positron, a next-generation data science IDE
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
brms R package for Bayesian generalized multivariate models using Stan
High-performance, open source distributed transaction solution
TIGRE: Tomographic Iterative GPU-based Reconstruction Toolbox
Repository for Digital Earth Australia Jupyter Notebooks
A Python package for interactive geospaital analysis and visualization
Parallel file processing made easy
A scientific machine learning (SciML) wrapper for the FEniCS
Chemical reaction network and systems biology interface
Java 1-17 Parser and Abstract Syntax Tree for Java
Apache DevLake is an open-source dev data platform
Open Source Data Management Software
Data quality assessment and metadata reporting for data frames
Build, run, and manage data pipelines for integrating data
The standard data-centric AI package for data quality and ML
JuiceFS is a distributed POSIX file system built on top of Redis