Data science spreadsheet with Python & SQL
Fast, flexible and powerful Python data analysis toolkit
Build, run, and manage data pipelines for integrating data
Machine learning in Python
Docker image used to run data processing workloads
A data management tool that enables working with other SQL tools
RStudio is an integrated development environment (IDE) for R
Python ETL framework for stream processing, real-time analytics, LLM
Parallel computing with task scheduling
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
StarRocks is a next-gen sub-second MPP database for full analytics
Python module that helps you build complex pipelines of batch jobs
Dataset Management Framework, a Python library and a CLI tool to build
A Python package for interactive geospatial analysis and visualization
Cross-platform C++ libraries for building network applications
A reactive notebook for Python
R interface for Apache Spark
CKAN is an open-source data management system (DMS) for powering data hubs
Making DAG construction easier
Stream Processing and Complex Event Processing Engine
Repository for Digital Earth Australia Jupyter Notebooks
A Python toolbox for gaining geometric insights
WebGL-based viewer for volumetric data
Interactive visualization tools for Julia