Data science spreadsheet with Python & SQL
Fast, flexible and powerful Python data analysis toolkit
Build, run, and manage data pipelines for integrating data
A data management tool that enables working with other SQL tools
Machine learning in Python
Docker image used to run data processing workloads
RStudio is an integrated development environment (IDE) for R
Python ETL framework for stream processing, real-time analytics, LLM
Dataset Management Framework, a Python library and a CLI tool to build
Parallel computing with task scheduling
A Python package for interactive geospaital analysis and visualization
Python module that helps you build complex pipelines of batch jobs
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
StarRocks is a next-gen sub-second MPP database for full analytics
Interactive visualization tools for Julia
A reactive notebook for Python
R interface for Apache Spark
A Python toolbox for gaining geometric insights
Making DAG construction easier
Cross-platform C++ libraries for building network applications
CKAN is an open-source DMS for powering data hubs
Stream Processing and Complex Event Processing Engine
Repository for Digital Earth Australia Jupyter Notebooks
A Python package for interactive mapping and geospatial analysis
Positron, a next-generation data science IDE