Always know what to expect from your data
Python module that helps you build complex pipelines of batch jobs
An orchestration platform for the development, production
Convert Python notebooks to web apps and share them with non-technical users
An open source multi-tool for exploring and publishing data
Train machine learning models within Docker containers
Docker image used to run data processing workloads
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
AutoGluon: AutoML for Image, Text, and Tabular Data
Fast, flexible and powerful Python data analysis toolkit
The open-source tool for building high-quality datasets
CKAN is an open-source DMS (data management system) for powering data hubs
Orange: Interactive data analysis
Build beautiful web-based analytic apps, no JavaScript required
Data science on data without acquiring a copy
Training data (data labeling, annotation, workflow) for all data types
Statistical data visualization in Python
A Python package for interactive mapping and geospatial analysis
Making DAG construction easier
The power of Chart.js with Python
Great Expectations Airflow operator
Automatically find issues in image datasets
Data integration platform for ELT pipelines from APIs, databases
Benchmarking synthetic data generation methods
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.