Automatically find issues in image datasets
Integrate multiple high-dimensional datasets with fuzzy k-means
Visualize and compare datasets, target values and associations
Docker image used to run data processing workloads
A small set of Python functions to draw pretty maps from OpenStreetMap
Create HTML profiling reports from pandas DataFrame objects
Experimental Julia implementation of the Amazon Braket SDK
Make your own running home page
Python implementation of global optimization with gaussian processes
Light-weight, flexible, expressive statistical data testing library
CKAN is an open-source DMS for powering data hubs
Streamline your ML workflow
3D plotting and mesh analysis through a streamlined interface
A toolkit to run Ray applications on Kubernetes
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
Repository for Digital Earth Australia Jupyter Notebooks
A lightweight opinionated ETL framework, halfway between plain scripts
airda(Air Data Agent
Collection of handy tools for Go projects
A data visualization and analytics component
Synthetic data generators for structured and unstructured text
Training data (data labeling, annotation, workflow) for all data types
AutoGluon: AutoML for Image, Text, and Tabular Data
Efficiently diff rows across two different databases
A framework for real-life data science