Parallel computing with task scheduling
Pandas on AWS: easy integration with Athena, Glue, Redshift, etc.
A more accurate representation of Jupyter notebooks
Survival analysis in Python
Kubeflow’s superfood for Data Scientists
Create HTML profiling reports from pandas DataFrame objects
Open-source data observability for analytics engineers
Main repository for Vispy
The open-source tool for building high-quality datasets
A real-time visualisation of the CO2 emissions of electricity
Always know what to expect from your data
The standard data-centric AI package for data quality and ML
Training data (data labeling, annotation, workflow) for all data types
A curated list of data mining papers about fraud detection
Python implementation of global optimization with Gaussian processes
A Python package for interactive mapping and geospatial analysis
Concurrent Python made simple
AI-data warehouse to enrich, transform and analyze unstructured data
Great Expectations Airflow operator
An interactive Formula 1 race visualisation and data analysis tool
Spatial data processing for geomodeling
Monitor the stability of a pandas or Spark DataFrame
A Python toolbox for gaining geometric insights
Build beautiful web-based analytic apps, no JavaScript required
A reactive notebook for Python