Great Expectations Airflow operator
Create HTML profiling reports from pandas DataFrame objects
Library providing end-to-end GPU-accelerated recommender systems
A real-time visualisation of the CO2 emissions of electricity
Benchmarking synthetic data generation methods
Dataset Management Framework, a Python library and a CLI tool to build
Python Stream Processing
Monitor the stability of a Pandas or Spark dataframe
Pythonic tool for running machine-learning/high performance workflows
A Python toolbox for gaining geometric insights
An AI-powered data science team of agents
Clean Jupyter notebooks of outputs, metadata, and empty cells
Scale your Pandas workflows by changing a single line of code
Making DAG construction easier
Synthetic data generators for structured and unstructured text
Always know what to expect from your data
Project structure for doing and sharing data science work
tensorboard for pytorch (and chainer, mxnet, numpy, etc.)
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
WebGL-based viewer for volumetric data
A dedicated app for collecting thousands of POI for OpenStreetMap
The standard data-centric AI package for data quality and ML
AutoGluon: AutoML for Image, Text, and Tabular Data
Data science on data without acquiring a copy
Train machine learning models within Docker containers