A lightweight opinionated ETL framework, halfway between plain scripts
Great Expectations Airflow operator
Automatically find issues in image datasets
Always know what to expect from your data
Monitor the stability of a Pandas or Spark dataframe
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Clean Jupyter notebooks of outputs, metadata, and empty cells
High-Performance Symbolic Regression in Python and Julia
Making DAG construction easier
Streamline your ML workflow
Panda-Helper: Data profiling utility for Pandas DataFrames and Series
The open standard for data logging
airda(Air Data Agent
Detecting silent model failure. NannyML estimates performance
Data science on data without acquiring a copy
A more accurate representation of jupyter notebooks
CKAN is an open-source DMS for powering data hubs
Library providing end-to-end GPU-accelerated recommender systems
Collaborative forensic timeline analysis
tensorboard for pytorch (and chainer, mxnet, numpy, etc.)
Python implementation of global optimization with gaussian processes
The standard data-centric AI package for data quality and ML
Create HTML profiling reports from pandas DataFrame objects
Production-ready data processing made easy and shareable
re_data - fix data issues before your users & CEO would discover them