A tool to help improve data quality standards in data science
Automatically find issues in image datasets
Create HTML profiling reports from pandas DataFrame objects
Synthetic data generators for structured and unstructured text
Training data (data labeling, annotation, workflow) for all data types
Unified metadata lake for data & AI assets.
Open source Extract Transform Load engine written in Java