Benchmarking synthetic data generation methods
Lightweight library to write, orchestrate and test your SQL ETL
Automatically find issues in image datasets
re_data - fix data issues before your users & CEO would discover them
re_data - fix data issues before your users & CEO would discover them
A tool to help improve data quality standards in data science
A scalable, unified data and AI engineering platform for enterprise
An easy, extensible web based IT service management platform
Mentalese Database Engine
World's first open source data quality & data preparation project
Design, automate, operate and publish data pipelines at scale
osDQ dedicated to create apache spark based data pipeline using JSON
Data quality analysis, profiling, cleansing, duplicate detection +more
Simple Scientific Workflow System for CAGE Analysis
This is sister project for osDQ which provide Restful APIs