Training data (data labeling, annotation, workflow) for all data types
Uncover insights, surface problems, monitor, and fine tune your LLM
The open-source tool for building high-quality datasets
Create HTML profiling reports from pandas DataFrame objects
The standard data-centric AI package for data quality and ML
Benchmarking synthetic data generation methods
Generalized Interoperability and Strong AI
NBi is a testing framework (add-on to NUnit)
EZStacking is Jupyter notebook generator for machine learning
osDQ dedicated to create apache spark based data pipeline using JSON
Data quality analysis, profiling, cleansing, duplicate detection +more
This is sister project for osDQ which provide Restful APIs