CSV Lint plug-in for Notepad++ for syntax highlighting
A tool to help improve data quality standards in data science
An orchestration platform for the development, production
Uncover insights, surface problems, monitor, and fine tune your LLM
lakeFS - Git-like capabilities for your object storage
Create HTML profiling reports from pandas DataFrame objects
The toolkit to test, validate, and evaluate your models and surface
The standard data-centric AI package for data quality and ML
Create HTML profiling reports from pandas DataFrame objects
Automatically find issues in image datasets
The open-source tool for building high-quality datasets
Efficiently diff rows across two different databases
Great Expectations Airflow operator
Qualitis is a one-stop data quality management platform
First open-source data discovery and observability platform
Training data (data labeling, annotation, workflow) for all data types
Synthetic data generators for structured and unstructured text
Benchmarking synthetic data generation methods
re_data - fix data issues before your users & CEO would discover them
re_data - fix data issues before your users & CEO would discover them
An easy, extensible web based IT service management platform
Generalized Interoperability and Strong AI
NBi is a testing framework (add-on to NUnit)
A scalable, unified data and AI engineering platform for enterprise
Lightweight library to write, orchestrate and test your SQL ETL