Benchmarking synthetic data generation methods
Open-source DORA metrics platform for engineering teams
Create HTML profiling reports from pandas DataFrame objects
The open standard for data logging
Video stabilization using gyroscope data
A Julia package for data clustering
Curated list of classic, high-quality computer science books
Synthetic data curation for post-training and data extraction
Scalable master data management and identity resolution
A high-quality tool for convert PDF to Markdown and JSON
Library to encode and decode images in WebP format
Panda-Helper: Data profiling utility for Pandas DataFrames and Series
Great Expectations Airflow operator
Easily generate information-rich, publication-quality tables from R
Open-source data observability for analytics engineers
Data quality assessment and metadata reporting for data frames
Log management solution that improves the performance of SIEM
Raspberry Pi config for all things Internet
DataCap is integrated software for data transformation
Dataset Management Framework, a Python library and a CLI tool to build
A JavaScript framework for creating ambitious web applications
Flexible Photo Recrafting While Preserving Your Identity
The best JavaScript Data Table for building enterprise applications
A Gem for creating partial anonymized dumps of your database
Diablo build for modern operating systems