Data science on data without acquiring a copy
Docker image used to run data processing workloads
Train machine learning models within Docker containers
Python module that helps you build complex pipelines of batch jobs
Detecting silent model failure. NannyML estimates performance
Python implementation of global optimization with gaussian processes
A reactive notebook for Python
Python scripts for ETL (extract, transform and load) jobs for Ethereum
3D plotting and mesh analysis through a streamlined interface
Uncover insights, surface problems, monitor, and fine tune your LLM
Benchmarking synthetic data generation methods
Efficiently diff rows across two different databases
Light-weight, flexible, expressive statistical data testing library
A more accurate representation of jupyter notebooks
High-Performance Symbolic Regression in Python and Julia
Automatically find issues in image datasets
Scale your Pandas workflows by changing a single line of code
re_data - fix data issues before your users & CEO would discover them
Great Expectations Airflow operator
Code review for data in dbt
A real-time visualisation of the CO2 emissions of electricity
Always know what to expect from your data
Clone with Python! Data structures for double stranded DNA
A Python package for interactive mapping and geospatial analysis
Build beautiful web-based analytic apps, no JavaScript required