Unified metadata lake for data & AI assets.
MCPower — simple Monte Carlo power analysis for complex models
SADSA (Software Application for Data Science and Analytics)
Real-time, incremental ETL library for ML with record-level depend
OpenPTV
a package with useful scripts for X-ray diffraction physicists
Uma Ferramenta Computacional para Análise e Recuperação de Patentes
Statistical data visualization in Python
Fast and efficient plotting of images inside Python Notebooks
re_data - fix data issues before your users & CEO would discover them
A lightweight opinionated ETL framework, halfway between plain scripts
Code review for data in dbt
Open-source metadata collector based on ODD Specification
Serve machine learning models within a Docker container
Open-source GCP metadata collector based on ODD Specification
All-in-one text de-duplication
many useful snippets for using python in a laboratory
Deep neural networks for density functional theory Hamiltonian
Reference mapping for single-cell genomics
Streaming reactive and dataflow graphs in Python
Swiple enables you to easily observe, understand, validate data
Build data pipelines, the easy way
Missing data visualization module for Python