The toolkit to test, validate, and evaluate your models and surface
Create HTML profiling reports from pandas DataFrame objects
Open-source data observability for analytics engineers
Convert Python notebook to web app and share with non-technical users
Repository for the Astropy core package
Python Stream Processing
Monitor the stability of a Pandas or Spark dataframe
The open standard for data logging
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
A Python toolbox for gaining geometric insights
Collaborative forensic timeline analysis
Clean Jupyter notebooks of outputs, metadata, and empty cells
High-Performance Symbolic Regression in Python and Julia
Scale your Pandas workflows by changing a single line of code
Making DAG construction easier
Light-weight, flexible, expressive statistical data testing library
Synthetic data generators for structured and unstructured text
The open-source tool for building high-quality datasets
Streamline your ML workflow
Efficiently diff rows across two different databases
Project structure for doing and sharing data science work
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
airda(Air Data Agent
The standard data-centric AI package for data quality and ML
Detecting silent model failure. NannyML estimates performance