Kubeflow’s superfood for Data Scientists
Great Expectations Airflow operator
Spatial data processing for geomodeling
Monitor the stability of a Pandas or Spark dataframe
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Visualize and compare datasets, target values and associations
An AI-powered data science team of agents
Clean Jupyter notebooks of outputs, metadata, and empty cells
High-Performance Symbolic Regression in Python and Julia
Making DAG construction easier
An interactive Formula 1 race visualisation and data analysis tool
Detecting silent model failure. NannyML estimates performance
Data science on data without acquiring a copy
A more accurate representation of jupyter notebooks
Collaborative forensic timeline analysis
An open source multi-tool for exploring and publishing data
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Python implementation of global optimization with gaussian processes
Create HTML profiling reports from pandas DataFrame objects
Open-source data observability for analytics engineers
Diagram generation for understanding codebases and system architecture
The open standard for data logging
Benchmarking synthetic data generation methods
Scale your Pandas workflows by changing a single line of code
Synthetic data generators for structured and unstructured text