Great Expectations Airflow operator
Uncover insights, surface problems, monitor, and fine-tune your LLM
Spatial data processing for geomodeling
Monitor the stability of a pandas or Spark DataFrame
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Visualize and compare datasets, target values and associations
An AI-powered data science team of agents
Clean Jupyter notebooks of outputs, metadata, and empty cells
High-Performance Symbolic Regression in Python and Julia
Making DAG construction easier
Training data (data labeling, annotation, workflow) for all data types
An interactive Formula 1 race visualisation and data analysis tool
Detecting silent model failure. NannyML estimates performance
Data science on data without acquiring a copy
A more accurate representation of Jupyter notebooks
A real-time visualisation of the CO2 emissions of electricity
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
An open source multi-tool for exploring and publishing data
Python implementation of global optimization with Gaussian processes
Create HTML profiling reports from pandas DataFrame objects
Open-source data observability for analytics engineers
Diagram generation for understanding codebases and system architecture
The open standard for data logging
Benchmarking synthetic data generation methods
Collaborative forensic timeline analysis