Dataset Management Framework, a Python library and a CLI tool to build
Python Stream Processing
Spatial data processing for geomodeling
Monitor the stability of a Pandas or Spark dataframe
The open standard for data logging
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
A Python toolbox for gaining geometric insights
Visualize and compare datasets, target values and associations
Burp Suite extension for JavaScript static analysis
An AI-powered data science team of agents
Clean Jupyter notebooks of outputs, metadata, and empty cells
Scale your Pandas workflows by changing a single line of code
Making DAG construction easier
Light-weight, flexible, expressive statistical data testing library
Streamline your ML workflow
A real-time visualisation of the CO2 emissions of electricity
3D plotting and mesh analysis through a streamlined interface
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
Build, run, and manage data pipelines for integrating data
The standard data-centric AI package for data quality and ML
Detecting silent model failure. NannyML estimates performance
Training data (data labeling, annotation, workflow) for all data types
AutoGluon: AutoML for Image, Text, and Tabular Data
Create HTML profiling reports from pandas DataFrame objects
Train machine learning models within Docker containers