Training data (data labeling, annotation, workflow) for all data types
Automatically find issues in image datasets
Always know what to expect from your data
Spatial data processing for geomodeling
Monitor the stability of a Pandas or Spark dataframe
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
CKAN is an open-source DMS for powering data hubs
Collaborative forensic timeline analysis
Clean Jupyter notebooks of outputs, metadata, and empty cells
High-Performance Symbolic Regression in Python and Julia
Making DAG construction easier
Streamline your ML workflow
Panda-Helper: Data profiling utility for Pandas DataFrames and Series
Recap tracks and transform schemas across your whole application
Make your own running home page
The open standard for data logging
airda(Air Data Agent
Detecting silent model failure. NannyML estimates performance
Data science on data without acquiring a copy
A more accurate representation of jupyter notebooks
Library providing end-to-end GPU-accelerated recommender systems
An open source multi-tool for exploring and publishing data
tensorboard for pytorch (and chainer, mxnet, numpy, etc.)
Python implementation of global optimization with gaussian processes
Production-ready data processing made easy and shareable