BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Panda-Helper: Data profiling utility for Pandas DataFrames and Series
Recap tracks and transform schemas across your whole application
Make your own running home page
Always know what to expect from your data
High-Performance Symbolic Regression in Python and Julia
Making DAG construction easier
A more accurate representation of jupyter notebooks
An orchestration platform for the development, production
Library providing end-to-end GPU-accelerated recommender systems
Training data (data labeling, annotation, workflow) for all data types
Create HTML profiling reports from pandas DataFrame objects
airda(Air Data Agent
Detecting silent model failure. NannyML estimates performance
The open-source tool for building high-quality datasets
Efficiently diff rows across two different databases
A real-time visualisation of the CO2 emissions of electricity
CKAN is an open-source DMS for powering data hubs
Automatically find issues in image datasets
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Production-ready data processing made easy and shareable
re_data - fix data issues before your users & CEO would discover them
Open-source data observability for analytics engineers
Benchmarking synthetic data generation methods
Python implementation of global optimization with gaussian processes