CKAN is an open-source DMS for powering data hubs
Synthetic data generators for structured and unstructured text
Automatic extraction of relevant features from time series
Streamline your ML workflow
Automatically find issues in image datasets
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Efficiently diff rows across two different databases
Interactive visualization tools for Julia
Panda-Helper: Data profiling utility for Pandas DataFrames and Series
Optimal transport algorithms for Julia
Project structure for doing and sharing data science work
Scalable and Flexible Gradient Boosting
An AI-powered data science team of agents
Clean Jupyter notebooks of outputs, metadata, and empty cells
A package for Counterfactual Explanations and Algorithmic Recourse
Collaborative forensic timeline analysis
Repository for Digital Earth Australia Jupyter Notebooks
High-level, high-performance dynamic language for technical computing
Open-source data observability for analytics engineers
Algorithms from circuit theory to predict connectivity
Graphical User Interface Toolkit for Python with minimal dependencies
Always know what to expect from your data
tensorboard for pytorch (and chainer, mxnet, numpy, etc.)
Production-ready data processing made easy and shareable
TIGRE: Tomographic Iterative GPU-based Reconstruction Toolbox