Open-source data observability for analytics engineers
A package for Counterfactual Explanations and Algorithmic Recourse
Always know what to expect from your data
Julia Devito inversion
Production-ready data processing made easy and shareable
Automatically find issues in image datasets
Beautiful and flexible visualizations of high-dimensional data
Create HTML profiling reports from pandas DataFrame objects
High-level, high-performance dynamic language for technical computing
Train machine learning models within Docker containers
Privacy- and security-focused Segment alternative, in Golang
The open standard for data logging
Uncover insights, surface problems, monitor, and fine-tune your LLM
Python implementation of global optimization with Gaussian processes
Beta Machine Learning Toolkit
Training data (data labeling, annotation, workflow) for all data types
A data visualization and analytics component
A real-time visualisation of the CO2 emissions of electricity
Repository for Digital Earth Australia Jupyter Notebooks
Exploratory analysis of Bayesian models with Julia
ETL framework to index data for AI, such as RAG
Scale your Pandas workflows by changing a single line of code
Synthetic data generators for structured and unstructured text
Project structure for doing and sharing data science work
Automatic extraction of relevant features from time series