Recap tracks and transforms schemas across your whole application
Make your own running home page
Create HTML profiling reports from pandas DataFrame objects
High-level, high-performance dynamic language for technical computing
airda (Air Data Agent)
Detecting silent model failure. NannyML estimates performance
Data science on data without acquiring a copy
A more accurate representation of Jupyter notebooks
Library providing end-to-end GPU-accelerated recommender systems
Automatically find issues in image datasets
Efficiently diff rows across two different databases
Visualize and compare datasets, target values and associations
Benchmarking synthetic data generation methods
TIGRE: Tomographic Iterative GPU-based Reconstruction Toolbox
AutoGluon: AutoML for Image, Text, and Tabular Data
Train machine learning models within Docker containers
Production-ready data processing made easy and shareable
Beautiful and flexible visualizations of high dimensional data
re_data - fix data issues before your users & CEO would discover them
Open-source data observability for analytics engineers
Visualizer for pandas data structures
ETL framework to index data for AI, such as RAG
Beta Machine Learning Toolkit
The open standard for data logging
Collaborative forensic timeline analysis