Training data (data labeling, annotation, workflow) for all data types
AutoGluon: AutoML for Image, Text, and Tabular Data
Light-weight, flexible, expressive statistical data testing library
Recap tracks and transform schemas across your whole application
Docker image used to run data processing workloads
The open standard for data logging
WebGL-based viewer for volumetric data
Clone with Python! Data structures for double stranded DNA
Library providing end-to-end GPU-accelerated recommender systems
Concurrent Python made simple
Great Expectations Airflow operator
Open-source data observability for analytics engineers
Main repository for Vispy
Build, run, and manage data pipelines for integrating data
Make your own running home page
Integrate multiple high-dimensional datasets with fuzzy k-means
Dataset Management Framework, a Python library and a CLI tool to build
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Mie scattering of light by perfect spheres
Uncover insights, surface problems, monitor, and fine tune your LLM
An AI-powered data science team of agents
A curated list of data mining papers about fraud detection
Panda-Helper: Data profiling utility for Pandas DataFrames and Series
Materials and IPython notebooks for "Python for Data Analysis"
Metadata and data identification tool and Python library