Data integration platform for ELT pipelines from APIs, databases
A curated list of data mining papers about fraud detection
Data science on data without acquiring a copy
Train machine learning models within Docker containers
Recap tracks and transforms schemas across your whole application
Great Expectations Airflow operator
Spatial data processing for geomodeling
The open standard for data logging
A Python toolbox for gaining geometric insights
Light-weight, flexible, expressive statistical data testing library
3D plotting and mesh analysis through a streamlined interface
Library providing end-to-end GPU-accelerated recommender systems
Docker image used to run data processing workloads
Integrate multiple high-dimensional datasets with fuzzy k-means
Mie scattering of light by perfect spheres
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Burp Suite extension for JavaScript static analysis
An AI-powered data science team of agents
Making DAG construction easier
Benchmarking synthetic data generation methods
Panda-Helper: Data profiling utility for Pandas DataFrames and Series
Materials and IPython notebooks for "Python for Data Analysis"
A tool for semi-automatic cell type classification, harmonization
The power of Chart.js with Python
Open-source data observability for analytics engineers