Python module that helps you build complex pipelines of batch jobs
Parallel computing with task scheduling
Python scripts for ETL (extract, transform and load) jobs for Ethereum
Data integration platform for ELT pipelines from APIs, databases
Great Expectations Airflow operator
Light-weight, flexible, expressive statistical data testing library
Mie scattering of light by perfect spheres
A tool for semi-automatic cell type classification, harmonization
A cross-platform installer for the Julia programming language
Progress bars for threading and multiprocessing tasks on terminal
Streamline your ML workflow
Docker image used to run data processing workloads
Pythonic tool for running machine-learning/high performance workflows
Main repository for Vispy
WebGL-based viewer for volumetric data
Spatial data processing for geomodeling
Integrate multiple high-dimensional datasets with fuzzy k-means
Create HTML profiling reports from pandas DataFrame objects
Uncover insights, surface problems, monitor, and fine tune your LLM
Clean Jupyter notebooks of outputs, metadata, and empty cells
3D plotting and mesh analysis through a streamlined interface
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
A lightweight opinionated ETL framework, halfway between plain scripts
Data science on data without acquiring a copy
The toolkit to test, validate, and evaluate your models and surface