A multi-cloud framework for big data analytics
Parallel computing with task scheduling
Python scripts for ETL (extract, transform and load) jobs for Ethereum
Great Expectations Airflow operator
Light-weight, flexible, expressive statistical data testing library
WebGL-based viewer for volumetric data
A real-time visualisation of the CO2 emissions of electricity
Mie scattering of light by perfect spheres
A cross-platform installer for the Julia programming language
A tool for semi-automatic cell type classification, harmonization
Data integration platform for ELT pipelines from APIs, databases
Progress bars for threading and multiprocessing tasks on terminal
Pythonic tool for running machine-learning/high performance workflows
Python module that helps you build complex pipelines of batch jobs
Main repository for Vispy
Integrate multiple high-dimensional datasets with fuzzy k-means
Uncover insights, surface problems, monitor, and fine tune your LLM
CKAN is an open-source DMS for powering data hubs
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
A lightweight opinionated ETL framework, halfway between plain scripts
Always know what to expect from your data
Create HTML profiling reports from pandas DataFrame objects
An orchestration platform for the development, production
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Spatial data processing for geomodeling