Fast, flexible and powerful Python data analysis toolkit
Orange: Interactive data analysis
Machine learning in Python
CKAN is an open-source data management system (DMS) for powering data hubs
matplotlib: plotting with Python
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
Survival analysis in Python
An orchestration platform for the development, production
Create HTML profiling reports from pandas DataFrame objects
Python ETL framework for stream processing, real-time analytics, LLM
Progress bars for threading and multiprocessing tasks on the terminal
A cross-platform installer for the Julia programming language
Python Stream Processing
Pythonic tool for running machine-learning/high-performance workflows
Python data, Leaflet.js maps
Monitor the stability of a Pandas or Spark dataframe
Synthetic data generators for structured and unstructured text
The standard data-centric AI package for data quality and ML
Detecting silent model failure. NannyML estimates performance
Data science on data without acquiring a copy
A multi-cloud framework for big data analytics
Kubeflow’s superfood for Data Scientists
Clone with Python! Data structures for double-stranded DNA