Fast, flexible and powerful Python data analysis toolkit
Orange: Interactive data analysis
Machine learning in Python
An orchestration platform for the development, production
matplotlib: plotting with Python
Python ETL framework for stream processing, real-time analytics, LLM
CKAN is an open-source DMS for powering data hubs
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Dataset Management Framework, a Python library and a CLI tool to build
Create HTML profiling reports from pandas DataFrame objects
Python data, Leaflet.js maps
Positron, a next-generation data science IDE
A cross-platform installer for the Julia programming language
Recap tracks and transform schemas across your whole application
Panda-Helper: Data profiling utility for Pandas DataFrames and Series
Spatial data processing for geomodeling
Monitor the stability of a Pandas or Spark dataframe
A Python toolbox for gaining geometric insights
Light-weight, flexible, expressive statistical data testing library
The open-source tool for building high-quality datasets
Parallel computing with task scheduling
Python Stream Processing
The open standard for data logging
Uncover insights, surface problems, monitor, and fine tune your LLM
Making DAG construction easier