Fast, flexible and powerful Python data analysis toolkit
Orange: Interactive data analysis
matplotlib: plotting with Python
Machine learning in Python
CKAN is an open-source DMS for powering data hubs
An orchestration platform for the development, production
Uncover insights, surface problems, monitor, and fine tune your LLM
Data integration platform for ELT pipelines from APIs, databases
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Python ETL framework for stream processing, real-time analytics, LLM
A cross-platform installer for the Julia programming language
Light-weight, flexible, expressive statistical data testing library
Python data, Leaflet.js maps
Positron, a next-generation data science IDE
Create HTML profiling reports from pandas DataFrame objects
Spatial data processing for geomodeling
Monitor the stability of a Pandas or Spark dataframe
AI-data warehouse to enrich, transform and analyze unstructured data
Dataset Management Framework, a Python library and a CLI tool to build
Build beautiful web-based analytic apps, no JavaScript required
Panda-Helper: Data profiling utility for Pandas DataFrames and Series
Benchmarking synthetic data generation methods
Project structure for doing and sharing data science work
Parallel computing with task scheduling
Train machine learning models within Docker containers