Fast, flexible and powerful Python data analysis toolkit
Orange: Interactive data analysis
matplotlib: plotting with Python
Machine learning in Python
A high-quality tool for convert PDF to Markdown and JSON
The Pocket Datalab
Python data, Leaflet.js maps
CKAN is an open-source DMS for powering data hubs
Data integration platform for ELT pipelines from APIs, databases
AI-data warehouse to enrich, transform and analyze unstructured data
Create HTML profiling reports from pandas DataFrame objects
Training data (data labeling, annotation, workflow) for all data types
Create HTML profiling reports from pandas DataFrame objects
Synthetic data generators for structured and unstructured text
The open-source tool for building high-quality datasets
Python Stream Processing
Monitor the stability of a Pandas or Spark dataframe
Making DAG construction easier
Recap tracks and transform schemas across your whole application
The standard data-centric AI package for data quality and ML
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Always know what to expect from your data
A more accurate representation of jupyter notebooks
Python module that helps you build complex pipelines of batch jobs
Python implementation of global optimization with gaussian processes