Fast, flexible and powerful Python data analysis toolkit
Orange: Interactive data analysis
matplotlib: plotting with Python
Data integration platform for ELT pipelines from APIs, databases
Machine learning in Python
CKAN is an open-source DMS for powering data hubs
A high-quality tool for convert PDF to Markdown and JSON
Python data, Leaflet.js maps
Collaborative forensic timeline analysis
The open-source tool for building high-quality datasets
Create HTML profiling reports from pandas DataFrame objects
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Create HTML profiling reports from pandas DataFrame objects
The toolkit to test, validate, and evaluate your models and surface
Always know what to expect from your data
WebGL-based viewer for volumetric data
The standard data-centric AI package for data quality and ML
AI-data warehouse to enrich, transform and analyze unstructured data
Training data (data labeling, annotation, workflow) for all data types
Python Stream Processing
Uncover insights, surface problems, monitor, and fine tune your LLM
Monitor the stability of a Pandas or Spark dataframe
Build beautiful web-based analytic apps, no JavaScript required
Making DAG construction easier
Synthetic data generators for structured and unstructured text