Fast, flexible and powerful Python data analysis toolkit
Orange: Interactive data analysis
matplotlib: plotting with Python
Machine learning in Python
Data integration platform for ELT pipelines from APIs, databases
A high-quality tool for convert PDF to Markdown and JSON
Python data, Leaflet.js maps
CKAN is an open-source DMS for powering data hubs
Collaborative forensic timeline analysis
Create HTML profiling reports from pandas DataFrame objects
Create HTML profiling reports from pandas DataFrame objects
Training data (data labeling, annotation, workflow) for all data types
The open-source tool for building high-quality datasets
Always know what to expect from your data
AI-data warehouse to enrich, transform and analyze unstructured data
WebGL-based viewer for volumetric data
The standard data-centric AI package for data quality and ML
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Python Stream Processing
The toolkit to test, validate, and evaluate your models and surface
Monitor the stability of a Pandas or Spark dataframe
Uncover insights, surface problems, monitor, and fine tune your LLM
Build beautiful web-based analytic apps, no JavaScript required
Making DAG construction easier
Synthetic data generators for structured and unstructured text