Fast, flexible and powerful Python data analysis toolkit
Orange: Interactive data analysis
Python data, Leaflet.js maps
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Data integration platform for ELT pipelines from APIs, databases
Python ETL framework for stream processing, real-time analytics, LLM
Python Stream Processing
Machine learning in Python
CKAN is an open-source DMS for powering data hubs
A high-quality tool for convert PDF to Markdown and JSON
matplotlib: plotting with Python
Statistical data visualization in Python
Library providing end-to-end GPU-accelerated recommender systems
Training data (data labeling, annotation, workflow) for all data types
Spatial data processing for geomodeling
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Streamline your ML workflow
Project structure for doing and sharing data science work
A cross-platform installer for the Julia programming language
Panda-Helper: Data profiling utility for Pandas DataFrames and Series
Recap tracks and transform schemas across your whole application
Build, run, and manage data pipelines for integrating data
An orchestration platform for the development, production
Open-source data observability for analytics engineers
AutoGluon: AutoML for Image, Text, and Tabular Data