Fast, flexible and powerful Python data analysis toolkit
Orange: Interactive data analysis
Python data, Leaflet.js maps
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Data integration platform for ELT pipelines from APIs, databases
Python ETL framework for stream processing, real-time analytics, LLM
Spatial data processing for geomodeling
Python Stream Processing
CKAN is an open-source DMS for powering data hubs
A high-quality tool for converting PDF to Markdown and JSON
Recap tracks and transforms schemas across your whole application
matplotlib: plotting with Python
Machine learning in Python
Statistical data visualization in Python
Library providing end-to-end GPU-accelerated recommender systems
Training data (data labeling, annotation, workflow) for all data types
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Streamline your ML workflow
Project structure for doing and sharing data science work
Panda-Helper: Data profiling utility for Pandas DataFrames and Series
A cross-platform installer for the Julia programming language
An orchestration platform for the development, production
Open-source data observability for analytics engineers
AutoGluon: AutoML for Image, Text, and Tabular Data
An open source multi-tool for exploring and publishing data