The standard data-centric AI package for data quality and ML
Training data (data labeling, annotation, workflow) for all data types
AutoGluon: AutoML for Image, Text, and Tabular Data
Best practices on recommendation systems
Fast, flexible and powerful Python data analysis toolkit
Orange: Interactive data analysis
CKAN is an open-source DMS (data management system) for powering data hubs
Machine learning in Python
matplotlib: plotting with Python
Survival analysis in Python
Synthetic data generators for structured and unstructured text
Python data, Leaflet.js maps
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Python ETL framework for stream processing, real-time analytics, LLM
Uncover insights, surface problems, monitor, and fine-tune your LLM
A cross-platform installer for the Julia programming language
Create HTML profiling reports from pandas DataFrame objects
Pythonic tool for running machine-learning/high-performance workflows
An orchestration platform for the development, production
Python Stream Processing
Clone with Python! Data structures for double-stranded DNA
WebGL-based viewer for volumetric data
A Python package for interactive geospatial analysis and visualization
Progress bars for threading and multiprocessing tasks on terminal