Efficiently diff rows across two different databases
Orange: Interactive data analysis
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Project structure for doing and sharing data science work
An AI-powered data science team of agents
Fast, flexible and powerful Python data analysis toolkit
Yahoo! Finance market data downloader
Machine learning in Python
Data integration platform for ELT pipelines from APIs, databases
Training data (data labeling, annotation, workflow) for all data types
Python data, Leaflet.js maps
CKAN is an open-source DMS for powering data hubs
The open-source tool for building high-quality datasets
Create HTML profiling reports from pandas DataFrame objects
Synthetic data generators for structured and unstructured text
A reactive notebook for Python
An open source multi-tool for exploring and publishing data
Positron, a next-generation data science IDE
Build, run, and manage data pipelines for integrating data
Repository for the Astropy core package
AutoGluon: AutoML for Image, Text, and Tabular Data
airda(Air Data Agent
Data science on data without acquiring a copy
Always know what to expect from your data
The open standard for data logging