Efficiently diff rows across two different databases
Data processing for and with foundation models
Orange: Interactive data analysis
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Self-learning data agent that grounds its answers in layers of content
Links to everything you'd ever want to learn about data engineering
Project structure for doing and sharing data science work
An end-to-end Data Scientist
Minimal examples of data structures and algorithms in Python
Tool for generating high quality Synthetic datasets
Synthetic Data Generation for tabular, relational and time series data
An AI-powered data science team of agents
dude uncomplicated data extraction: A simple framework
Data Science Guide With Videos And Materials
Official DeiT repository
Yahoo! Finance market data downloader
Fast, flexible and powerful Python data analysis toolkit
Shredos Disk Eraser 64 bit for all Intel 64 bit processors
Blender addons to make the bridge between Blender and geographic data
Machine learning in Python
Data integration platform for ELT pipelines from APIs, databases
OCRmyPDF adds an OCR text layer to scanned PDF files
Training data (data labeling, annotation, workflow) for all data types
Conditional GAN for generating synthetic tabular data
Label Studio is a multi-type data labeling and annotation tool