Analyze computation-communication overlap in V3/R1
Data processing for and with foundation models
Git-based data version control for machine learning workflows
Data Science Roadmap from A to Z
Collection of useful data science topics along with articles
Curated list of data science interview questions and answers
Data science interview questions and answers
SDG is a specialized framework
Self-learning data agent that grounds its answers in layers of content
An end-to-end Data Scientist
A Collection of Cheatsheets, Books, Questions, and Portfolio
Synthetic Data Generation for tabular, relational and time series data
From Addition, Subtraction, Multiplication, and Division to ML
Data annotator for machine learning
A curated list of applied machine learning and data science notebooks
Deep Research framework, combining language models with tools
Project aimed at extracting, exporting, and analyzing chat records
Video-based AI memory library. Store millions of text chunks in MP4
Official DeiT repository
Your own personal AI assistant. Any OS. Any Platform.
Machine learning in Python
OCRmyPDF adds an OCR text layer to scanned PDF files
Self-contained, offline survival computer with tools, knowledge, & AI
Free and source-available fair-code licensed workflow automation tool
Text mining using tidy tools