The “statistics-for-data-scientists” repository is a pedagogical resource designed to bridge rigorous statistical theory and practical data science workflows. Its code and materials are intended to help data scientists and analysts grasp statistical principles (e.g., probability, inference, regression, hypothesis testing, confidence intervals) in contexts relevant to real data analysis tasks. The repository includes Jupyter notebooks, R scripts, worked examples, and possibly problem sets that illustrate how statistical methods are applied to real datasets. It aims to close the gap between textbook statistics and empirical modeling by walking through assumption checking, visualization, the interpretation of outputs, and common pitfalls of misuse. ...