Showing 212 open source projects for "python data analysis"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Best-of Python

    Best-of Python

    A ranked list of awesome Python open-source libraries

    ...If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome! Ranked list of awesome python libraries for web development. Correctly generate plurals, ordinals, indefinite articles; convert numbers. Libraries for loading, collecting, and extracting data from a variety of data sources and formats. Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    Complete-Python-Bootcamp

    Complete-Python-Bootcamp

    The complete Python bootcamp guide

    ...Later sections dive into advanced Python concepts, including memory management, multithreading, and multiprocessing, giving learners exposure to performance and concurrency topics that many beginner courses skip. The bootcamp also includes dedicated modules for data analysis with Python, working with databases, and logging, helping learners connect Python skills to data science and backend engineering tasks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Complete-Python-3-Bootcamp

    Complete-Python-3-Bootcamp

    Course Files for Complete Python 3 Bootcamp Course on Udemy

    The Complete-Python-3-Bootcamp repository is an educational resource created by Pierian Data as part of their popular Python for Data Science and Machine Learning Bootcamp course. It contains a comprehensive collection of Jupyter Notebooks designed to teach Python programming from the ground up. The repository covers a wide range of Python topics, including data types, control flow, functions, object-oriented programming, error handling, modules, and advanced concepts like decorators and generators. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    Book2_Beauty-of-Data-Visualization

    Book2_Beauty-of-Data-Visualization

    Machine Learning, Criticism and Correction

    Book2_Beauty-of-Data-Visualization is an open educational project that teaches the principles and techniques of effective data visualization using Python and modern plotting libraries. The repository focuses on both the technical and aesthetic aspects of visual analytics, helping learners understand how to communicate data clearly and persuasively. It includes practical examples that demonstrate how different chart types reveal patterns, trends, and distributions in real datasets. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    The Data Engineering Handbook

    The Data Engineering Handbook

    Links to everything you'd ever want to learn about data engineering

    The Data Engineering Handbook is a comprehensive, community-curated repository that aggregates essential learning resources for anyone interested in becoming a professional data engineer. Rather than being a code project itself, it’s a learning handbook that links to books, articles, tutorials, community groups, boot camps, and real-world project examples that collectively form a roadmap to mastering data engineering skills. It includes beginner and intermediate boot camps, interview guides,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Hello Python

    Hello Python

    Comprehensive tutorial repository aimed at teaching the Python program

    Hello-Python is a comprehensive tutorial repository aimed at teaching the Python programming language from scratch for beginners. It includes over 100 classes and about 44 hours of video instruction, combined with code samples, projects, and a chat community for support. The material covers the fundamentals—variables, data types, loops, functions—as well as intermediate topics like date handling, list comprehensions, file IO, regular expressions, modules, and packages. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Perfect Roadmap To Learn Data Science

    Perfect Roadmap To Learn Data Science

    Basic To Intermediate Python data science guide

    Perfect Roadmap To Learn Data Science In 2025 is an extended, updated learning pathway curated for the modern data-science landscape — blending classical data-analysis, statistics, machine learning, deep learning, computer vision, NLP, as well as current deployment and MLOps practices to prepare learners for data-science careers in 2025. The roadmap is organized to guide learners systematically: starting with Python fundamentals and math/statistics, then progressing through classical machine-learning, deep-learning, data preprocessing, feature engineering, and onto domain-specific applications like computer vision or NLP, ending with deployment, real-world project construction, and best practices for production readiness. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Book1_Python-For-Beginners

    Book1_Python-For-Beginners

    The Iris Book: Addition, Subtraction, Multiplication, and Division

    ...The material is structured to support self-paced learning, making it suitable for students, career switchers, and hobbyists. Because the book is part of a larger data science pathway, it also prepares readers for later work in visualization and machine learning. Overall, it serves as an accessible on-ramp into Python within a broader analytical learning journey.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    The Grand Complete Data Science Guide

    The Grand Complete Data Science Guide

    Data Science Guide With Videos And Materials

    ...The repository bundles tutorials, lecture notes, project outlines, course materials, and references across topics like Python, statistics, ML algorithms, deep learning, NLP, data preprocessing, model evaluation, and real-world problem solving. Its broad scope makes it particularly suitable for beginners or self-taught programmers who want an end-to-end learning track — from fundamentals all the way to building and deploying ML or AI systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    Python Zero to Hero for DevOps Engineers

    Python Zero to Hero for DevOps Engineers

    Learn Python from DevOps Engineer point of you

    Python Zero to Hero for DevOps Engineers is a structured “Python Zero to Hero for DevOps Engineers” course laid out as a day-by-day learning path. The repository is organized into Day-01 through Day-19 folders plus a small sample app, which makes it very easy to follow in sequence like a bootcamp. The curriculum starts with Python installation, environment setup, and writing your first script, then quickly moves into data types, strings, regular expressions, variables, and functions. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Stanza

    Stanza

    Stanford NLP Python library for many human languages

    Stanza is a collection of accurate and efficient tools for the linguistic analysis of many human languages. Starting from raw text to syntactic analysis and entity recognition, Stanza brings state-of-the-art NLP models to languages of your choosing. Stanza is a Python natural language analysis package. It contains tools, which can be used in a pipeline, to convert a string containing human language text into lists of sentences and words, to generate base forms of those words, their parts of speech and morphological features, to give a syntactic structure dependency parse, and to recognize named entities. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 12
    Think Python 2

    Think Python 2

    LaTeX source and supporting code for Think Python, 2nd edition

    ThinkPython2 is the repository for the second edition of Allen Downey’s Think Python textbook, which teaches programming fundamentals in Python to beginners. The code includes all of the example programs, exercises, and supplementary files referenced in the book, allowing learners to run the examples, experiment, and extend them. The repository contains clean, well-commented Python scripts that are easy to follow and map directly to chapters of the text, covering topics like variables, control flow, functions, recursion, data structures (lists, dictionaries), classes and objects, file I/O, and algorithmic thinking. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Python Core 50 Courses

    Python Core 50 Courses

    Structured learning path that organizes Python fundamentals

    Python-Core-50-Courses is a structured learning path that organizes Python fundamentals into 50 digestible lessons designed for steady, incremental progress. The curriculum starts with the basics—syntax, variables, data types, and control flow—then advances to functions, modules, object-oriented programming, and common standard-library utilities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Book5_Essentials-Probability-Statistics

    Book5_Essentials-Probability-Statistics

    The book 5 of statistics in simplicity

    ...Like the other books in the series, it blends mathematical explanation with Python-based experimentation. Overall, the project provides a practical statistical foundation for students advancing into AI and data science.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    PythonPark

    PythonPark

    Python open source project "The Road to Self-Study Programming"

    PythonPark is a large, curated “learning playground” for Python — essentially a comprehensive self-study meta-repository aimed at helping learners progress in Python programming, data science, machine learning, web scraping, and software engineering practices. It aggregates tutorials, learning guides, project examples, and resources across topics: from Python basics and data structures to machine learning, web scraping, and even interview preparation and “programmer life” guidance. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    LeetCode Book

    LeetCode Book

    Comprehensive study guide for coding interviews

    LeetCode-Book is a comprehensive study guide for coding interviews that consolidates algorithm patterns, data-structure templates, and worked LeetCode solutions. It organizes problems by topic—arrays, linked lists, stacks/queues, trees/graphs, dynamic programming, greedy, backtracking, and math—so you can study systematically. Explanations are concise but intentional, highlighting why a pattern fits, how to reason about boundary cases, and the time/space trade-offs. Many entries include...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 17
    PocketFlow Tutorial Codebase Knowledge
    PocketFlow Tutorial Codebase Knowledge is a project that demonstrates how to build an AI agent capable of analyzing arbitrary codebases and generating beginner-friendly tutorials that explain how they work, turning complex source code into clear educational content. The repository builds on a lightweight 100-line LLM framework and uses natural language models to inspect repository structures, identify core abstractions, map dependencies, and articulate the reasoning behind code design and...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    AutoResearchClaw

    AutoResearchClaw

    Autonomous research from idea to paper. Chat an Idea. Get a Paper 🦞

    AutoResearchClaw is an open-source framework designed to automatically generate full academic research papers from a single idea or topic. Built in Python, it orchestrates a multi-stage research pipeline that gathers literature, formulates hypotheses, runs experiments, analyzes results, and writes the final paper. The system retrieves real academic references from sources such as arXiv and Semantic Scholar to ensure credible citations. It can automatically generate code for experiments, run...
    Downloads: 31 This Week
    Last Update:
    See Project
  • 19
    i-Educar

    i-Educar

    Launching the most free educational software in Brazil

    Accessible from anywhere and with single student registration available for the entire education network. Time-saving for everyone. Get current quantitative, financial and statistical data on all processes, at the time and place you want. Evaluation system and reports adapted to the different realities of the country, with numerical, conceptual or descriptive evaluation notes. Management of allocations, removals, substitutions, absences and delays, offering an integrated view of all professionals. Time frame management for analysis of demands and availability of professionals in the education network in each school period. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 20
    Interviews

    Interviews

    Comprehensive computer science and interview preparation guide

    Interviews is an open source repository designed as a comprehensive guide to preparing for technical interviews, with a strong focus on computer science fundamentals and algorithmic problem solving. Created by Kevin Naughton Jr., the project compiles detailed notes, explanations, and code implementations that cover core areas tested in software engineering interviews. The repository emphasizes topics such as data structures, algorithms, system design, operating systems, databases, and...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 21
    Recommenders

    Recommenders

    Best practices on recommendation systems

    The Recommenders repository provides examples and best practices for building recommendation systems, provided as Jupyter notebooks. The module reco_utils contains functions to simplify common tasks used when developing and evaluating recommender systems. Several utilities are provided in reco_utils to support common tasks such as loading datasets in the format expected by different algorithms, evaluating model outputs, and splitting training/test data. Implementations of several...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Archivematica

    Archivematica

    Free and open-source digital preservation system

    Archivematica is a web- and standards-based, open-source application which allows your institution to preserve long-term access to trustworthy, authentic, and reliable digital content. Our target users are archivists, librarians, and anyone working to preserve digital objects. You are free to copy, modify, and distribute Archivematica with attribution under the terms of the AGPLv3 license. Archivematica is an open-source application based on recognized standards that makes it possible to...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 23
    Scientific Visualization

    Scientific Visualization

    An open access book on scientific visualization using python

    The Scientific Visualization book is a freely available open-access textbook that introduces how to produce effective scientific visualizations using Python, focusing especially on leveraging the popular plotting library Matplotlib (and related tools). It goes beyond simple plotting tutorials and emphasizes design principles: how to choose colors, layout subplots, annotate graphs, and present data in a way that is both accurate and visually compelling. As such, it serves as a guide for researchers, data scientists, and academic authors who need to create publication-quality figures or explanatory graphics, rather than quick exploratory plots. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    nanoGPT

    nanoGPT

    The simplest, fastest repository for training/finetuning models

    NanoGPT is a minimalistic yet powerful reimplementation of GPT-style transformers created by Andrej Karpathy for educational and research use. It distills the GPT architecture into a few hundred lines of Python code, making it far easier to understand than large, production-scale implementations. The repo is organized with a training pipeline (dataset preprocessing, model definition, optimizer, training loop) and inference script so you can train a small GPT on text datasets like Shakespeare...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    Interpretable machine learning

    Interpretable machine learning

    Book about interpretable machine learning

    This book is about interpretable machine learning. Machine learning is being built into many products and processes of our daily lives, yet decisions made by machines don't automatically come with an explanation. An explanation increases the trust in the decision and in the machine learning model. As the programmer of an algorithm you want to know whether you can trust the learned model. Did it learn generalizable features? Or are there some odd artifacts in the training data which the...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB