Showing 85 open source projects for "data analysis"

  • 1
    Perfect Roadmap To Learn Data Science

    Basic To Intermediate Python data science guide

    Perfect Roadmap To Learn Data Science In 2025 is an extended, updated learning pathway curated for the modern data-science landscape — blending classical data-analysis, statistics, machine learning, deep learning, computer vision, NLP, as well as current deployment and MLOps practices to prepare learners for data-science careers in 2025. The roadmap is organized to guide learners systematically: starting with Python fundamentals and math/statistics, then progressing through classical machine-learning, deep-learning, data preprocessing, feature engineering, and onto domain-specific applications like computer vision or NLP, ending with deployment, real-world project construction, and best practices for production readiness. ...
    Downloads: 0 This Week
  • 2
    Complete-Python-3-Bootcamp

    Course Files for Complete Python 3 Bootcamp Course on Udemy

    ...In addition, it includes applied exercises in areas such as web scraping, working with APIs, and using Python libraries like NumPy, pandas, Matplotlib, and Seaborn for data analysis and visualization. Learners can progress from beginner-friendly basics to more advanced programming skills while reinforcing their knowledge with practice problems and projects. Because it mirrors the course content, this repository is widely used by students taking the Udemy course.
    Downloads: 7 This Week
  • 3
    Complete-Python-Bootcamp

    The complete Python bootcamp guide

    ...The bootcamp also includes dedicated modules for data analysis with Python, working with databases, and logging, helping learners connect Python skills to data science and backend engineering tasks.
    Downloads: 0 This Week
  • 4
    i-Educar

    Launching the largest free educational software in Brazil

    Accessible from anywhere and with single student registration available for the entire education network. Time-saving for everyone. Get current quantitative, financial and statistical data on all processes, at the time and place you want. Evaluation system and reports adapted to the different realities of the country, with numerical, conceptual or descriptive evaluation notes. Management of allocations, removals, substitutions, absences and delays, offering an integrated view of all professionals. Time frame management for analysis of demands and availability of professionals in the education network in each school period. ...
    Downloads: 2 This Week
  • 5
    Interviews

    Comprehensive computer science and interview preparation guide

    Interviews is an open source repository designed as a comprehensive guide to preparing for technical interviews, with a strong focus on computer science fundamentals and algorithmic problem solving. Created by Kevin Naughton Jr., the project compiles detailed notes, explanations, and code implementations that cover core areas tested in software engineering interviews. The repository emphasizes topics such as data structures, algorithms, system design, operating systems, databases, and...
    Downloads: 5 This Week
  • 6
    Stanza

    Stanford NLP Python library for many human languages

    Stanza is a collection of accurate and efficient tools for the linguistic analysis of many human languages. Starting from raw text to syntactic analysis and entity recognition, Stanza brings state-of-the-art NLP models to languages of your choosing. Stanza is a Python natural language analysis package. It contains tools, which can be used in a pipeline, to convert a string containing human language text into lists of sentences and words, to generate base forms of those words, their parts of speech and morphological features, to give a syntactic structure dependency parse, and to recognize named entities. ...
    Downloads: 1 This Week
  • 7
    LeetCode Book

    Comprehensive study guide for coding interviews

    LeetCode-Book is a comprehensive study guide for coding interviews that consolidates algorithm patterns, data-structure templates, and worked LeetCode solutions. It organizes problems by topic—arrays, linked lists, stacks/queues, trees/graphs, dynamic programming, greedy, backtracking, and math—so you can study systematically. Explanations are concise but intentional, highlighting why a pattern fits, how to reason about boundary cases, and the time/space trade-offs. Many entries include...
    Downloads: 4 This Week
  • 8
    Book5_Essentials-Probability-Statistics

    Book 5: statistics made simple

    Book5_Essentials-of-Probability-and-Statistics is a Visualize-ML educational volume that introduces the statistical and probabilistic concepts underpinning modern data analysis and machine learning. The repository explains topics such as distributions, sampling, inference, and uncertainty using visual demonstrations and intuitive narratives. Its teaching philosophy prioritizes conceptual clarity over heavy formalism, making statistical thinking more approachable for beginners. The material connects probability theory directly to real analytical workflows, helping learners understand how statistics supports predictive modeling. ...
    Downloads: 0 This Week
  • 9
    Kubernetes Handbook

    Cloud native application architecture practice handbook

    Cloud native is a behavioral method and design concept. In its essence, all behaviors or methods that can improve resource utilization and application delivery efficiency on the cloud are cloud-native. The history of cloud computing is a history of cloud native. Kubernetes opened the prelude to cloud native 1.0. The emergence of service mesh Istio led to microservices in the post-Kubernetes era. The rise of serverless has enabled cloud native to advance from the infrastructure layer to the...
    Downloads: 0 This Week
  • 10

    MatCont

    Numerical Bifurcation Analysis Toolbox in Matlab

    ...There is a manual with detailed descriptions of the data.
    Downloads: 137 This Week
  • 11
    xrayutilities

    a package with useful scripts for X-ray diffraction physicists

    xrayutilities is a Python package used to analyze X-ray diffraction data. It can assist in performing diffraction experiments and handles common steps in the data analysis. It can read experimental data from several data formats (spec, edf, xrdml, ...), convert them to reciprocal space for arbitrary goniometer geometries and different detector systems (point, linear, and area detectors), and grid the data (transform it to a regular grid) for further processing. ...
    Downloads: 81 This Week
  • 12
    Quite Universal Circuit Simulator

    A circuit simulator with graphical user interface (GUI)

    Qucs is a circuit simulator with graphical user interface. The software aims to support all kinds of circuit simulation types, e.g. DC, AC, S-parameter, Transient, Noise and Harmonic Balance analysis. Pure digital simulations are also supported.
    Downloads: 615 This Week
  • 13
    OSGeoLive

    Bootable Linux distro for geospatial applications

    ...It is composed entirely of free software, allowing it to be freely distributed, duplicated and passed around. It provides pre-configured applications for a range of geospatial use cases, including storage, publishing, viewing, analysis and manipulation of data. It also contains sample datasets and documentation.
    Downloads: 90 This Week
  • 14
    Python Zero to Hero for DevOps Engineers

    Learn Python from a DevOps engineer's point of view

    ...The repository is organized into Day-01 through Day-19 folders plus a small sample app, which makes it very easy to follow in sequence like a bootcamp. The curriculum starts with Python installation, environment setup, and writing your first script, then quickly moves into data types, strings, regular expressions, variables, and functions. It places a strong emphasis on DevOps-specific use cases: environment variables, command-line arguments, configuration handling, and automating log analysis or user management tasks are all explicitly woven into the exercises. As you progress, you encounter increasingly rich Python features such as lists (with list comprehensions), dictionaries, sets, operators, and control flow, always tied back to practical automation or infrastructure examples.
    Downloads: 2 This Week
  • 15
    Object-oriented crystallographic library and program for the analysis of crystal structures from scattering experiments: optimized scattering computation, graphical interface for structures and data, global optimization algorithms.
    Downloads: 0 This Week
  • 16
    Fuzzy machine learning framework

    A library and a GUI front-end for fuzzy machine learning

    Fuzzy machine learning framework is a library and a GUI front-end for machine learning using intuitionistic fuzzy data. The approach is based on the intuitionistic fuzzy sets and the possibility theory. Further characteristics are fuzzy features and classes; numeric, enumeration features and features based on linguistic variables; user-defined features; derived and evaluated features; classifiers as features for building hierarchical systems; automatic refinement in case of dependent...
    Downloads: 0 This Week
  • 17
    JQM Java Quine McCluskey

    JQM - Java Quine McCluskey for minimization of Boolean functions.

    Java Quine McCluskey (JQM) implements the Quine-McCluskey algorithm with Petrick’s Method for minimizing Boolean functions. Designed for both education and industrial application, it handles up to 16 variables and functions. Uniquely, JQM bridges the gap between theory and practice: it visualizes the solution process with generated Karnaugh Maps for students, while supporting PLC engineers by exporting results to Structured Text (ST) and Ladder Diagram (LD). The software includes a GUI for...
    Downloads: 3 This Week
  • 18
    Digital Forensics Guide

    Learn all about Digital Forensics and Computer Forensics

    The Digital Forensics Guide repository is a comprehensive, structured reference for investigators, analysts, students, and cybersecurity professionals interested in digital forensics principles, tools, methodologies, and workflows. It organizes foundational topics such as evidence acquisition, disk and memory analysis, file system structures, network forensics, artifact extraction, timeline generation, and reporting into digestible modules that help build core competency. Alongside conceptual explanations, the guide includes practical examples with widely used tools (like Autopsy, Volatility, Sleuth Kit, and network analysis suites), illustrating how investigations proceed from initial data capture to final analysis.
    Downloads: 0 This Week
  • 19
    CiteSpace

    A widely used tool for visual exploration of scientific literature.

    Visit the new site: https://citespace.podia.com
    CiteSpace generates interactive visualizations of structural and temporal patterns and trends of a scientific field. It facilitates a systematic review of a knowledge domain through an in-depth visual analytic process. It can process citation data from popular sources such as the Web of Science, Scopus, Dimensions, and the Lens. CiteSpace also supports basic visual analytic functions for datasets without citation-related information, for...
    Downloads: 1,549 This Week
  • 20
    Python Data Science Handbook

    Python Data Science Handbook: full text in Jupyter Notebooks

    ...Each chapter is a standalone Jupyter notebook, with runnable code, explanatory prose, visuals, and examples showing how to handle data-wrangling, exploratory data analysis, machine learning workflows, and visualization. The repository is freely available and the code is released under the MIT license; the textual content is released under a Creative Commons license. Users can also launch the notebooks in Google Colab or Binder directly, making it extremely accessible.
    Downloads: 2 This Week
  • 21
    Rocketry Research Assistant

    A LibreOffice Base database for tracking rocketry research projects.

    Research and engineering projects collect a lot of data. Data is collected during the development of the launch vehicle and payload. Data is collected during testing of the system, and data is collected during the actual flights. Working with data doesn’t stop there. Once the data has been collected it must be reviewed to make sure that the project is performing as expected. Data analysis is a critical part of any research or engineering project. ...
    Downloads: 0 This Week
  • 22
    Java Noodle Manual

    The Java interview manual consists of 5 chapters and 29 sections

    ...It uses interview questions as an entry point to explain deeper Java fundamentals instead of offering shallow memorization material. The content covers resume preparation, interview strategy, data structures, algorithms, concurrency, multithreading, JVM concepts, and source-code analysis. It includes practical examples so readers can test and verify the concepts while studying. The project is aimed at developers with some programming foundation, especially those with one to three years of experience preparing for stronger technical interviews. ...
    Downloads: 0 This Week
  • 23
    data-science-on-gcp

    Source code accompanying book: Data Science on the GCP

    The data-science-on-gcp repository is a comprehensive collection of code examples and end-to-end workflows that accompany the book Data Science on the Google Cloud Platform, designed to teach developers how to build scalable data science and machine learning systems using Google Cloud services. It provides structured, chapter-aligned implementations that guide users through the full lifecycle of a data science project, including data ingestion, storage, processing, analysis, model training, and deployment. ...
    Downloads: 0 This Week
  • 24
    Data Analysis for the Life Sciences

    Rmd source files for the HarvardX series PH525x

    This repository holds the R Markdown (.Rmd) source files for the PH525x / HarvardX course series (Data Analysis for the Life Sciences / Genomics) managed by GenomicsClass. It functions as the canonical source for course lab exercises, lecture modules, and reading materials in reproducible format. Students and learners use these R Markdown files to follow along, knit notebooks, run code samples, and complete the lab-based assignments.
    Downloads: 0 This Week
  • 25
    Statistics for Data Scientists

    "Statistics for Data Scientists: 50 Essential Concepts"

    The “statistics-for-data-scientists” repository is a pedagogical resource designed to bridge rigorous statistics theory and practical data science workflows. The code and materials are intended to help data scientists and analysts grasp statistical principles (e.g. inference, regressions, hypothesis testing, probability, confidence intervals) in contexts relevant to real data analysis tasks.
    Downloads: 0 This Week