Showing 171 open source projects for "python data analysis"

View related business solutions
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    AI Researcher

    AI Researcher

    An autonomous AI researcher

    AI Researcher is an experimental open-source project that demonstrates how multiple AI agents can collaborate to conduct complex research tasks from start to finish with minimal human intervention. It orchestrates agents that can generate research questions, perform literature reviews, execute experiments, analyze results, and synthesize findings into structured outputs like reports or code. Each agent operates with clear roles — such as researcher, analyst, and summarizer — and they...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    Kubernetes Handbook

    Kubernetes Handbook

    Cloud native application architecture practice handbook

    Cloud native is a behavioral method and design concept. In its essence, all behaviors or methods that can improve resource utilization and application delivery efficiency on the cloud are cloud-native. The history of cloud computing is a history of cloud native. Kubernetes opened the prelude to cloud native 1.0. The emergence of service mesh Istio led to microservices in the post-Kubernetes era. The rise of serverless has enabled cloud native to advance from the infrastructure layer to the...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    Megatron-LM

    Megatron-LM

    Ongoing research training transformer models at scale

    Megatron-LM is a GPU-optimized deep learning framework from NVIDIA designed to train extremely large transformer-based language models efficiently at scale. The repository provides both a reference training implementation and Megatron Core, a composable library of high-performance building blocks for custom large-model pipelines. It supports advanced parallelism strategies including tensor, pipeline, data, expert, and context parallelism, enabling training across massive multi-GPU and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Hello SQL

    Hello SQL

    Spanish-language course repository that teaches fundamentals of SQL

    hello-sql is a beginner-friendly, Spanish-language course repository that teaches the fundamentals of SQL and relational databases through practical examples. It focuses mainly on MySQL for lessons due to its ubiquity in education and professional environments, while also introducing PostgreSQL to broaden learners’ exposure to modern database tooling. The materials emphasize real-world query writing, schema design basics, and the mental model behind SELECT, JOIN, GROUP BY, and subqueries....
    Downloads: 0 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 5
    JSONLab

    JSONLab

    JSONLab: compact, portable, robust JSON/binary-JSON encoder

    JSONLab is a free and open-source JSON/UBJSON/MessagePack encoder and decoder written in the native MATLAB language. It can be used to convert a MATLAB data structure (array, struct, cell, struct array, cell array, and objects) into JSON/UBJSON/MessagePack formatted strings and files, or to parse a JSON/UBJSON/MessagePack file into MATLAB data structure. JSONLab supports nearly all versions of MATLAB and GNU Octave (a free MATLAB clone). The development of JSONLab is currently funded by the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    CS-Books

    CS-Books

    Collection of computer science textbooks, learning materials

    CS-Books is a massive curated collection of computer science textbooks, learning materials, and resource links that covers a wide range of topics from programming languages like C/C++ and Python to core subjects such as data structures, algorithms, operating systems, databases, networks, and design patterns. The repository aggregates over a thousand classic reference books and educational resources into a single index, making it a valuable starting point for self-learners, students preparing for technical interviews, and professionals deepening their knowledge across different CS domains. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    xrayutilities

    xrayutilities

    a package with useful scripts for X-ray diffraction physicists

    xrayutilities is a python package used to analyze x-ray diffraction data. It can support with performing diffraction experiments and used for common steps in the data analysis. It can read experimental data from several data formats (spec, edf, xrdml, ...); convert them to reciprocal space for arbitrary goniometer geometries and different detector systems (point, linear as well as area detectors); for further processing the data can be gridded (transformed to a regular grid). ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    Digital Forensics Guide

    Digital Forensics Guide

    Learn all about Digital Forensics and Computer Forensics

    The Digital Forensics Guide repository is a comprehensive, structured reference for investigators, analysts, students, and cybersecurity professionals interested in digital forensics principles, tools, methodologies, and workflows. It organizes foundational topics such as evidence acquisition, disk and memory analysis, file system structures, network forensics, artifact extraction, timeline generation, and reporting into digestible modules that help build core competency. Alongside conceptual explanations, the guide includes practical examples with widely used tools (like Autopsy, Volatility, Sleuth Kit, and network analysis suites), illustrating how investigations proceed from initial data capture to final analysis.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9

    MatCont

    Numerical Bifurcation Analysis Toolbox in Matlab

    ...There is a manual with detailed descriptions of the data.
    Leader badge
    Downloads: 224 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    Libro

    Libro

    An interactive program for statistical analysis of texts

    A cross-platform text analysis program written in Python and Free Pascal/Lazarus which scans a whole text file (in plain text, HTML, EPUB, or ODT formats) and ranks all used words according to frequency, performing a quantitative analysis of the text using Shannon-Weaver information statistic and Zipf power law function. It counts words, sentences, chars, spaces, and syllables.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    MathGL

    MathGL

    A library for scientific data visualization

    A free cross-platform library of fast C++ routines for the plotting of up to 3-ranged data. It can export to bitmap and vector EPS/SVG files. There are window interfaces (GLUT/FLTK/Qt) and console tools. MathGL can be used from C/Fortran/Python/Octav/Lua
    Leader badge
    Downloads: 41 This Week
    Last Update:
    See Project
  • 12
    Quite Universal Circuit Simulator

    Quite Universal Circuit Simulator

    A circuit simulator with graphical user interface (GUI)

    Qucs is a circuit simulator with graphical user interface. The software aims to support all kinds of circuit simulation types, e.g. DC, AC, S-parameter, Transient, Noise and Harmonic Balance analysis. Pure digital simulations are also supported.
    Leader badge
    Downloads: 1,130 This Week
    Last Update:
    See Project
  • 13
    OSGeoLive

    OSGeoLive

    Bootable linux distro for Geospatial Applications

    ...It is composed entirely of free software, allowing it to be freely distributed, duplicated and passed around. It provides pre-configured applications for a range of geospatial use cases, including storage, publishing, viewing, analysis and manipulation of data. It also contains sample datasets and documentation.
    Leader badge
    Downloads: 186 This Week
    Last Update:
    See Project
  • 14
    Object-oriented crystallographic library and program, for the analysis of Crystal structures from scattering experiments: optimized scattering computation, Graphical interface for structures and data, global optimization algorithms.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 15
    Fuzzy machine learning framework

    Fuzzy machine learning framework

    A library and a GUI front-end for fuzzy machine learning

    Fuzzy machine learning framework is a library and a GUI front-end for machine learning using intuitionistic fuzzy data. The approach is based on the intuitionistic fuzzy sets and the possibility theory. Further characteristics are fuzzy features and classes; numeric, enumeration features and features based on linguistic variables; user-defined features; derived and evaluated features; classifiers as features for building hierarchical systems; automatic refinement in case of dependent...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    JQM Java Quine McCluskey

    JQM Java Quine McCluskey

    JQM - Java Quine McCluskey for minimization of Boolean functions.

    Java Quine McCluskey (JQM) implements the Quine-McCluskey algorithm with Petrick’s Method for minimizing Boolean functions. Designed for both education and industrial application, it handles up to 16 variables and functions. Uniquely, JQM bridges the gap between theory and practice: it visualizes the solution process with generated Karnaugh Maps for students, while supporting PLC engineers by exporting results to Structured Text (ST) and Ladder Diagram (LD). The software includes a GUI for...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 17
    CiteSpace

    CiteSpace

    A widely used tool for visual exploration of scientific literature.

    Visit the new site: https://citespace.podia.com CiteSpace generates interactive visualizations of structural and temporal patterns and trends of a scientific field. It facilitates a systematic review of a knowledge domain through an in-depth visual analytic process. It can process citation data from popular sources such as the Web of Science, Scopus, Dimensions, and the Lens. CiteSpace also supports basic visual analytic functions for datasets without citation-related information, for...
    Leader badge
    Downloads: 2,097 This Week
    Last Update:
    See Project
  • 18
    Python Data Science Handbook

    Python Data Science Handbook

    Python Data Science Handbook: full text in Jupyter Notebooks

    The Python Data Science Handbook is a comprehensive collection of Jupyter notebooks written by Jake VanderPlas covering fundamental Python libraries for data science, including IPython, NumPy, Pandas, Matplotlib, Scikit-Learn and more. The project is designed for data scientists, researchers, and anyone transitioning into Python-based data work; it assumes you already know basic Python and focuses more on how to use the ecosystem effectively. ...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 19
    Elementary Algorithms

    Elementary Algorithms

    Book of elementary algorithms and data structures

    This book introduces elementary algorithms and data structure. It includes side-by-side comparison of purely functional realization and their imperative counterpart. From 2020/12, I started re-writing this book. The PDF can be downloaded for preview (EN, 中文). The 1st edition in Chinese (中文) was published in 2017. I recently switched my focus to the Mathematics of programming, the new book is also available in (github). To build the book in PDF format from the sources, you need the following...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    ThinkJulia.jl

    ThinkJulia.jl

    Port of the book Think Python to the Julia programming language

    ThinkJulia.jl is an open source educational project that adapts Think Python by Allen B. Downey into the Julia programming language, with contributions by Ben Lauwens. It provides a comprehensive introduction to programming and computational thinking using Julia’s modern, high-performance features. The book is structured to gradually teach core concepts such as variables, control flow, functions, recursion, object-oriented programming, and data structures, while offering hands-on exercises to reinforce each topic. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    DIG

    DIG

    A library for graph deep learning research

    The key difference with current graph deep learning libraries, such as PyTorch Geometric (PyG) and Deep Graph Library (DGL), is that, while PyG and DGL support basic graph deep learning operations, DIG provides a unified testbed for higher level, research-oriented graph deep learning tasks, such as graph generation, self-supervised learning, explainability, 3D graphs, and graph out-of-distribution. If you are working or plan to work on research in graph deep learning, DIG enables you to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    data-science-on-gcp

    data-science-on-gcp

    Source code accompanying book: Data Science on the GCP

    The data-science-on-gcp repository is a comprehensive collection of code examples and end-to-end workflows that accompany the book Data Science on the Google Cloud Platform, designed to teach developers how to build scalable data science and machine learning systems using Google Cloud services. It provides structured, chapter-aligned implementations that guide users through the full lifecycle of a data science project, including data ingestion, storage, processing, analysis, model training, and deployment. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Rocketry Research Assistant

    Rocketry Research Assistant

    A LibreOffice Base database for tracking rocketry research projects.

    Research and engineering projects collect a lot of data. Data is collected during the development of the launch vehicle and payload. Data is collected during testing of the system, and data is collected during the actual flights. Working with data doesn’t stop there. Once the data has been collected it must be reviewed to make sure that the project is performing as expected. Data analysis is a critical part of any research or engineering project. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    The Art of Programming

    The Art of Programming

    A collection of practical tips can be found at the bottom of this page

    The Art of Programming (Second Edition) is a curated collection of programming problems and solutions originally derived from the Microsoft 100 Interview Questions blog series, later refined into a long-running tutorial and ultimately a published book. Created by July, the series began in 2010 and has since evolved into an in-depth exploration of algorithmic thinking, data structures, and coding interview preparation. The repository brings together 42 classic programming problems from the...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 25
    Data Analysis for the Life Sciences

    Data Analysis for the Life Sciences

    Rmd source files for the HarvardX series PH525x

    This repository holds the R Markdown (.Rmd) source files for the PH525x / HarvardX course series (Data Analysis for the Life Sciences / Genomics) managed by GenomicsClass. It functions as the canonical source for course lab exercises, lecture modules, and reading materials in reproducible format. Students and learners use these R Markdown files to follow along, knit notebooks, run code samples, and complete the lab-based assignments.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB