Showing 998 open source projects for "python data analysis"

  • 1
    Google IPs

    Public IP address ranges associated with Google infrastructure

    Google-IPs aggregates public IP address ranges that are associated with Google’s infrastructure, collecting them in machine-readable formats useful for routing and firewall rules. The list is helpful for administrators who need to whitelist Google endpoints, analyze traffic, or tune proxies and CDN configurations. By centralizing ranges that are otherwise spread across announcements and registries, it saves time and reduces misconfiguration risk. The repository typically includes CIDR blocks...
    Downloads: 0 This Week
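As a rough illustration of how machine-readable CIDR lists like these tend to be used in allowlist checks, here is a minimal sketch built on Python's standard `ipaddress` module. The ranges below are reserved documentation blocks, not real Google ranges, and the helper names `parse_ranges` and `is_listed` are hypothetical.

```python
import ipaddress

# Placeholder CIDR blocks (reserved documentation ranges), standing in for
# whatever ranges the repository actually publishes. These are assumptions.
EXAMPLE_RANGES = ["192.0.2.0/24", "198.51.100.0/24", "2001:db8::/32"]


def parse_ranges(cidrs):
    """Turn CIDR strings into IPv4Network / IPv6Network objects."""
    return [ipaddress.ip_network(cidr) for cidr in cidrs]


def is_listed(address, networks):
    """Return True if the address falls inside any of the given networks."""
    ip = ipaddress.ip_address(address)
    # Compare only against networks of the same IP version.
    return any(ip.version == net.version and ip in net for net in networks)


networks = parse_ranges(EXAMPLE_RANGES)
print(is_listed("192.0.2.17", networks))
print(is_listed("203.0.113.5", networks))
```

In practice the placeholder list would be replaced by ranges loaded from the published files, whose exact format is not described here.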
  • 2
    Infoooze

    An OSINT tool that helps you find information quickly and effectively.

    Infoooze is an open-source information-gathering tool for cybersecurity and OSINT (open-source intelligence) purposes. It automates the process of collecting public data about domains, emails, IP addresses, and more.
    Downloads: 14 This Week
  • 3
    Grow.dev

    A declarative website generator designed for high-quality websites

    Grow.dev is a static site generator optimized for building highly interactive, localized microsites. Grow.dev focuses on providing optimal workflows and developer ergonomics for creating projects that are highly maintainable in the long term. Grow.dev encourages a strong but simple separation of content and presentation and makes maintaining content in different locales and environments a snap.
    Downloads: 8 This Week
  • 4
    Julia.jl

    Curated decibans of Julia programming language

    Julia.jl is a curated collection of knowledge resources for the Julia programming language, designed to support high-performance numerical analysis and computational science. The repository aggregates diverse content across domains such as mathematics, physics, data science, optimization, machine learning, and supercomputing. It functions as a structured index, helping developers, researchers, and learners easily find materials to deepen their understanding of Julia’s ecosystem. ...
    Downloads: 1 This Week
  • 5
    LeetCode Python

    LeetCode Solutions: A Record of My Problem Solving Journey

    This repository is a comprehensive personal journal of a LeetCode problem-solving journey. It includes detailed solutions with code, algorithm insights, data structure summaries, Anki flashcards, daily challenge logs, and future planning sections.
    Downloads: 2 This Week
  • 6
    Function Point Analysis

    Progressive Function Point Analysis Workbook in Excel

    The Progressive Function Point Analysis book has been released. Print copy: http://www.amazon.com/Progressive-Function-Point-Analysis-Estimation/dp/1502354160 E-book: http://www.amazon.com/dp/B00NH0MMAG Please support this project and its future releases! Managers provide the architect or team with project requirements. The architect or team returns a detailed task list with estimated man-hours per task, making life easier for project managers with fewer guesstimates and proper business/technical process...
    Downloads: 1 This Week
  • 7
    Fairseq

    Facebook AI Research Sequence-to-Sequence Toolkit written in Python

    Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling and other text generation tasks. We provide reference implementations of various sequence modeling papers. Recent work by Microsoft and Google has shown that data parallel training can be made significantly more efficient by sharding the model parameters and optimizer state across data parallel workers. These ideas are encapsulated in the...
    Downloads: 1 This Week
  • 8
    CAT

    CAT is the basic component of the server project

    ...In addition to metrics, it enables tracing: context is propagated across RPC boundaries so that problems like latency spikes or failed calls can be traced end-to-end. Alert rules and anomaly detection can be defined to notify teams proactively. The system supports multiple data backends and ingestion pipelines to collect data from JVM, C/C++, Python, and other ecosystems. With the collected data, CAT supports analysis of hotspots, trending anomalies, and capacity planning to drive continuous reliability improvements.
    Downloads: 1 This Week
  • 9
    Whisper Library

    Whisper is a file-based time-series database format for Graphite

    Whisper is one of three components within the Graphite project. Whisper is a fixed-size database, similar in design and purpose to RRD (round-robin database). It provides fast, reliable storage of numeric data over time. Whisper allows higher-resolution recent data (seconds per point) to degrade into lower resolutions for long-term retention of historical data. One of its utilities copies data from src to dst if it is missing; unlike whisper-merge, it doesn't overwrite data that's already present in the target...
    Downloads: 6 This Week
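To make the idea of high-resolution data degrading into lower resolutions more concrete, here is a minimal sketch in plain Python. It does not use the Whisper library or its file format; the `downsample` helper is hypothetical, and averaging each bucket is an assumption chosen for illustration.

```python
from collections import defaultdict


def downsample(points, step):
    """Roll (timestamp, value) points up into `step`-second buckets by averaging.

    Illustrative only: this mimics the general roll-up idea, not Whisper's
    actual on-disk archive handling.
    """
    buckets = defaultdict(list)
    for timestamp, value in points:
        # Align each point to the start of the coarser interval it falls in.
        buckets[timestamp - timestamp % step].append(value)
    return [
        (start, sum(values) / len(values))
        for start, values in sorted(buckets.items())
    ]


# Five points at 10-second resolution, rolled up to 60-second resolution.
recent = [(0, 1.0), (10, 3.0), (20, 5.0), (60, 2.0), (70, 4.0)]
rolled = downsample(recent, 60)
print(rolled)  # [(0, 3.0), (60, 3.0)]
```

The coarser series keeps one point per minute, which is how a fixed-size store can hold a long history in bounded space.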
  • 10
    CoolplaySpark

    Spark Cool Play: Spark source code analysis, Spark class library, etc.

    CoolplaySpark is a learning and practice repository designed to help users understand and work with Apache Spark. It serves as a companion resource for the book 深入理解Spark核心思想与源码分析 (In-Depth Understanding of Spark’s Core Concepts and Source Code Analysis). The project contains annotated examples, explanations, and exercises that guide learners through Spark’s architecture, execution model, and source code internals. It is particularly valuable for developers who want to strengthen their understanding of Spark by not only using it as a data processing engine but also exploring how its internals function. ...
    Downloads: 4 This Week
  • 11
    comby

    A code rewrite tool for structural search and replace that supports

    Comby is a tool for searching and changing code structure. Use lightweight templates to easily search and change code, HTML, or JSON. Comby is designed to work on any language or data format. Perform richer search and replace because Comby understands the syntax of code blocks, strings, and comments for your language. Comby is ideal for touching up pieces of code. Use it to translate code, for example with a Python 2 to 3 fixer that replaces deprecated methods. Easily write one-off refactors or a collection of quick fixes customized to your project. ...
    Downloads: 0 This Week
  • 12
    Blankly

    Easily build, backtest and deploy your algo in just a few lines

    Blankly is a live trading engine, backtest runner and development framework wrapped into one powerful open-source package. Models can be instantly backtested, paper traded, sandbox tested and run live by simply changing a single line. We built Blankly for every type of quant, including training and running ML models in the same environment, cross-exchange/cross-symbol arbitrage, and even long/short positions on stocks (all with built-in WebSockets). Blankly is the first framework to enable...
    Downloads: 1 This Week
  • 13
    m2cgen

    Transform ML models into native code

    m2cgen (Model 2 Code Generator) is a lightweight library that provides an easy way to transpile trained statistical models into native code (Python, C, Java, Go, JavaScript, Visual Basic, C#, PowerShell, R, PHP, Dart, Haskell, Ruby, F#, Rust, Elixir) with zero dependencies. Some models force input data to be a particular type during the prediction phase in their native Python libraries. ...
    Downloads: 1 This Week
  • 14
    Google Cloud Vision API examples

    Sample code for Google Cloud Vision

    The cloud-vision repository is a sample code collection for the Google Cloud Vision API that shows developers how to implement image analysis tasks across a wide range of languages and platforms. It contains examples organized by language and environment, including Go, Java, Node.js, PHP, Python, Ruby, .NET, Android, iOS, and even a Chrome extension, which makes it especially valuable as a cross-platform learning resource. The repository demonstrates concrete image understanding use cases, such as landmark detection and mobile photo analysis with label and face detection, so developers can see how Vision API outputs are consumed in real interfaces and workflows. ...
    Downloads: 0 This Week
  • 15
    AugLy

    A data augmentations library for audio, image, text, and video

    AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations. Each modality’s augmentations are contained within its own sub-library. These sub-libraries include both function-based and class-based transforms, composition operators, and have the option to provide metadata about the transform applied, including its intensity. AugLy is a great library to utilize for augmenting your data in model training, or to evaluate...
    Downloads: 0 This Week
  • 16
    waybackurls

    Fetch all the URLs that the Wayback Machine knows about for a domain

    ...The tool is particularly valuable in security research, bug bounty hunting, and penetration testing, as it uncovers endpoints that may no longer be publicly linked but still exist on servers. By leveraging archived data, waybackurls helps identify hidden attack surfaces, legacy APIs, and forgotten resources that could be vulnerable. Its design is intentionally simple and efficient, focusing on delivering large volumes of URLs quickly with minimal configuration. The output can be combined with other tools for further analysis, such as filtering parameters or probing endpoints.
    Downloads: 9 This Week
  • 17
    Binarytree

    Python library for studying Binary Trees

    Binarytree is a Python library that lets you generate, visualize, inspect and manipulate binary trees. Skip the tedious work of setting up test data, and dive straight into practicing algorithms. Heaps and BSTs (binary search trees) are also supported. Binarytree also supports another representation, which is more compact but lacks the indexing properties.
    Downloads: 0 This Week
  • 18
    Triton

    Triton is a dynamic binary analysis library

    ...A good way to get started is to look at our Python examples, which are already bootstrapped.
    Downloads: 2 This Week
  • 19
    NTU RGB-D

    Info and sample code for the "NTU RGB+D Action Recognition Dataset"

    The “NTU RGB+D” repository provides access to a large-scale dataset for human action recognition (and its extension, NTU RGB+D 120). The dataset includes multiple modalities (RGB video, depth sequences, infrared video, 3D skeletal joint data) captured with multiple Kinect v2 cameras simultaneously. The repository also contains MATLAB and Python demo scripts for loading, visualizing, and processing skeleton data, mapping between modalities, and handling the dataset structure. The extended dataset is split into background and evaluation sets for one-shot evaluation.
    Downloads: 3 This Week
  • 20
    Statistical Rethinking 2022

    Statistical Rethinking course winter 2022

    This repository hosts the 2022 version of the Statistical Rethinking course. It contains course materials such as R scripts, notebooks, and worked examples aligned with McElreath’s textbook. The code emphasizes Bayesian data analysis using R, the rethinking package, and Stan models. It includes lecture code files, example datasets, and structured exercises that parallel the topics covered in the lectures (probability, regression, model comparison, Bayesian updating). The repo functions as a direct hands-on reference for students following the 2022 recorded lecture series. ...
    Downloads: 0 This Week
  • 21
    AWS Step Functions Data Science SDK

    For building machine learning (ML) workflows and pipelines on AWS

    The AWS Step Functions Data Science SDK is an open-source library that allows data scientists to easily create workflows that process and publish machine learning models using Amazon SageMaker and AWS Step Functions. You can create machine learning workflows in Python that orchestrate AWS infrastructure at scale, without having to provision and integrate the AWS services separately.
    Downloads: 0 This Week
  • 22
    Manticore

    Symbolic execution tool

    Manticore helps us quickly take advantage of symbolic execution, taint analysis, and instrumentation to analyze binaries. Parts of Manticore underpinned our symbolic execution capabilities in the Cyber Grand Challenge. As an open-source tool, we hope that others can take advantage of these capabilities in their own projects. We prioritized simplicity and usability while building Manticore. We used minimal external dependencies and our API should look familiar to anyone with exploitation or...
    Downloads: 0 This Week
  • 23

    MS-Helios

    MS-Helios: A Circos wrapper to visualize multi-omic datasets

    Advances in high-resolution mass spectrometry facilitate the identification of hundreds of metabolites, thousands of proteins and their post-translational modifications. This remarkable progress poses a challenge to data analysis and visualization, requiring methods to reduce dimensionality and represent the data in a compact way. To provide a more holistic view, we recently introduced circular proteome maps (CPMs). However, CPM construction requires prior data transformation and extensive knowledge of the Perl-based tool Circos. We present MS-Helios, an easy-to-use command-line tool with multiple built-in data processing functions, allowing non-expert users to construct CPMs or, more generally, circular plots with a non-genomic basis. ...
    Downloads: 1 This Week
  • 24
    Model Search

    Framework that implements AutoML algorithms

    Model Search is an AutoML research system for discovering neural network architectures with minimal human intervention. Instead of hand-crafting models, you define a search space and objectives, then the system explores candidate architectures using controllers and population-based strategies. It supports multiple tasks (such as vision or text) by letting you express reusable building blocks—layers, cells, and topologies—that the search can recombine. Training, evaluation, and promotion of...
    Downloads: 0 This Week
  • 25
    Smart Contract Sanctuary

    A home for Ethereum smart contracts

    ...Contains smart contract sources for various networks, grouped by the first two characters of the contract address. A scriptable semantic grep utility for Solidity (crunch numbers, find specific contracts, extract data) is available. Semgrep is a fast, open-source static analysis tool for finding bugs and enforcing code standards at editor, commit, and CI time, and it now supports Solidity. A powerful online code search service can be used to search the sanctuary without cloning.
    Downloads: 0 This Week
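Grouping sources by the first two characters of the contract address can be pictured with a short sketch. It is not taken from the repository: the helpers `bucket_for` and `group_by_bucket` are hypothetical, the addresses are made up, and stripping the `0x` prefix and lowercasing before taking the two characters are assumptions.

```python
from collections import defaultdict


def bucket_for(address):
    """Return a two-character bucket key for a hex contract address.

    Assumption: the leading "0x" is ignored and the key is lowercased.
    """
    normalized = address.lower()
    if normalized.startswith("0x"):
        normalized = normalized[2:]
    return normalized[:2]


def group_by_bucket(addresses):
    """Group addresses under their two-character bucket key."""
    groups = defaultdict(list)
    for address in addresses:
        groups[bucket_for(address)].append(address)
    return dict(groups)


# Made-up addresses for illustration only.
examples = ["0xAB" + "0" * 38, "0xab" + "1" * 38, "0x7f" + "2" * 38]
groups = group_by_bucket(examples)
print(sorted(groups))  # ['7f', 'ab']
```

With two hex characters there are at most 256 buckets, which keeps any single directory from growing too large.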