Showing 282 open source projects for "python data analysis"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 1
    Alpa

    Alpa

    Training and serving large-scale neural networks

    Alpa is a system for training and serving large-scale neural networks. Scaling neural networks to hundreds of billions of parameters has enabled dramatic breakthroughs such as GPT-3, but training and serving these large-scale neural networks require complicated distributed system techniques. Alpa aims to automate large-scale distributed training and serving with just a few lines of code.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 2
    CSI-Math-Notation-PostfixInfix

    CSI-Math-Notation-PostfixInfix

    Perl Lib Math Notation

    ... - The library provide: - Convert INFIX expressions to POSTFIX; - Convert POSTFIX expressions to INFIX and; - Perform POSTFIX context validations. - Context validation can be implemented in item selection routines or data context validation, when it is possible to identify data to be selected or ignored in some data analysis process. * NOTE: - Before any implementation, we recommend details in WIKI (https://sourceforge.net/p/csi-math-notation-postfixinfix/wiki/) or the CPAN Perl Modules, see: https://metacpan.org/pod/Math::Notation::PostfixInfix * Support: The Support Service is FREE. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Data science blogs

    Data science blogs

    A curated list of data science blogs

    Data Science Blogs is a curated repository that aggregates a wide range of high-quality blogs and resources related to data science, machine learning, and analytics into a single organized collection. It serves as a discovery platform for practitioners, researchers, and learners who want to stay updated with industry trends, techniques, and insights without manually searching for reliable sources. The repository includes links to personal blogs, professional publications, and educational...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    quantitative

    quantitative

    Quantized transactions python3

    ...The README and associated lessons walk the user through implementing algorithms, likely covering data handling, backtesting, and maybe simple trading logic. As an open-source educational resource, it’s designed for Python users interested in automatic trading, algorithmic strategies, and financial data analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    Neural Network Visualization

    Neural Network Visualization

    Project for processing neural networks and rendering to gain insights

    nn_vis is a minimalist visualization tool for neural networks written in Python using OpenGL and Pygame. It provides an interactive, graphical representation of how data flows through neural network layers, offering a unique educational experience for those new to deep learning or looking to explain it visually. By animating input, weights, activations, and outputs, the tool demystifies neural network operations and helps users intuitively grasp complex concepts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    FairScale

    FairScale

    PyTorch extensions for high performance and large scale training

    FairScale is a collection of PyTorch performance and scaling primitives that pioneered many of the ideas now used for large-model training. It introduced Fully Sharded Data Parallel (FSDP) style techniques that shard model parameters, gradients, and optimizer states across ranks to fit bigger models into the same memory budget. The library also provides pipeline parallelism, activation checkpointing, mixed precision, optimizer state sharding (OSS), and auto-wrapping policies that reduce...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    Name-That-Hash

    Name-That-Hash

    Identify MD5, SHA256 and 300+ other hashes

    ...One of its core ideas is popularity-aware ranking: when you feed in a hash, it prioritizes likely real-world types such as NTLM over obscure ones like Skype hashes, instead of treating them equally. The tool provides concise “hash summaries” that explain where a given hash format is commonly used, helping users decide how to proceed with cracking or further analysis. Name-That-Hash is accessible via a Python CLI (nth) and also exposes an API and JSON output, making it easy to plug into other tools or workflows, and there is also a web app that requires no local installation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Sandia Cyber Omni Tracker

    Sandia Cyber Omni Tracker

    Sandia Cyber Omni Tracker (SCOT)

    SCOT is an open-source cybersecurity incident response and threat intelligence management system developed by Sandia National Laboratories. It centralizes alert collection, facilitates team collaboration, and enriches data with automated analysis, enhancing the efficiency and effectiveness of security operations teams.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    Google IPs

    Google IPs

    Public IP address ranges associated with Google infrastructure

    Google-IPs aggregates public IP address ranges that are associated with Google’s infrastructure, collecting them in machine-readable formats useful for routing and firewall rules. The list is helpful for administrators who need to whitelist Google endpoints, analyze traffic, or tune proxies and CDN configurations. By centralizing ranges that are otherwise spread across announcements and registries, it saves time and reduces misconfiguration risk. The repository typically includes CIDR blocks...
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10
    Julia.jl

    Julia.jl

    Curated decibans of Julia programming language

    Julia.jl is a curated collection of knowledge resources for the Julia programming language, designed to support high-performance numerical analysis and computational science. The repository aggregates diverse content across domains such as mathematics, physics, data science, optimization, machine learning, and supercomputing. It functions as a structured index, helping developers, researchers, and learners easily find materials to deepen their understanding of Julia’s ecosystem. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Fairseq

    Fairseq

    Facebook AI Research Sequence-to-Sequence Toolkit written in Python

    Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling and other text generation tasks. We provide reference implementations of various sequence modeling papers. Recent work by Microsoft and Google has shown that data parallel training can be made significantly more efficient by sharding the model parameters and optimizer state across data parallel workers. These ideas are encapsulated in the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Whisper Library

    Whisper Library

    Whisper is a file-based time-series database format for Graphite

    Whisper is one of three components within the Graphite project. Whisper is a fixed-size database, similar in design and purpose to RRD (round-robin-database). It provides fast, reliable storage of numeric data over time. Whisper allows for higher resolution (seconds per point) of recent data to degrade into lower resolutions for long-term retention of historical data. Copies data from src in dst, if missing. Unlike whisper-merge, don't overwrite data that's already present in the target...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 13
    CoolplaySpark

    CoolplaySpark

    Spark Cool Play: Spark source code analysis, Spark class library, etc.

    CoolplaySpark is a learning and practice repository designed to help users understand and work with Apache Spark. It serves as a companion resource for the book 深入理解Spark核心思想与源码分析 (In-Depth Understanding of Spark’s Core Concepts and Source Code Analysis). The project contains annotated examples, explanations, and exercises that guide learners through Spark’s architecture, execution model, and source code internals. It is particularly valuable for developers who want to strengthen their understanding of Spark by not only using it as a data processing engine but also exploring how its internals function. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    comby

    comby

    A code rewrite tool for structural search and replace that supports

    Comby is a tool for searching and changing code structure. Use lightweight templates to easily search and change code, HTML, or JSON. Comby is designed to work on any language or data format. Perform richer search and replace because Comby understands the syntax of code blocks, strings, and comments for your language. Comby is ideal for touching up pieces of code. Use it to translate code like this Python 2 to 3 fixer on the right to replace deprecated methods. Easily write one-off refactors or a collection of quick fixes customized to your project. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Google Cloud Vision API examples

    Google Cloud Vision API examples

    Sample code for Google Cloud Vision

    The cloud-vision repository is a sample code collection for the Google Cloud Vision API that shows developers how to implement image analysis tasks across a wide range of languages and platforms. It contains examples organized by language and environment, including Go, Java, Node.js, PHP, Python, Ruby, .NET, Android, iOS, and even a Chrome extension, which makes it especially valuable as a cross-platform learning resource. The repository demonstrates concrete image understanding use cases, such as landmark detection and mobile photo analysis with label and face detection, so developers can see how Vision API outputs are consumed in real interfaces and workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    AugLy

    AugLy

    A data augmentations library for audio, image, text, and video

    AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations. Each modality’s augmentations are contained within its own sub-library. These sub-libraries include both function-based and class-based transforms, composition operators, and have the option to provide metadata about the transform applied, including its intensity. AugLy is a great library to utilize for augmenting your data in model training, or to evaluate...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    waybackurls

    waybackurls

    Fetch all the URLs that the Wayback Machine knows about for a domain

    ...The tool is particularly valuable in security research, bug bounty hunting, and penetration testing, as it uncovers endpoints that may no longer be publicly linked but still exist on servers. By leveraging archived data, waybackurls helps identify hidden attack surfaces, legacy APIs, and forgotten resources that could be vulnerable. Its design is intentionally simple and efficient, focusing on delivering large volumes of URLs quickly with minimal configuration. The output can be combined with other tools for further analysis, such as filtering parameters or probing endpoints.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 18
    Binarytree

    Binarytree

    Python library for studying Binary Trees

    Binarytree is Python library that lets you generate, visualize, inspect and manipulate binary trees. Skip the tedious work of setting up test data, and dive straight into practicing algorithms. Heaps and BSTs (binary search trees) are also supported. Binarytree supports another representation which is more compact but without the indexing properties.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Statistical Rethinking 2022

    Statistical Rethinking 2022

    Statistical Rethinking course winter 2022

    This repository hosts the 2022 version of the Statistical Rethinking course. It contains course materials such as R scripts, notebooks, and worked examples aligned with McElreath’s textbook. The code emphasizes Bayesian data analysis using R, the rethinking package, and Stan models. It includes lecture code files, example datasets, and structured exercises that parallel the topics covered in the lectures (probability, regression, model comparison, Bayesian updating). The repo functions as a direct hands-on reference for students following the 2022 recorded lecture series. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Interpret-Text

    Interpret-Text

    State-of-the-art explainers for text-based machine learning models

    A library that incorporates state-of-the-art explainers for text-based machine learning models and visualizes the result with a built-in dashboard. Interpret-Text builds on Interpret, an open source python package for training interpretable models and helping to explain blackbox machine learning systems. We have added extensions to support text models. Interpret-Text incorporates community-developed interpretability techniques for NLP models and a visualization dashboard to view the results. Users can run their experiments across multiple state-of-the-art explainers and easily perform comparative analysis on them. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Pythonidae

    Pythonidae

    Curated decibans of scientific programming resources in Python

    ...While the primary focus is on Python, some entries also highlight resources implemented in other languages like Julia, R, Go, and Java. The project emphasizes open contribution, allowing the community to continuously expand and refine the index. By gathering these resources in one place, Pythonidae acts as a central hub for scientific and data-driven programming with Python.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    TensorFlow Examples

    TensorFlow Examples

    TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

    TensorFlow Examples is a comprehensive repository of example implementations, tutorials, and reference code intended to help newcomers and intermediate learners dive into TensorFlow quickly. It contains both Jupyter notebooks and raw source code, covering a broad range of tasks: from basic machine-learning and neural-network models to more advanced use cases, using both TensorFlow v1 and v2 APIs. For clarity and educational value, each example is accompanied by explanatory comments or...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Pandas TA

    Pandas TA

    Python 3 Pandas Extension with 130+ Indicators

    Technical Analysis Indicators - Pandas TA is an easy-to-use Python 3 Pandas Extension with 130+ Indicators. Pandas Technical Analysis (Pandas TA) is an easy-to-use library that leverages the Pandas package with more than 130 Indicators and Utility functions and more than 60 TA Lib Candlestick Patterns. Many commonly used indicators are included, such as: Candle Pattern(cdl_pattern), Simple Moving Average (sma) Moving Average Convergence Divergence (macd), Hull Exponential Moving Average (hma), Bollinger Bands (bbands), On-Balance Volume (obv), Aroon & Aroon Oscillator (aroon), Squeeze (squeeze) and many more.
    Downloads: 389 This Week
    Last Update:
    See Project
  • 24
    PyTorchVideo

    PyTorchVideo

    A deep learning library for video understanding research

    ...The library includes efficient implementations of state-of-the-art architectures such as SlowFast, X3D, and MViT, optimized for both research prototyping and production inference. It supports video I/O pipelines, data augmentation, distributed training, and mixed precision computation for large-scale experiments. PyTorchVideo also connects seamlessly with other Meta AI tools such as Detectron2 and PyTorch3D for multimodal video analysis. Designed to accelerate research and deployment, it serves as a unified framework for reproducible, high-performance video AI development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    libpython-clj

    libpython-clj

    Python bindings for Clojure

    libpython-clj is a deep interop library enabling you to load and use Python modules from within Clojure as if they were native namespaces—and even extend Python objects from Clojure. It bridges to the Python C API, preserving REPL‑based workflows. Bridge between JVM objects and Python objects easily; use Python in your Java and use some Java in your Python. Python objects are linked to the JVM GC such that when they are no longer reachable from the JVM, their references are released....
    Downloads: 4 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB