Showing 283 open source projects for "python data analysis"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    CARTOframes

    CARTOframes

    CARTO Python package for data scientists

    A Python package for integrating CARTO maps, analysis, and data services into data science workflows. Python data analysis workflows often rely on the de facto standards pandas and Jupyter notebooks. Integrating CARTO into this workflow saves data scientists time and energy by not having to export datasets as files or retain multiple copies of the data.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    iJEPA

    iJEPA

    Official codebase for I-JEPA

    i-JEPA (Image Joint-Embedding Predictive Architecture) is a self-supervised learning framework that predicts missing high-level representations rather than reconstructing pixels. A context encoder sees visible regions of an image and predicts target embeddings for masked regions produced by a slowly updated target encoder, focusing learning on semantics instead of texture. This objective sidesteps generative pixel losses and avoids heavy negative sampling, producing features that transfer...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Robin-Stocks API Library

    Robin-Stocks API Library

    This is a library to use with Robinhood Financial App

    This is a library to use with Robinhood Financial App. It currently supports trading crypto-currencies, options, and stocks. In addition, it can be used to get real-time ticker information, assess the performance of your portfolio, and can also get tax documents, total dividends paid, and more. The code is simple to use, easy to understand, and easy to modify. With this library, you can view information on stocks, options, and cryptocurrencies in real-time, create your own robo-investor or...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    DIG

    DIG

    A library for graph deep learning research

    The key difference with current graph deep learning libraries, such as PyTorch Geometric (PyG) and Deep Graph Library (DGL), is that, while PyG and DGL support basic graph deep learning operations, DIG provides a unified testbed for higher level, research-oriented graph deep learning tasks, such as graph generation, self-supervised learning, explainability, 3D graphs, and graph out-of-distribution. If you are working or plan to work on research in graph deep learning, DIG enables you to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    Alpa

    Alpa

    Training and serving large-scale neural networks

    Alpa is a system for training and serving large-scale neural networks. Scaling neural networks to hundreds of billions of parameters has enabled dramatic breakthroughs such as GPT-3, but training and serving these large-scale neural networks require complicated distributed system techniques. Alpa aims to automate large-scale distributed training and serving with just a few lines of code.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 6
    CSI-Math-Notation-PostfixInfix

    CSI-Math-Notation-PostfixInfix

    Perl Lib Math Notation

    ... - The library provide: - Convert INFIX expressions to POSTFIX; - Convert POSTFIX expressions to INFIX and; - Perform POSTFIX context validations. - Context validation can be implemented in item selection routines or data context validation, when it is possible to identify data to be selected or ignored in some data analysis process. * NOTE: - Before any implementation, we recommend details in WIKI (https://sourceforge.net/p/csi-math-notation-postfixinfix/wiki/) or the CPAN Perl Modules, see: https://metacpan.org/pod/Math::Notation::PostfixInfix * Support: The Support Service is FREE. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Data science blogs

    Data science blogs

    A curated list of data science blogs

    Data Science Blogs is a curated repository that aggregates a wide range of high-quality blogs and resources related to data science, machine learning, and analytics into a single organized collection. It serves as a discovery platform for practitioners, researchers, and learners who want to stay updated with industry trends, techniques, and insights without manually searching for reliable sources. The repository includes links to personal blogs, professional publications, and educational...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    quantitative

    quantitative

    Quantized transactions python3

    ...The README and associated lessons walk the user through implementing algorithms, likely covering data handling, backtesting, and maybe simple trading logic. As an open-source educational resource, it’s designed for Python users interested in automatic trading, algorithmic strategies, and financial data analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Neural Network Visualization

    Neural Network Visualization

    Project for processing neural networks and rendering to gain insights

    nn_vis is a minimalist visualization tool for neural networks written in Python using OpenGL and Pygame. It provides an interactive, graphical representation of how data flows through neural network layers, offering a unique educational experience for those new to deep learning or looking to explain it visually. By animating input, weights, activations, and outputs, the tool demystifies neural network operations and helps users intuitively grasp complex concepts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 10
    FairScale

    FairScale

    PyTorch extensions for high performance and large scale training

    FairScale is a collection of PyTorch performance and scaling primitives that pioneered many of the ideas now used for large-model training. It introduced Fully Sharded Data Parallel (FSDP) style techniques that shard model parameters, gradients, and optimizer states across ranks to fit bigger models into the same memory budget. The library also provides pipeline parallelism, activation checkpointing, mixed precision, optimizer state sharding (OSS), and auto-wrapping policies that reduce...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    Name-That-Hash

    Name-That-Hash

    Identify MD5, SHA256 and 300+ other hashes

    ...One of its core ideas is popularity-aware ranking: when you feed in a hash, it prioritizes likely real-world types such as NTLM over obscure ones like Skype hashes, instead of treating them equally. The tool provides concise “hash summaries” that explain where a given hash format is commonly used, helping users decide how to proceed with cracking or further analysis. Name-That-Hash is accessible via a Python CLI (nth) and also exposes an API and JSON output, making it easy to plug into other tools or workflows, and there is also a web app that requires no local installation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Jraph

    Jraph

    A Graph Neural Network Library in Jax

    Jraph (pronounced “giraffe”) is a lightweight JAX library developed by Google DeepMind for building and experimenting with graph neural networks (GNNs). It provides an efficient and flexible framework for representing, manipulating, and training models on graph-structured data. The core of Jraph is the GraphsTuple data structure, which enables users to define graphs with arbitrary node, edge, and global attributes, and to batch variable-sized graphs efficiently for JAX’s just-in-time...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Sandia Cyber Omni Tracker

    Sandia Cyber Omni Tracker

    Sandia Cyber Omni Tracker (SCOT)

    SCOT is an open-source cybersecurity incident response and threat intelligence management system developed by Sandia National Laboratories. It centralizes alert collection, facilitates team collaboration, and enriches data with automated analysis, enhancing the efficiency and effectiveness of security operations teams.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Google IPs

    Google IPs

    Public IP address ranges associated with Google infrastructure

    Google-IPs aggregates public IP address ranges that are associated with Google’s infrastructure, collecting them in machine-readable formats useful for routing and firewall rules. The list is helpful for administrators who need to whitelist Google endpoints, analyze traffic, or tune proxies and CDN configurations. By centralizing ranges that are otherwise spread across announcements and registries, it saves time and reduces misconfiguration risk. The repository typically includes CIDR blocks...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Julia.jl

    Julia.jl

    Curated decibans of Julia programming language

    Julia.jl is a curated collection of knowledge resources for the Julia programming language, designed to support high-performance numerical analysis and computational science. The repository aggregates diverse content across domains such as mathematics, physics, data science, optimization, machine learning, and supercomputing. It functions as a structured index, helping developers, researchers, and learners easily find materials to deepen their understanding of Julia’s ecosystem. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Fairseq

    Fairseq

    Facebook AI Research Sequence-to-Sequence Toolkit written in Python

    Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling and other text generation tasks. We provide reference implementations of various sequence modeling papers. Recent work by Microsoft and Google has shown that data parallel training can be made significantly more efficient by sharding the model parameters and optimizer state across data parallel workers. These ideas are encapsulated in the...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17
    Whisper Library

    Whisper Library

    Whisper is a file-based time-series database format for Graphite

    Whisper is one of three components within the Graphite project. Whisper is a fixed-size database, similar in design and purpose to RRD (round-robin-database). It provides fast, reliable storage of numeric data over time. Whisper allows for higher resolution (seconds per point) of recent data to degrade into lower resolutions for long-term retention of historical data. Copies data from src in dst, if missing. Unlike whisper-merge, don't overwrite data that's already present in the target...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 18
    CoolplaySpark

    CoolplaySpark

    Spark Cool Play: Spark source code analysis, Spark class library, etc.

    CoolplaySpark is a learning and practice repository designed to help users understand and work with Apache Spark. It serves as a companion resource for the book 深入理解Spark核心思想与源码分析 (In-Depth Understanding of Spark’s Core Concepts and Source Code Analysis). The project contains annotated examples, explanations, and exercises that guide learners through Spark’s architecture, execution model, and source code internals. It is particularly valuable for developers who want to strengthen their understanding of Spark by not only using it as a data processing engine but also exploring how its internals function. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    comby

    comby

    A code rewrite tool for structural search and replace that supports

    Comby is a tool for searching and changing code structure. Use lightweight templates to easily search and change code, HTML, or JSON. Comby is designed to work on any language or data format. Perform richer search and replace because Comby understands the syntax of code blocks, strings, and comments for your language. Comby is ideal for touching up pieces of code. Use it to translate code like this Python 2 to 3 fixer on the right to replace deprecated methods. Easily write one-off refactors or a collection of quick fixes customized to your project. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Google Cloud Vision API examples

    Google Cloud Vision API examples

    Sample code for Google Cloud Vision

    The cloud-vision repository is a sample code collection for the Google Cloud Vision API that shows developers how to implement image analysis tasks across a wide range of languages and platforms. It contains examples organized by language and environment, including Go, Java, Node.js, PHP, Python, Ruby, .NET, Android, iOS, and even a Chrome extension, which makes it especially valuable as a cross-platform learning resource. The repository demonstrates concrete image understanding use cases, such as landmark detection and mobile photo analysis with label and face detection, so developers can see how Vision API outputs are consumed in real interfaces and workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    AugLy

    AugLy

    A data augmentations library for audio, image, text, and video

    AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations. Each modality’s augmentations are contained within its own sub-library. These sub-libraries include both function-based and class-based transforms, composition operators, and have the option to provide metadata about the transform applied, including its intensity. AugLy is a great library to utilize for augmenting your data in model training, or to evaluate...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    waybackurls

    waybackurls

    Fetch all the URLs that the Wayback Machine knows about for a domain

    ...The tool is particularly valuable in security research, bug bounty hunting, and penetration testing, as it uncovers endpoints that may no longer be publicly linked but still exist on servers. By leveraging archived data, waybackurls helps identify hidden attack surfaces, legacy APIs, and forgotten resources that could be vulnerable. Its design is intentionally simple and efficient, focusing on delivering large volumes of URLs quickly with minimal configuration. The output can be combined with other tools for further analysis, such as filtering parameters or probing endpoints.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 23
    Binarytree

    Binarytree

    Python library for studying Binary Trees

    Binarytree is Python library that lets you generate, visualize, inspect and manipulate binary trees. Skip the tedious work of setting up test data, and dive straight into practicing algorithms. Heaps and BSTs (binary search trees) are also supported. Binarytree supports another representation which is more compact but without the indexing properties.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Statistical Rethinking 2022

    Statistical Rethinking 2022

    Statistical Rethinking course winter 2022

    This repository hosts the 2022 version of the Statistical Rethinking course. It contains course materials such as R scripts, notebooks, and worked examples aligned with McElreath’s textbook. The code emphasizes Bayesian data analysis using R, the rethinking package, and Stan models. It includes lecture code files, example datasets, and structured exercises that parallel the topics covered in the lectures (probability, regression, model comparison, Bayesian updating). The repo functions as a direct hands-on reference for students following the 2022 recorded lecture series. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Interpret-Text

    Interpret-Text

    State-of-the-art explainers for text-based machine learning models

    A library that incorporates state-of-the-art explainers for text-based machine learning models and visualizes the result with a built-in dashboard. Interpret-Text builds on Interpret, an open source python package for training interpretable models and helping to explain blackbox machine learning systems. We have added extensions to support text models. Interpret-Text incorporates community-developed interpretability techniques for NLP models and a visualization dashboard to view the results. Users can run their experiments across multiple state-of-the-art explainers and easily perform comparative analysis on them. ...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB