Showing 1158 open source projects for "python data analysis"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    PyCBC

    PyCBC

    Learn how to use PyCBC to analyze gravitational-wave data

    PyCBC is a software developed by a collaboration of LIGO, Virgo, and independent scientists. It is open source and freely available. We use PyCBC in the detection of gravitational waves from binary mergers such as GW150914. These examples explore how to analyze gravitational wave data, how we find potential signals and learn about them. Many of these tutorials will require you to make edits to config files as part of their exercises. At the moment this isn't easy to do on services like...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    SageMaker Containers

    SageMaker Containers

    Create SageMaker-compatible Docker containers

    Amazon SageMaker is a fully managed service for data science and machine learning (ML) workflows. You can use Amazon SageMaker to simplify the process of building, training, and deploying ML models. To train a model, you can include your training script and dependencies in a Docker container that runs your training code. A container provides an effectively isolated environment, ensuring a consistent runtime and reliable training process. The SageMaker Training Toolkit can be easily added to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    PerfKit Benchmarker

    PerfKit Benchmarker

    PerfKit Benchmarker (PKB) contains a set of benchmarks

    PerfKitBenchmarker is an open-source benchmarking framework designed to measure and compare the performance of cloud infrastructure across multiple providers in a consistent and reproducible way. It allows users to evaluate metrics such as latency, throughput, provisioning time, and system performance using a standardized set of benchmarks. The tool supports a wide range of environments, including major cloud platforms, Kubernetes clusters, and even local hardware, making it highly versatile...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Brand new cheatsheets and handouts

    Brand new cheatsheets and handouts

    Matplotlib 3.1 cheat sheet

    The Brand new cheatsheets and handouts repo is a compact, quick-reference summary of the most commonly used plotting commands and configurations in Matplotlib, intended to serve as a handy reference for experienced users who want to recall syntax or find the right function without digging into full documentation. It lays out common use cases (plot types, styling, figure configuration, saving/exporting, subplot layout, etc.) in a concise and organized format — often serving as a “cheat sheet”...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5
    Albedo

    Albedo

    A recommender system for discovering GitHub repos

    Albedo is an open-source recommender system aimed at helping developers discover GitHub repositories by learning from activity signals. It treats repositories and developers as a graph of interactions and applies large-scale matrix factorization to model affinities, with Apache Spark providing the distributed data processing. The project focuses on implicit feedback—stars, watches, and other engagement metrics—so it can build useful recommendations without explicit ratings. A reproducible...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Tensor2Tensor

    Tensor2Tensor

    Library of deep learning models and datasets

    Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Alfred-Workflow

    Alfred-Workflow

    Full-featured library for writing Alfred 3 & 4 workflows

    Alfred-Workflow is a Python helper library for Alfred 2, 3 and 4 workflow authors, developed and hosted on GitHub. Alfred workflows typically take user input, fetch data from the Web or elsewhere, filter them and display results to the user. Alfred-Workflow takes care of a lot of the details for you, allowing you to concentrate your efforts on your workflow’s functionality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Coronavirus Tracker API

    Coronavirus Tracker API

    API for tracking the global coronavirus (COVID-19, SARS-CoV-2)

    Provides up-to-date data about Coronavirus outbreak. Includes numbers about confirmed cases, deaths and recovered. Support multiple data sources. Specify source parameter with ?source=nyt. NYT also provides a timeseries! To view timelines of cases by US counties use ?source=nyt&timelines=true. Currently, 3 different data sources are available to retrieve the data. Worldwide Data repository operated by the Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE). U.S....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Pinject

    Pinject

    A pythonic dependency injection library

    ...The library leans on Python’s introspection to minimize boilerplate, making it natural to adopt in codebases that already rely on type hints or keyword arguments. Because bindings are just Python functions and classes, refactoring remains straightforward and the DI graph is easy to reason about. Pinject is particularly useful for medium-to-large services where configuration, logging, data clients, and business logic need clean separation without resorting to manual plumbing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    TensorFlow Object Counting API

    TensorFlow Object Counting API

    The TensorFlow Object Counting API is an open source framework

    The TensorFlow Object Counting API is an open source framework built on top of TensorFlow and Keras that makes it easy to develop object counting systems. Please contact if you need professional object detection & tracking & counting project with super high accuracy and reliability! You can train TensorFlow models with your own training data to built your own custom object counter system! If you want to learn how to do it, please check one of the sample projects, which cover some of the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    The TRANSIMS Studio application is an integrated development environment for the TRansportation ANalysis and SIMulation System (TRANSIMS). Components include a run time environment to execute TRANSIMS in parallel, as well as a full featured GUI.
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 12
    An Open Source IEC 61131-3 Integrated Development Environment, providing PLCOpen SoftPLC programming, CanOpen IO's, and SVG based HMI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    gditools

    gditools

    A Python program/library aimed at GD-ROM image files.

    This Python program/library is designed to handle GD-ROM image (GDI) files. It can be used to list files, extract data, generate sorttxt file, extract bootstrap (IP.BIN) file and more. This project can be used in standalone mode, in interactive mode or as a library in another Python program (check the 'addons' folder to learn how). For your convenience, you can use the gditools.py GUI program supplied in the Files section (optional).
    Leader badge
    Downloads: 11 This Week
    Last Update:
    See Project
  • 14
    jsonfield

    jsonfield

    A reusable Django model field for storing ad-hoc JSON data

    jsonfield is a reusable model field that allows you to store validated JSON, automatically handling serialization to and from the database. To use, add jsonfield.JSONField to one of your models. Note: django.contrib.postgres now supports PostgreSQL's jsonb type, which includes extended querying capabilities. If you're an end user of PostgreSQL and want full-featured JSON support, then it is recommended that you use the built-in JSONField. However, jsonfield is still useful when your app...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Slacker

    Slacker

    Full-featured Python interface for the Slack API

    Slacker is a full-featured Python interface for the Slack API.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Hercules

    Hercules

    Gaining advanced insights from Git repository history

    ...The second is a Python script that shows some predefined plots over the collected data. These two tools are normally used together through a pipe. It is possible to write custom analyses using the plugin system. It is also possible to merge several analysis results together - relevant for organizations. The analyzed commit history includes branches, merges, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    CrypTen

    CrypTen

    A framework for Privacy Preserving Machine Learning

    CrypTen is a research framework developed by Facebook Research for privacy-preserving machine learning built directly on top of PyTorch. It provides a secure and intuitive environment for performing computations on encrypted data using Secure Multiparty Computation (SMPC). Designed to make secure computation accessible to machine learning practitioners, CrypTen introduces a CrypTensor object that behaves like a regular PyTorch tensor, allowing users to seamlessly apply automatic...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Graph Nets library

    Graph Nets library

    Build Graph Nets in Tensorflow

    Graph Nets, developed by Google DeepMind, is a Python library designed for constructing and training graph neural networks (GNNs) using TensorFlow and Sonnet. It provides a high-level, flexible framework for building neural architectures that operate directly on graph-structured data. A graph network takes graphs as inputs, consisting of edges, nodes, and global attributes, and produces updated graphs with modified feature representations at each level.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19

    qlogtimeliner

    Your application log analysis tool.

    This tool helps with analysis multithreaded logs by visualising log events on a timeline. It is user responsibility to create rules for log parsing in actionscript (like JavaScript) You need to have latest Qt sdk installed in order to build it from source code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Miasm

    Miasm

    Reverse engineering framework in Python

    The Miasm intermediate representation is used for multiple task: emulation through its jitter engine, symbolic execution, DSE, program analysis, but the intermediate representation can be a bit hard to read. We will present in this article new tricks Miasm has learned in 2018. Among them, the SSA/Out-of-SSA transformation, expression propagation and high-level operators can be joined to “lift” Miasm IR to a more human-readable language. We use graphviz to illustrate some graphs. Its layout...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    PixieDust

    PixieDust

    Python Helper library for Jupyter Notebooks

    PixieDust is an open source Python helper library that works as an add-on to Jupyter notebooks to improve the user experience of working with data. It also fills a gap for users who have no access to configuration files when a notebook is hosted on the cloud.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    RecNN

    RecNN

    Reinforced Recommendation toolkit built around pytorch 1.7

    This is my school project. It focuses on Reinforcement Learning for personalized news recommendation. The main distinction is that it tries to solve online off-policy learning with dynamically generated item embeddings. I want to create a library with SOTA algorithms for reinforcement learning recommendation, providing the level of abstraction you like.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    Optimized Storage for temporal Data

    open Optimized Storage of time series data

    Beta version. Base class for optimized storage of time series data. Uses any kind of relational database. Cross plateform with multiple languages (C++, C#, Java). Conditional storage based on value variation : DeltaValue and DeltaTime params. Get back data without losts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Twint

    Twint

    An advanced Twitter scraping & OSINT tool written in Python

    Twint is an advanced open-source Twitter scraping and OSINT tool written in Python that extracts tweets, user data, followers, likes, and more—without relying on Twitter’s API—making it highly useful for researchers, analysts, and hobbyists who want to bypass rate limits and access public Twitter data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    DocBook Authoring and Publishing Suite

    DocBook Authoring and Publishing Suite

    DocBook Publishing Made Easy

    The DAPS project moved to https://github.com/openSUSE/daps The SUSE XSL Stylesheets have moved to https://github.com/openSUSE/suse-xsl To join the discussion, under https://github.com/openSUSE/daps/discussions
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB