Showing 998 open source projects for "python data analysis"

  • 1

    TSA_CRAFT

    Automatic command line tool for TSA data analysis

    DSF is a high-throughput TSA platform for screening the various conditions that affect protein stability. To facilitate TSA data analysis, we developed an automatic tool, "TSA-CRAFT", by integrating Perl scripts and Gnuplot. The Perl scripts manage the entire workflow and the data processing steps of the analysis, while Gnuplot handles curve fitting and result presentation. All analysed results are written to an HTML file that can be displayed in any web browser.
    Downloads: 3 This Week
  • 2
    Microsoft Python Language Server

    Microsoft Language Server for Python

    ...To control the visibility and severity of the diagnostics, there are a number of lists that can be set in the user configuration which make use of each diagnostic's error code. During analysis, the language server produces Python code from compiled modules and builtins, which is similar to Python module stubs. It may also produce database files holding module analysis for faster retrieval later.
    Downloads: 0 This Week
  • 3
    Django Celery

    Old Celery integration project for Django

    Celery is a simple, flexible, and reliable distributed system to process vast amounts of messages, while providing operations with the tools required to maintain such a system. It’s a task queue with a focus on real-time processing, while also supporting task scheduling. Celery has a large and diverse community of users and contributors; you should come join us on IRC or our mailing list. Celery is open source and licensed under the BSD License. A task queue’s input is a unit of work called a...
    Downloads: 3 This Week
  • 4
    Nebula reporter

    The optional reporter container which reads nebula reports from Kafka

    Nebula is an open source distributed Docker orchestrator designed for massive scales (tens of thousands of servers/worker devices). Unlike Mesos/Swarm/Kubernetes, it can have workers distributed over high-latency connections (such as the internet) yet have the pods (containers) managed centrally, with changes taking effect (almost) immediately. This makes Nebula ideal for managing a vast cluster of servers/devices across the globe. Ever wondered how you're going to push an update...
    Downloads: 0 This Week
  • 5
    Nebula worker

    The worker node manager container which manages nebula nodes

    Nebula is an open source distributed Docker orchestrator designed for massive scales (tens of thousands of servers/worker devices). Unlike Mesos/Swarm/Kubernetes, it can have workers distributed over high-latency connections (such as the internet) yet have the pods (containers) managed centrally, with changes taking effect (almost) immediately. This makes Nebula ideal for managing a vast cluster of servers/devices across the globe; some example use cases are IoT devices,...
    Downloads: 0 This Week
  • 6
    RobotsDisallowed

    A curated list of the most common and most interesting robots.txt

    RobotsDisallowed is a public catalog that tracks websites and organizations explicitly blocking AI and web-scraping crawlers in their robots.txt or related mechanisms. It focuses on documenting the growing trend of content owners asserting control over how their data is used for model training and automated harvesting. The project aggregates domains, notes the targeted bots or user agents, and surfaces patterns for researchers, policymakers, and tool builders. It serves both as a...
    Downloads: 0 This Week
  • 7
    Facets

    Visualizations for machine learning datasets

    The power of machine learning comes from its ability to learn patterns from large amounts of data. Understanding your data is critical to building a powerful machine learning system. Facets contains two robust visualizations to aid in understanding and analyzing machine learning datasets. Get a sense of the shape of each feature of your dataset using Facets Overview, or explore individual observations using Facets Dive. Explore Facets Overview and Facets Dive on the UCI Census Income...
    Downloads: 1 This Week
  • 8
    The Google Cloud Developer's Cheat Sheet

    Cheat sheet for Google Cloud developers

    Every product in the Google Cloud family described in <=4 words (with liberal use of hyphens and slashes) by the Google Developer Relations Team. This list only includes products that are publicly available. There are several products in pre-release/private-alpha that will not be included until they go public beta or GA. Many of these products have a free tier. There is also a free trial that will enable you to try almost everything. API platforms and ecosystems, developer and management tools,...
    Downloads: 0 This Week
  • 9
    Active Learning

    Framework and examples for active learning with machine learning models

    Active Learning is a Python-based research framework developed by Google for experimenting with and benchmarking various active learning algorithms. It provides modular tools for running reproducible experiments across different datasets, sampling strategies, and machine learning models. The system allows researchers to study how models can improve labeling efficiency by selectively querying the most informative data points rather than relying on uniformly sampled training sets. ...
    Downloads: 0 This Week
  • 10
    TGAN

    Generative adversarial training for generating synthetic tabular data

    We are happy to announce that our new model for synthetic data called CTGAN is open-sourced. The new model is simpler and gives better performance on many datasets. TGAN is a tabular data synthesizer. It can generate fully synthetic data from real data. Currently, TGAN can generate numerical columns and categorical columns. TGAN has been developed and runs on Python 3.5, 3.6 and 3.7. Also, although it is not strictly required, the usage of a virtualenv is highly recommended in order to avoid interfering with other software installed in the system where TGAN is run. ...
    Downloads: 0 This Week
  • 11

    Open|SpeedShop

    Open|SpeedShop is an open source multi-platform Linux performance tool

    Open|SpeedShop is an open source multi-platform Linux performance tool that supports performance analysis of applications running on both single-node and large-scale IA64, IA32, EM64T, AMD64, PPC, Blue Gene, ARM and Cray platforms.
    Downloads: 5 This Week
  • 12
    Personal Blog

    One article per week, the content is concise, neither salty nor light

    Personal Blog holds the source structure and article index for the author’s personal technical blog, which is closely tied to the “芋道源码” WeChat public account. It uses Markdown files and a static-site setup (with configuration like _config.yml) to organize posts about Java back-end engineering, distributed systems, and source-code deep dives. The README and index emphasize that the blog (in this repo) is paused and that new content is primarily delivered via the WeChat channel, but the...
    Downloads: 0 This Week
  • 13
    data-science-ipython-notebooks

    Data science Python notebooks: Deep learning

    Data Science IPython Notebooks is a broad, curated set of Jupyter notebooks covering Python, data wrangling, visualization, machine learning, deep learning, and big data tools. It aims to be a practical map of the ecosystem, showing hands-on examples with libraries such as NumPy, pandas, matplotlib, scikit-learn, and others. Many notebooks introduce concepts step by step, then apply them to real datasets so readers can see techniques in action.
    Downloads: 0 This Week
  • 14
    Pipelines

    An experimental programming language for data flow

    Pipelines is a language and runtime for crafting massively parallel pipelines. Unlike other languages for defining data flow, the Pipeline language requires the implementation of components to be defined separately in the Python scripting language. This allows the details of implementations to be separated from the structure of the pipeline while providing access to thousands of active libraries for machine learning, data analysis, and processing.
    Downloads: 0 This Week
  • 15
    Django REST Pandas

    Serves up Pandas dataframes via the Django REST Framework

    Django REST Pandas (DRP) provides a simple way to generate and serve pandas DataFrames via the Django REST Framework. The resulting API can serve up CSV (and a number of other formats) for consumption by a client-side visualization tool like d3.js. The design philosophy of DRP enforces a strict separation between data and presentation. This keeps the implementation simple, but also has the nice side effect of making it trivial to provide the source data for your visualizations. This...
    Downloads: 0 This Week
  • 16
    Invenio

    Invenio digital library framework

    Invenio is a highly customizable open-source framework for building large-scale digital repositories and research data platforms. Developed by CERN, it is designed to manage, index, and provide access to metadata-rich content such as publications, datasets, and multimedia files. Invenio provides a modular architecture, making it suitable for libraries, archives, and research institutions.
    Downloads: 2 This Week
  • 17
    django-rest-auth

    This app makes it extremely easy to build Django-powered SPAs

    This app makes it extremely easy to build Django-powered SPAs (Single Page Apps) or mobile apps, exposing all registration and authentication-related functionality as CBVs (Class-Based Views) and REST (JSON). Tivix rebuilt a NATO software system to organize and coordinate rescue missions for submarines in distress across the globe. The United Nations Partner Portal (UNPP) is a web application built for a group of UN agencies to simplify their business processes and streamline collaboration...
    Downloads: 0 This Week
  • 18
    JAS-mine

    Toolkit for agent-based & dynamic microsimulation modelling

    JAS-mine is a Java platform that aims at providing a unique simulation tool for discrete-event simulations, including agent-based and microsimulation models. With the aim to develop large-scale, data-driven models, the main architectural choice of JAS-mine is to use whenever possible standard, open-source tools already available in the software development community. It has built-in utilities for communicating with an underlying relational database. In addition, the platform provides standard tools which are frequently used both in agent-based modelling and dynamic microsimulations, like design of experiments (DOE), run-time monitoring and visualization with plots and graphs (GUI), I/O communication and statistical analysis. ...
    Downloads: 0 This Week
  • 19
    Code Catalog in Python

    Algorithms and data structures for review for coding interview

    code-catalog-python serves as a grab-bag of small, readable Python examples that illustrate common algorithms, data structures, and utility patterns. Each snippet aims to be self-contained and easy to study, with clear inputs, outputs, and the essential logic on display. The catalog format lets you scan for an example, copy it, and adapt it to your use case without wading through a large framework.
    Downloads: 0 This Week
  • 20
    ShadowSocksShare

    Python ShadowSocks framework

    This project obtains shared ss(r) accounts by crawling ss(r) sharing websites, parses the accounts and verifies their connectivity, then redistributes them and generates a subscription link. Almost all of the available accounts crawled so far came from Google+, which will be closed on April 2, 2019. So if you are building your own website, please keep an eye on the updates of this project and redeploy using the latest source code.
    Downloads: 0 This Week
  • 21
    Kalibr Allan

    IMU Allan standard deviation charts

    ...While manufacturers typically provide “white noise” values in IMU datasheets, the bias instability and random walk parameters must be determined experimentally. This project enables users to compute those values using Allan variance analysis from recorded IMU data. The workflow involves recording IMU measurements with the device stationary, converting ROS bag files into MATLAB-compatible formats, and then running MATLAB scripts to generate Allan deviation plots. These plots are analyzed to determine noise density and random walk parameters for both gyroscopes and accelerometers. ...
    Downloads: 1 This Week
  • 22
    Omniglot

    Omniglot data set for one-shot learning

    This repository hosts the Omniglot dataset for one-shot learning, containing handwritten characters across multiple alphabets along with stroke data. It includes both MATLAB and Python starter scripts (e.g. demo.m, demo.py) to illustrate how to load the images and stroke sequences and run baseline experiments (such as classification by modified Hausdorff distance). The dataset provides both an image representation of each character and the time-ordered stroke coordinates ([x, y, t]) for each instance. ...
    Downloads: 1 This Week
  • 23
    Trigger Happy

    Automate the exchanges of the data between applications

    Automate the exchange of data between the applications and services you use on the web. Make Twitter talk to Mastodon, make GitHub talk to Mattermost, store your favorite tweets by creating notes in Evernote, follow RSS feeds and post each news item in Wallabag, Pocket or Evernote. The possibilities are too numerous to name, but with this project you won't have to lift a finger: automate everything and make your life easier. And last but not least, as this is...
    Downloads: 2 This Week
  • 24
    Python Taint

    Static Analysis Tool for Detecting Security Vulnerabilities in Python

    Static analysis of Python web applications based on theoretical foundations (control flow graphs, fixed point, dataflow analysis). Detects command injection, SSRF, SQL injection, XSS, directory traversal, etc. A lot of customization is possible. For functions from builtins or libraries, e.g. url_for or os.path.join, use the -m option to specify whether or not they return tainted values given tainted inputs; by default this file is used.
    Downloads: 2 This Week
  • 25
    ATLAS_mPBPK

    Modeling and Simulation of mPBPK models

    ATLAS mPBPK is a MATLAB-based tool for modeling and simulation of minimal physiologically based pharmacokinetic (mPBPK) models of small and large molecules. The tool enables users to perform: i) PK data visualization, ii) simulation, iii) parameter optimization, and iv) local sensitivity analysis (SA) of mPBPK models in a simple and efficient manner.
    Downloads: 0 This Week