Showing 1158 open source projects for "python data analysis"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 1
    Polars

    Polars

    Dataframes powered by a multithreaded, vectorized query engine

    Polars is a high-performance, multi-language DataFrame library built in Rust using Apache Arrow. It delivers blazing-fast, vectorized, and parallel data manipulation with both eager and lazy execution, making it an excellent tool for data processing in Python, Rust, Node.js, R, and SQL contexts.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    .NET Interactive

    .NET Interactive

    .NET Interactive takes .NET and embeds it into your experiences

    .NET Interactive takes the power of .NET and embeds it into your interactive experiences. Share code, explore data, write, and learn across your apps in ways you couldn't before. We recently introduced the .NET Interactive Notebooks extension for Visual Studio Code, which adds support for .NET Interactive using the new Visual Studio Code native notebook feature. We encourage you to try it out. .NET Interactive enables users to mix languages in a single notebook or cell without a wrapper. The...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Gigapipe

    Gigapipe

    The Open-Source Polyglot Observability Warehouse

    Gigapipe is an open-source, polyglot observability platform designed to unify logs, metrics, traces, and profiling data into a single, lightweight system. It serves as an all-in-one alternative to traditional observability stacks by implementing compatibility with widely used standards such as Loki, Prometheus, Tempo, and Pyroscope, allowing it to integrate seamlessly with existing tools and workflows. The platform supports ingestion from multiple sources, including OpenTelemetry and various vendor-specific formats, enabling flexible data collection without requiring complex middleware. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    PyMC3

    PyMC3

    Probabilistic programming in Python

    PyMC3 allows you to write down models using an intuitive syntax to describe a data generating process. Fit your model using gradient-based MCMC algorithms like NUTS, using ADVI for fast approximate inference — including minibatch-ADVI for scaling to large datasets, or using Gaussian processes to build Bayesian nonparametric models. PyMC3 includes a comprehensive set of pre-defined statistical distributions that can be used as model building blocks. Sometimes an unknown parameter or variable...
    Downloads: 4 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Super-Linter

    Super-Linter

    Combination of multiple linters to install as a GitHub Action

    This repository is for the GitHub Action to run a Super-Linter. It is a simple combination of various linters, written in bash, to help validate your source code. The super-linter finds issues and reports them to the console output. Fixes are suggested in the console output but not automatically fixed, and a status check will show up as failed on the pull request. The design of the Super-Linter is currently to allow linting to occur in GitHub Actions as a part of continuous integration...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    tqdm

    tqdm

    A Fast, Extensible Progress Bar for Python and CLI

    tqdm is a fast, extensible progress bar for Python and CLI that enables you to see the progress of your loops in a clear and smart way. Simply wrap any iterable with tqdm(iterable), and sit back and watch that progress meter go! tqdm can be wrapped around any iterable, or executed as a module with pipes. Just by inserting tqdm (or python -m tqdm) between pipes will pass through all stdin to stdout while printing progress to stderr. tqdm does not require any dependencies, has a very...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Gradio

    Gradio

    Create UIs for your machine learning model in Python in 3 minutes

    ...One of the best ways to share your machine learning model, API, or data science workflow with others is to create an interactive demo that allows your users or colleagues to try out the demo in their browsers.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 8
    Avalanche

    Avalanche

    End-to-End Library for Continual Learning based on PyTorch

    Avalanche is an end-to-end Continual Learning library based on Pytorch, born within ContinualAI with the unique goal of providing a shared and collaborative open-source (MIT licensed) codebase for fast prototyping, training and reproducible evaluation of continual learning algorithms. Avalanche can help Continual Learning researchers in several ways. This module maintains a uniform API for data handling: mostly generating a stream of data from one or more datasets. It contains all the major...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    Django Rules

    Django Rules

    Awesome Django authorization, without the database

    rules is a tiny but powerful app providing object-level permissions to Django, without requiring a database. At its core, it is a generic framework for building rule-based systems, similar to decision trees. It can also be used as a standalone library in other contexts and frameworks. Versatile. Decorate callables to build complex graphs of predicates. Predicates can be any type of callable -- simple functions, lambdas, methods, callable class objects, partial functions, decorated functions,...
    Downloads: 2 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10
    RecBole

    RecBole

    A unified, comprehensive and efficient recommendation library

    A unified, comprehensive and efficient recommendation library. We design general and extensible data structures to unify the formatting and usage of various recommendation datasets. We implement more than 100 commonly used recommendation algorithms and provide formatted copies of 28 recommendation datasets. We support a series of widely adopted evaluation protocols or settings for testing and comparing recommendation algorithms.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    Mobly

    Mobly

    E2E test framework for tests with complex environment requirements

    Mobly is a Python-based test framework that specializes in supporting test cases that require multiple devices, complex environments, or custom hardware setups. P2P data transfer between two devices. Conference calls across three phones. Wearable device interacting with a phone. Internet-Of-Things devices interacting with each other. Testing RF characteristics of devices with special equipment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Blue Whale Configuration Platform

    Blue Whale Configuration Platform

    Blue Whale smart cloud configuration platform

    Has accumulated experience in supporting hundreds of Tencent businesses, compatible with various complex system architectures, born in operation and maintenance, and proficient in operation and maintenance. From configuration management to job execution, task scheduling and monitoring self-healing, and then through operation and maintenance big data analysis to assist operational decision-making, it covers the full-cycle assurance management of business operations in a comprehensive manner. The open PaaS has a powerful development framework and scheduling engine, as well as a complete operation and maintenance development training system, which helps the rapid transformation and upgrading of operation and maintenance. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Jupyter Docker Stacks

    Jupyter Docker Stacks

    Ready-to-run Docker images containing Jupyter applications

    ...Users can pull a particular stack image and launch a Jupyter server without worrying about installing Python, R, or complex dependencies themselves — everything needed is baked into the container. This makes the stacks especially useful for education, demos, collaborative coding, and CI/CD workflows where consistent environments are crucial, and it integrates smoothly with cloud platforms, JupyterHub deployments, and Binder for interactive sharing.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    TensorFlow

    TensorFlow

    TensorFlow is an open source library for machine learning

    Originally developed by Google for internal use, TensorFlow is an open source platform for machine learning. Available across all common operating systems (desktop, server and mobile), TensorFlow provides stable APIs for Python and C as well as APIs that are not guaranteed to be backwards compatible or are 3rd party for a variety of other languages. The platform can be easily deployed on multiple CPUs, GPUs and Google's proprietary chip, the tensor processing unit (TPU). TensorFlow...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 15
    CodiumAI PR-Agent

    CodiumAI PR-Agent

    AI-Powered tool for automated pull request analysis

    CodiumAI PR-Agent is an open-source tool aiming to help developers review pull requests faster and more efficiently. It automatically analyzes the pull request and can provide several types of commands. See the Usage Guide for instructions how to run the different tools from CLI, online usage, Or by automatically triggering them when a new PR is opened. You can try GPT-4 powered PR-Agent, on your public GitHub repository, instantly. Just mention @CodiumAI-Agent and add the desired command in...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    Mikro Orm

    Mikro Orm

    TypeScript ORM for Node.js based on Data Mapper, Unit of Work

    TypeScript ORM for Node.js based on Data Mapper, Unit of Work and Identity Map patterns. MikroORM allows handling transactions automatically. When you call em.flush(), all computed changes are wrapped inside a database transaction. Uses source code analysis so you do not have to repeat yourself when defining entities. Simply define correct TypeScript types and you are good to go!
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17
    statsmodels

    statsmodels

    Statsmodels, statistical modeling and econometrics in Python

    statsmodels is a Python module that provides classes and functions for the estimation of many different statistical models, as well as for conducting statistical tests, and statistical data exploration. An extensive list of result statistics are available for each estimator. The results are tested against existing statistical packages to ensure that they are correct.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 18
    Laravel Workflow

    Laravel Workflow

    Durable workflow engine that allows users to track job status

    ...Activities are individual tasks or pieces of logic that are executed as part of a workflow. Laravel Workflow can be used to automate and manage complex processes, such as financial transactions, data analysis, data pipelines, micro-services, job tracking, user signup flows, sagas and other business processes. By using Laravel Workflow, developers can break down large, complex processes into smaller, modular units that can be easily maintained and updated.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 19
    Numbast

    Numbast

    Build an automated pipeline that converts CUDA APIs into Numba

    ...This approach significantly improves developer productivity by reducing boilerplate code and ensuring consistency between C++ and Python interfaces. Numbast is particularly useful for teams working with custom CUDA libraries or extending existing ones into Python ecosystems for data science and machine learning. It complements tools like Numba, which compile Python code into GPU-executable kernels, by expanding the range of accessible CUDA functionality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Homemade Machine Learning

    Homemade Machine Learning

    Python examples of popular machine learning algorithms

    homemade-machine-learning is a repository by Oleksii Trekhleb containing Python implementations of classic machine-learning algorithms done “from scratch”, meaning you don’t rely heavily on high-level libraries but instead write the logic yourself to deepen understanding. Each algorithm is accompanied by mathematical explanations, visualizations (often via Jupyter notebooks), and interactive demos so you can tweak parameters, data, and observe outcomes in real time.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    FATE

    FATE

    An industrial grade federated learning framework

    FATE (Federated AI Technology Enabler) is the world's first industrial grade federated learning open source framework to enable enterprises and institutions to collaborate on data while protecting data security and privacy. It implements secure computation protocols based on homomorphic encryption and multi-party computation (MPC). Supporting various federated learning scenarios, FATE now provides a host of federated learning algorithms, including logistic regression, tree-based algorithms,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    SQL Explorer

    SQL Explorer

    Easily share data across your company via SQL queries

    SQL Explorer aims to make the flow of data between people fast, simple, and confusion-free. It is a Django-based application that you can add to an existing Django site, or use as a standalone business intelligence tool. Quickly write and share SQL queries in a simple, usable SQL editor, preview the results in the browser, share links, download CSV, JSON, or Excel files (and even expose queries as API endpoints, if desired), and keep the information flowing! Comes with support for multiple...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 23
    OWL

    OWL

    Optimized Workforce Learning for General Multi-Agent Assistance

    OWL (Optimized Workforce Learning) is a sophisticated open-source framework built on the CAMEL-AI ecosystem for orchestrating teams of AI agents to collaboratively solve complex, real-world tasks with dynamic planning and automation capabilities. Unlike single-agent systems, it treats task completion as a collaborative workforce where agents take on specialized roles (planning, execution, analysis) and coordinate via a modular multi-agent architecture that supports flexible teamwork across...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Mythril

    Mythril

    Security analysis tool for EVM bytecode. Supports smart contracts

    Mythril is a security analysis tool for EVM bytecode. It detects security vulnerabilities in smart contracts built for Ethereum, Hedera, Quorum, Vechain, Roostock, Tron and other EVM-compatible blockchains. It uses symbolic execution, SMT solving and taint analysis to detect a variety of security vulnerabilities. It's also used (in combination with other tools and techniques) in the MythX security analysis platform. If you are a smart contract developer, we recommend using MythX tools which...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    AudioCraft

    AudioCraft

    Audiocraft is a library for audio processing and generation

    ...Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides inference scripts, checkpoints, and simple Python APIs so you can generate clips from prompts or incorporate the models into applications. It also contains training code and recipes, so researchers can fine-tune on custom data or explore new objectives without building infrastructure from scratch. Example notebooks, CLI tools, and audio utilities help with prompt design, conditioning on reference audio, and post-processing to produce ready-to-share outputs.
    Downloads: 8 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB