Showing 1125 open source projects for "python data analysis"

View related business solutions
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • 1
    Vanilla.PDF

    Vanilla.PDF

    Cross-platform SDK for creating and modifying PDF documents

    Vanilla.PDF is a modern, high-performance, open-source C++17 SDK designed for creating, editing, signing, and analyzing PDF documents across multiple platforms. It requires no external runtime dependencies, making it lightweight and ideal for embedding into desktop applications, servers, or automation pipelines. The SDK offers full cross-platform support including Windows, Linux, macOS, and Android, with builds available for major compilers and architectures. Vanilla.PDF supports advanced...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    RecBole

    RecBole

    A unified, comprehensive and efficient recommendation library

    A unified, comprehensive and efficient recommendation library. We design general and extensible data structures to unify the formatting and usage of various recommendation datasets. We implement more than 100 commonly used recommendation algorithms and provide formatted copies of 28 recommendation datasets. We support a series of widely adopted evaluation protocols or settings for testing and comparing recommendation algorithms.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    Google Maps SDK for iOS Utility Library

    Google Maps SDK for iOS Utility Library

    Google Maps SDK for iOS Utility Library

    google-maps-ios-utils is a collection of open-source utilities that extend the functionality of the Google Maps SDK for iOS. It provides additional features such as clustering, heatmaps, and geometry utilities to enhance map-based applications.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    Loggifly

    Loggifly

    Get Alerts from your Docker Container Logs

    LoggiFly is a lightweight, open-source monitoring tool designed to watch Docker container logs in real time and trigger alerts, notifications, or automated actions based on predefined keywords or regular expression patterns. Instead of manually scanning logs for issues or relying solely on centralized monitoring stacks, LoggiFly proactively inspects streams of container output and notifies users through services like Ntfy, Slack, Discord, Telegram, or webhooks when significant events occur....
    Downloads: 6 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 5
    Mobly

    Mobly

    E2E test framework for tests with complex environment requirements

    Mobly is a Python-based test framework that specializes in supporting test cases that require multiple devices, complex environments, or custom hardware setups. P2P data transfer between two devices. Conference calls across three phones. Wearable device interacting with a phone. Internet-Of-Things devices interacting with each other. Testing RF characteristics of devices with special equipment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Blue Whale Configuration Platform

    Blue Whale Configuration Platform

    Blue Whale smart cloud configuration platform

    Has accumulated experience in supporting hundreds of Tencent businesses, compatible with various complex system architectures, born in operation and maintenance, and proficient in operation and maintenance. From configuration management to job execution, task scheduling and monitoring self-healing, and then through operation and maintenance big data analysis to assist operational decision-making, it covers the full-cycle assurance management of business operations in a comprehensive manner. The open PaaS has a powerful development framework and scheduling engine, as well as a complete operation and maintenance development training system, which helps the rapid transformation and upgrading of operation and maintenance. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Full Stack FastAPI and PostgreSQL

    Full Stack FastAPI and PostgreSQL

    Full stack, modern web application generator

    Generate a backend and frontend stack using Python, including interactive API documentation. Production ready Python web server using Uvicorn and Gunicorn. Very high performance, on par with NodeJS and Go (thanks to Starlette and Pydantic). Great editor support. Completion everywhere. Less time debugging. Designed to be easy to use and learn. Less time reading docs. Minimize code duplication.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    Gigapipe

    Gigapipe

    The Open-Source Polyglot Observability Warehouse

    Gigapipe is an open-source, polyglot observability platform designed to unify logs, metrics, traces, and profiling data into a single, lightweight system. It serves as an all-in-one alternative to traditional observability stacks by implementing compatibility with widely used standards such as Loki, Prometheus, Tempo, and Pyroscope, allowing it to integrate seamlessly with existing tools and workflows. The platform supports ingestion from multiple sources, including OpenTelemetry and various vendor-specific formats, enabling flexible data collection without requiring complex middleware. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    FastUI

    FastUI

    Build better UIs faster

    FastUI is a library that lets developers build interactive user interfaces for FastAPI applications using Pydantic models. It automatically generates frontend components based on data schemas and endpoint logic, reducing the need for manual UI development. Designed to be type-safe, reactive, and fast, FastUI streamlines the creation of web dashboards, admin panels, and internal tools within a FastAPI backend.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    Boltons

    Boltons

    250+ constructs, recipes, and snippets which extend the Python library

    Boltons is a set of pure-Python utilities in the same spirit as, and yet conspicuously missing from, the standard library. Due to the nature of utilities, application developers might want to consider other integration options. Boltons is tested against Python 2.6-2.7, 3.4-3.7, and PyPy. The majority of boltons strive to be “good enough” for a wide range of basic uses, leaving advanced use cases to Python’s myriad specialized 3rd-party libraries. In many cases the respective boltons module...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    PyMC3

    PyMC3

    Probabilistic programming in Python

    PyMC3 allows you to write down models using an intuitive syntax to describe a data generating process. Fit your model using gradient-based MCMC algorithms like NUTS, using ADVI for fast approximate inference — including minibatch-ADVI for scaling to large datasets, or using Gaussian processes to build Bayesian nonparametric models. PyMC3 includes a comprehensive set of pre-defined statistical distributions that can be used as model building blocks. Sometimes an unknown parameter or variable...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    Mikro Orm

    Mikro Orm

    TypeScript ORM for Node.js based on Data Mapper, Unit of Work

    TypeScript ORM for Node.js based on Data Mapper, Unit of Work and Identity Map patterns. MikroORM allows handling transactions automatically. When you call em.flush(), all computed changes are wrapped inside a database transaction. Uses source code analysis so you do not have to repeat yourself when defining entities. Simply define correct TypeScript types and you are good to go!
    Downloads: 6 This Week
    Last Update:
    See Project
  • 13

    Impacket

    A collection of Python classes for working with network protocols

    ...It features several protocols, including Ethernet, IP, TCP, UDP, ICMP, IGMP, ARP, NMB and SMB1, SMB2 and SMB3 and more. Impacket's object oriented API makes it easy to work with deep hierarchies of protocols. It can construct packets from scratch, as well as parse them from raw data.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    Numbast

    Numbast

    Build an automated pipeline that converts CUDA APIs into Numba

    ...This approach significantly improves developer productivity by reducing boilerplate code and ensuring consistency between C++ and Python interfaces. Numbast is particularly useful for teams working with custom CUDA libraries or extending existing ones into Python ecosystems for data science and machine learning. It complements tools like Numba, which compile Python code into GPU-executable kernels, by expanding the range of accessible CUDA functionality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    CodiumAI PR-Agent

    CodiumAI PR-Agent

    AI-Powered tool for automated pull request analysis

    CodiumAI PR-Agent is an open-source tool aiming to help developers review pull requests faster and more efficiently. It automatically analyzes the pull request and can provide several types of commands. See the Usage Guide for instructions how to run the different tools from CLI, online usage, Or by automatically triggering them when a new PR is opened. You can try GPT-4 powered PR-Agent, on your public GitHub repository, instantly. Just mention @CodiumAI-Agent and add the desired command in...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    TensorFlow

    TensorFlow

    TensorFlow is an open source library for machine learning

    Originally developed by Google for internal use, TensorFlow is an open source platform for machine learning. Available across all common operating systems (desktop, server and mobile), TensorFlow provides stable APIs for Python and C as well as APIs that are not guaranteed to be backwards compatible or are 3rd party for a variety of other languages. The platform can be easily deployed on multiple CPUs, GPUs and Google's proprietary chip, the tensor processing unit (TPU). TensorFlow...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 17
    Avalanche

    Avalanche

    End-to-End Library for Continual Learning based on PyTorch

    Avalanche is an end-to-end Continual Learning library based on Pytorch, born within ContinualAI with the unique goal of providing a shared and collaborative open-source (MIT licensed) codebase for fast prototyping, training and reproducible evaluation of continual learning algorithms. Avalanche can help Continual Learning researchers in several ways. This module maintains a uniform API for data handling: mostly generating a stream of data from one or more datasets. It contains all the major...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 18
    FATE

    FATE

    An industrial grade federated learning framework

    FATE (Federated AI Technology Enabler) is the world's first industrial grade federated learning open source framework to enable enterprises and institutions to collaborate on data while protecting data security and privacy. It implements secure computation protocols based on homomorphic encryption and multi-party computation (MPC). Supporting various federated learning scenarios, FATE now provides a host of federated learning algorithms, including logistic regression, tree-based algorithms,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    sqlite-utils

    sqlite-utils

    Python CLI utility and library for manipulating SQLite databases

    sqlite-utils is both a Python library and a command-line tool for creating, inspecting, and transforming SQLite databases with minimal boilerplate. It focuses on making common tasks like importing CSV/JSON, exploring tables, and running ad-hoc queries feel ergonomic and scriptable. As a CLI, it lets you build databases from structured data in one line, run queries against local files or in-memory databases, output results as JSON, CSV, or pretty tables, and configure full-text search. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Flexprice

    Flexprice

    Usage-based pricing and billing for developers

    Flexprice is an open-source dynamic pricing engine designed to help online businesses and marketplaces automate and optimize their pricing strategies. It allows developers and data scientists to experiment with pricing algorithms using real-time market data, inventory levels, and historical sales to maximize revenue, conversion, or competitiveness. Built with flexibility in mind, Flexprice can be integrated into existing e-commerce infrastructure via APIs and supports simulation and A/B...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    JeeSite

    JeeSite

    A basic enterprise information development platform

    JeeSite is a high-efficiency, high-performance, and strong security open source Java EE rapid development platform based on a number of excellent open source projects, highly integrated and packaged. JeeSite is the best basic platform solution for you to quickly complete your project. JeeSite is the best study case for you to learn the Java platform. JeeSite is also the best assistant for private work. JeeSite is a Java-based development platform built on the Spring Framework, with Spring...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    AudioCraft

    AudioCraft

    Audiocraft is a library for audio processing and generation

    ...Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides inference scripts, checkpoints, and simple Python APIs so you can generate clips from prompts or incorporate the models into applications. It also contains training code and recipes, so researchers can fine-tune on custom data or explore new objectives without building infrastructure from scratch. Example notebooks, CLI tools, and audio utilities help with prompt design, conditioning on reference audio, and post-processing to produce ready-to-share outputs.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 23
    Hypothesis

    Hypothesis

    The property-based testing library for Python

    Hypothesis is a powerful library for property-based testing in Python. Instead of writing specific test cases, users define properties and Hypothesis generates random inputs to uncover edge cases and bugs. It integrates with unittest and pytest, shrinking failing examples to minimal reproducible cases. Widely adopted in production systems, Hypothesis boosts code reliability by exploring input spaces far beyond manually crafted tests.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    PyFCM

    PyFCM

    Python client for FCM - Firebase Cloud Messaging

    Python client for FCM - Firebase Cloud Messaging (Android, iOS and Web) Firebase Cloud Messaging (FCM) is the new version of GCM. It inherits the reliable and scalable GCM infrastructure, plus new features. GCM users are strongly recommended to upgrade to FCM. Using FCM, you can notify a client app that new email or other data is available to sync.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    DALI

    DALI

    A GPU-accelerated library containing highly optimized building blocks

    The NVIDIA Data Loading Library (DALI) is a library for data loading and pre-processing to accelerate deep learning applications. It provides a collection of highly optimized building blocks for loading and processing image, video and audio data. It can be used as a portable drop-in replacement for built-in data loaders and data iterators in popular deep learning frameworks. Deep learning applications require complex, multi-stage data processing pipelines that include loading, decoding,...
    Downloads: 1 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB