609 projects for "python data analysis" with 2 filters applied:

  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    Smallpond

    Smallpond

    A lightweight data processing framework built on DuckDB and 3FS

    ...Users write Python-like code (via DataFrame APIs or SQL strings) to express their transformations; behind the scenes, tasks are scheduled (often via Ray) and pushed into DuckDB instances operating on partitioned data. Because the storage layer (3FS) is optimized for random access and high throughput, smallpond can shuffle data, repartition, and manage intermediate results across nodes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Google Cloud Dataflow Template Pipelines

    Google Cloud Dataflow Template Pipelines

    Cloud Dataflow Google-provided templates for solving data tasks

    ...Its structure shows support for multiple generations of templates, including v1 and v2 implementations, as well as related metadata, YAML assets, plugins, and Python components that support broader template execution and maintenance. This design makes the project more than a sample set, because it acts as the implementation base for official Google-provided templates used in real cloud data workflows.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3
    Ruff Visual Studio Code

    Ruff Visual Studio Code

    A Visual Studio Code extension with support for the Ruff linter

    Ruff Visual Studio Code is an official Visual Studio Code extension that integrates the Ruff Python linter and formatter directly into the development environment, enabling developers to maintain high code quality with minimal configuration. It provides real-time linting and formatting feedback as developers write code, helping catch errors, enforce style guidelines, and automatically fix issues where possible. The extension is built on top of Ruff’s high-performance Rust-based engine, which...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Piccolo

    Piccolo

    A fast, user friendly ORM and query builder which supports asyncio

    Piccolo is a modern, fast, and type-safe ORM for Python, designed with developer ergonomics in mind. It provides a clean syntax for defining schemas and building queries while supporting both sync and async execution. With built-in admin tools and rich introspection, Piccolo is suitable for web apps, APIs, and small-to-medium scale backends that prioritize clarity and speed.
    Downloads: 10 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    Apache Spark

    Apache Spark

    A unified analytics engine for large-scale data processing

    ...With Spark Streaming (microbatches) and Structured Streaming, it delivers low-latency event processing suitable for real-time analytics. The built-in MLlib library provides scalable machine learning algorithms, while GraphX enables graph computations integrated with data pipelines. Spark supports multiple languages—Scala, Java, Python, R—and connects with many storage systems like HDFS, S3, Cassandra, and streaming platforms like Kafka, making it a versatile choice for big data workloads in analytics, ETL, and data science.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    tqdm

    tqdm

    A Fast, Extensible Progress Bar for Python and CLI

    tqdm is a fast, extensible progress bar for Python and CLI that enables you to see the progress of your loops in a clear and smart way. Simply wrap any iterable with tqdm(iterable), and sit back and watch that progress meter go! tqdm can be wrapped around any iterable, or executed as a module with pipes. Just by inserting tqdm (or python -m tqdm) between pipes will pass through all stdin to stdout while printing progress to stderr. tqdm does not require any dependencies, has a very...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Apache Sedona

    Apache Sedona

    Cluster computing framework for processing large-scale geospatial data

    ...According to our benchmark and third-party research papers, Sedona has 50% less peak memory consumption than other Spark-based geospatial data systems for large-scale in-memory query processing. Sedona offers Scala, Java, Spatial SQL, Python, and R APIs and integrates them into underlying system kernels with care. You can simply create spatial analytics and data mining applications and run them in any cloud environments.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    OpenTelemetry Collector distributions

    OpenTelemetry Collector distributions

    OpenTelemetry Collector Official Releases

    High-quality, ubiquitous, and portable telemetry to enable effective observability. OpenTelemetry is a collection of APIs, SDKs, and tools. Use it to instrument, generate, collect, and export telemetry data (metrics, logs, and traces) to help you analyze your software’s performance and behavior. Create and collect telemetry from your services and software, then forward it to a variety of analysis tools. OpenTelemetry integrates with many popular libraries and frameworks, and supports code-based and zero-code instrumentation.
    Downloads: 87 This Week
    Last Update:
    See Project
  • 9
    Odigos

    Odigos

    Distributed tracing without code changes

    Odigos supports any application written in Java, Python, .NET, Node.js and Go. Historically, compiled languages like Go have been difficult to instrument without code changes. Odigos solves this problem by uniquely leveraging eBPF. Odigos currently supports all the popular managed and open source destinations. By producing data in the OpenTelemetry format, Odigos can be used with any observability tool that supports OTLP.
    Downloads: 23 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    Perfetto

    Perfetto

    Production-grade client-side tracing, profiling, and analysis

    ...It’s designed around a low-overhead producer/consumer model: instrumented components (“producers”) write binary events into shared memory buffers and a collector (“service”) reliably streams them to storage. The data model spans kernel and userspace, so you can stitch together CPU scheduling, app lifecycles, binder/IPC hops, GPU work, power and thermal signals, file I/O, heap samples, and more into a single coherent timeline. Perfetto’s ecosystem includes a web-based UI that can load multi-GB traces directly in the browser and an offline “trace processor” that exposes the trace as a queryable SQL-like table schema for deep analysis and automation. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 11
    Easy3D

    Easy3D

    Efficient library for processing 3D data

    Easy3D is a lightweight, easy-to-use, and efficient library for processing and rendering 3D data, implemented in C++ with Python bindings. It is designed for tasks such as 3D modeling, geometry processing, and rendering, emphasizing simplicity and efficiency. Easy3D serves as a valuable tool for research, education, and the development of sophisticated 3D applications, providing a solid foundation for handling 3D data.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Frappe

    Frappe

    Low code web framework for real world applications

    Frappe is a full-stack, low-code web framework written in Python and JavaScript, used to build scalable and modular enterprise applications. It powers ERPNext and includes tools for REST APIs, user management, document modeling, workflows, and real-time updates. Frappe uses a "model-view-controller" approach with its own ORM and frontend system, enabling rapid development without sacrificing control or performance.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 13
    OWL

    OWL

    Optimized Workforce Learning for General Multi-Agent Assistance

    OWL (Optimized Workforce Learning) is a sophisticated open-source framework built on the CAMEL-AI ecosystem for orchestrating teams of AI agents to collaboratively solve complex, real-world tasks with dynamic planning and automation capabilities. Unlike single-agent systems, it treats task completion as a collaborative workforce where agents take on specialized roles (planning, execution, analysis) and coordinate via a modular multi-agent architecture that supports flexible teamwork across...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Jupyter Docker Stacks

    Jupyter Docker Stacks

    Ready-to-run Docker images containing Jupyter applications

    ...Users can pull a particular stack image and launch a Jupyter server without worrying about installing Python, R, or complex dependencies themselves — everything needed is baked into the container. This makes the stacks especially useful for education, demos, collaborative coding, and CI/CD workflows where consistent environments are crucial, and it integrates smoothly with cloud platforms, JupyterHub deployments, and Binder for interactive sharing.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 15
    DrissionPage

    DrissionPage

    Python based web automation tool. Powerful and elegant

    DrissionPage is a Python-based automation framework that blends the capabilities of Selenium for browser automation with Requests-HTML for fast, headless web data extraction. It enables seamless switching between browser-controlled and headless HTTP sessions within the same interface. Ideal for web scraping, testing, and automation, DrissionPage is lightweight and highly efficient, offering more flexibility than standard Selenium or Requests usage alone.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    SENAITE LIMS

    SENAITE LIMS

    SENAITE Meta Package

    SENAITE is a beautiful trigonal, oil-green to greenish-black crystal, with almost the hardness of a diamond. Although the crystal is described with a complex formula, it still has clear and straight shapes. Therefore, it reflects nicely the complexity of the LIMS, while providing a modern, intuitive, and friendly UI/ UX. Amongst other functionalities, SENAITE comes with highly-customizable workflows to drive users through the analytical process, easy-to-use UI for data registration,...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    Siddhi Core Libraries

    Siddhi Core Libraries

    Stream Processing and Complex Event Processing Engine

    Fully open source, cloud-native, scalable, micro streaming, and complex event processing system capable of building event-driven applications for use cases such as real-time analytics, data integration, notification management, and adaptive decision-making. Event processing logic can be written using Streaming SQL queries via graphical and source editors, to capture events from diverse data sources, process and analyze them, integrate with multiple services and data stores, and publish output to various endpoints in real time. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Barfi

    Barfi

    A Python visual Flow Based Programming library

    A Python visual Flow-Based Programming library that integrates into your existing workflow. Barfi is a Flow-Based Programming environment that provides a graphical programming interface. It is integratable into your existing Python workflows. A schema is built using barfi.Blocks. Then the schema is executed with barfi.ComputeEngine. Each barfi.Block has some properties that enable the FBP and schema building.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    WTForms

    WTForms

    A flexible forms validation and rendering library for Python

    WTForms is a flexible forms validation and rendering library for Python web development. It can work with whatever web framework and template engine you choose. It supports data validation, CSRF protection, internationalization (I18N), and more. There are various community libraries that provide closer integration with popular frameworks. WTForms is designed to work with any web framework and template engine.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    go1pylib

    go1pylib

    go1pylib is a Python library designed to control the Go1 robot

    go1pylib is a Python library designed to control the Go1 robot by Unitree Robotics. It provides an easy-to-use interface for robot movement, state management, collision avoidance, battery monitoring, and MQTT communication. Ideal for research and robotics development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    segyio

    segyio

    Fast Python library for SEGY files

    Segyio is a small LGPL-licensed C library for easy interaction with SEG-Y and Seismic Unix formatted seismic data, with language bindings for Python and Matlab. Segyio is an attempt to create an easy-to-use, embeddable, community-oriented library for seismic applications. Features are added as they are needed; suggestions and contributions of all kinds are very welcome.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    peepDB

    peepDB

    CLI tool and python library to inspect databases fast

    peepDB is an open-source command-line tool and Python library designed for developers and database administrators who need a fast and efficient way to inspect their database tables without writing SQL queries. With support for MySQL, PostgreSQL, and MariaDB, peepDB is lightweight, secure, and incredibly easy to use.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Polars

    Polars

    Dataframes powered by a multithreaded, vectorized query engine

    Polars is a high-performance, multi-language DataFrame library built in Rust using Apache Arrow. It delivers blazing-fast, vectorized, and parallel data manipulation with both eager and lazy execution, making it an excellent tool for data processing in Python, Rust, Node.js, R, and SQL contexts.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    AudioCraft

    AudioCraft

    Audiocraft is a library for audio processing and generation

    ...Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides inference scripts, checkpoints, and simple Python APIs so you can generate clips from prompts or incorporate the models into applications. It also contains training code and recipes, so researchers can fine-tune on custom data or explore new objectives without building infrastructure from scratch. Example notebooks, CLI tools, and audio utilities help with prompt design, conditioning on reference audio, and post-processing to produce ready-to-share outputs.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 25
    Mobly

    Mobly

    E2E test framework for tests with complex environment requirements

    Mobly is a Python-based test framework that specializes in supporting test cases that require multiple devices, complex environments, or custom hardware setups. P2P data transfer between two devices. Conference calls across three phones. Wearable device interacting with a phone. Internet-Of-Things devices interacting with each other. Testing RF characteristics of devices with special equipment.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB