609 projects for "python data analysis" with 2 filters applied:

  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1
    Scrapy

    Scrapy

    A fast, high-level web crawling and web scraping framework

    Scrapy is a fast, open source, high-level framework for crawling websites and extracting structured data from these websites. Portable and written in Python, it can run on Windows, Linux, macOS and BSD. Scrapy is powerful, fast and simple, and also easily extensible. Simply write the rules to extract the data, and add new functionality if you wish without having to touch the core. Scrapy does the rest, and can be used in a number of applications.
    Downloads: 26 This Week
    Last Update:
    See Project
  • 2
    Deequ

    Deequ

    Deequ is a library built on top of Apache Spark

    ...It also includes a little domain-specific language called DQDL (Data Quality Definition Language) which allows declarative specification of quality rules. Users typically run Deequ before feeding data downstream (to ML pipelines, analytics, or production systems), enabling early detection and isolation of data errors. There is also a Python wrapper, PyDeequ, for users who prefer working from Python environments.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 3
    Pydantic-Core

    Pydantic-Core

    Core validation logic for pydantic written in rust

    pydantic-core is the Rust-based core validation logic for Pydantic, a widely used data validation library in Python. It offers significant performance improvements over its predecessor, enabling faster and more efficient data parsing and validation.​
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Earth Engine API

    Earth Engine API

    Python and JavaScript bindings for calling the Earth Engine API

    The Earth Engine API provides Python and JavaScript client libraries for Google Earth Engine, a planetary-scale geospatial analysis platform. With it, users compose lazy, server-side computations over massive catalogs of satellite imagery and vector datasets without handling raw files locally. The API exposes functional operators for map algebra, reducers, joins, and machine learning that scale transparently on Earth Engine’s backend.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    Superduper

    Superduper

    Superduper: Integrate AI models and machine learning workflows

    Superduper is a Python-based framework for building end-2-end AI-data workflows and applications on your own data, integrating with major databases. It supports the latest technologies and techniques, including LLMs, vector-search, RAG, and multimodality as well as classical AI and ML paradigms. Developers may leverage Superduper by building compositional and declarative objects that out-source the details of deployment, orchestration versioning, and more to the Superduper engine. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 6
    LangExtract

    LangExtract

    A Python library for extracting structured information

    LangExtract is a Python library developed by Google that leverages large language models (LLMs) to extract structured information from unstructured text—such as clinical notes, research papers, or literary works—based on user-defined instructions. It is designed to transform free-form text into reliable, schema-constrained data while maintaining traceability back to the source material.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 7
    Kapacitor

    Kapacitor

    Open source framework for processing, monitoring, and alerting

    Open source framework for processing, monitoring, and alerting on time series data. Kapacitor is a real-time data processing engine for monitoring and alerting, specifically designed to work with time-series data from InfluxDB.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    dplyr

    dplyr

    dplyr: A grammar of data manipulation

    dplyr is an R package that provides a consistent and intuitive grammar for data manipulation, enabling users to filter, arrange, summarize, and transform data efficiently. Part of the tidyverse ecosystem, dplyr simplifies complex data operations through a clear and readable syntax, whether working with data frames, tibbles, or databases. It is widely used in data science and statistical analysis workflows.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    ConcurrentSim.jl

    ConcurrentSim.jl

    Discrete event process oriented simulation framework written in Julia

    A discrete event process-oriented simulation framework written in Julia inspired by the Python library SimPy. One of the longest-lived Julia packages (originally under the name SimJulia).
    Downloads: 6 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    Peroxide

    Peroxide

    Rust numeric library with high performance and friendly syntax

    Rust numeric library contains linear algebra, numerical analysis, statistics and machine learning tools with R, MATLAB, Python-like macros. Peroxide uses a 1D data structure to represent matrices, making it straightforward to integrate with BLAS (Basic Linear Algebra Subprograms). This means that Peroxide can guarantee excellent performance for linear algebraic computations by leveraging the optimized routines provided by BLAS.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    redis-py

    redis-py

    Redis Python client

    redis-py is the official Python client for interacting with Redis, the in-memory data structure store. It supports all Redis commands and data types, making it easy to build caching, messaging, or real-time analytics features in Python applications. With both synchronous and asyncio support, redis-py is suited for modern Python projects and integrates smoothly into web frameworks, task queues, and backend services.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    RAG Anything

    RAG Anything

    RAG-Anything: All-in-One RAG Framework

    ...Traditional RAG systems are typically limited to text and cannot effectively work across heterogeneous document layouts, but RAG-Anything addresses this by modeling multimodal content in ways that preserve cross-modal relationships and semantic context, often treating content elements as interconnected knowledge entities rather than separate data silos. The system uses a multi-stage pipeline (e.g., document parsing, content analysis, knowledge graph construction, intelligent retrieval) so queries can navigate across modalities with deeper understanding and relevance.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 13
    PyPDF

    PyPDF

    A pure-python PDF library capable of splitting, merging, cropping

    pypdf is a pure Python library for working with PDF files, allowing developers to split, merge, rotate, encrypt, and extract content from PDFs. It’s an actively maintained fork of PyPDF2, improving performance, compatibility, and support for modern PDF standards. Suitable for both automation scripts and full-featured applications, pypdf handles PDFs without requiring external dependencies.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 14
    Pysheeet

    Pysheeet

    Python Cheat Sheet

    Pysheeet is a community-driven collection of Python code snippets covering common patterns and tasks like sockets, file I/O, data structures, and more. Each snippet is concise and battle-tested, designed to save coding time and reduce boilerplate. With documentation hosted on Read the Docs and an active GitHub repo, it’s a go-to resource for Python developers.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    BCC (BPF Compiler Collection)

    BCC (BPF Compiler Collection)

    Tools for BPF-based Linux IO analysis, networking, monitoring, etc.

    BCC is a toolkit that simplifies creating efficient kernel tracing, monitoring, and manipulation programs by leveraging extended Berkeley Packet Filters (eBPF). It includes a rich set of example tools and scripting interfaces in C, Python, and Lua. BCC makes BPF programs easier to write, with kernel instrumentation in C (and includes a C wrapper around LLVM), and front-ends in Python and lua. It is suited for many tasks, including performance analysis and network traffic control. With a BPF-specific frontend, one should be able to write in a language and receive feedback from the compiler on the validity as it pertains to a BPF backend. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    Stlite

    Stlite

    n-browser Streamlit

    Stlite is a WebAssembly-powered framework that enables Streamlit applications to run entirely in the browser without requiring a Python backend server. It achieves this by using Pyodide, a WebAssembly-based Python runtime, to execute Python code directly within the browser environment. This allows developers to build interactive data applications using familiar Streamlit APIs while eliminating the need for server-side infrastructure. Stlite supports a wide range of Python libraries and enables real-time interactivity, making it suitable for dashboards, data visualization tools, and educational applications. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Yoast WordPress SEO

    Yoast WordPress SEO

    Yoast SEO for WordPress

    The Yoast SEO plugin is the most popular SEO tool for WordPress, offering comprehensive tools to optimize content for search engines. It provides real-time page analysis, readability checks, and automated metadata handling to improve website visibility.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    DBOS Transact PY

    DBOS Transact PY

    Lightweight Durable Python Workflows

    dbos-transact-py is the Python counterpart to dbos-transact-ts, offering durable transactional programming with automatic state persistence in PostgreSQL. It simplifies building resilient and idempotent applications by enabling Python functions to retain their state, restart after failure, and guarantee consistency. It's designed for data-heavy and fault-intolerant use cases.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    PyExcelerate

    PyExcelerate

    Accelerated Excel XLSX Writing Library for Python 2/3

    Accelerated Excel XLSX writing library for Python. PyExcelerate is a Python for writing Excel-compatible XLSX spreadsheet files, with an emphasis on speed.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    Pants Build System

    Pants Build System

    The Pants Build System

    Pants 2 is a fast, scalable, user-friendly build system for codebases of all sizes. It's currently focused on Python, Go, Java, Scala, Kotlin, Shell, and Docker, with support for other languages and frameworks coming soon. A lot of effort has gone into making Pants easy to adopt, easy to use and easy to extend. We're super excited to bring Pants' distinctive features to Go, Java, Python, Scala, Kotlin, and Shell users. Pants requires very minimal BUILD file metadata/boilerplate. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 21
    Netcap

    Netcap

    A framework for secure and scalable network traffic analysis

    The Netcap (NETwork CAPture) framework efficiently converts a stream of network packets into platform-neutral type-safe structured audit records that represent specific protocols or custom abstractions. These audit records can be stored on disk or exchanged over the network, and are well-suited as a data source for machine learning algorithms. Since parsing of untrusted input can be dangerous and network data is potentially malicious, a programming language that provides a garbage-collected...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 22
    Parsera

    Parsera

    Lightweight library for scraping web-sites with LLMs

    Scrape data from any website with only a link and column descriptions. Parsera is a tool designed to scrape web content, specifically handling poorly structured or messy websites.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    NiceGUI

    NiceGUI

    Create web-based user interfaces with Python

    NiceGUI is a Python-based UI framework that enables developers to create interactive web applications using only Python code. It abstracts away the complexities of HTML, CSS, and JavaScript, allowing for rapid development of web interfaces directly from Python scripts. NiceGUI is suitable for building dashboards, control panels, and other web-based tools, especially in contexts like robotics and data visualization.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 24
    DynamicalSystems.jl

    DynamicalSystems.jl

    Award winning software library for nonlinear dynamics timeseries

    DynamicalSystems.jl is an award-winning Julia software library for nonlinear dynamics and nonlinear time series analysis. To install DynamicalSystems.jl, run import Pkg; Pkg.add("DynamicalSystems"). To learn how to use it and see its contents visit the documentation, which you can either find online or build locally by running the docs/make.jl file. DynamicalSystems.jl is part of JuliaDynamics, an organization dedicated to creating high-quality scientific software. All implemented algorithms...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 25
    ModernGL

    ModernGL

    Modern OpenGL binding for Python

    ModernGL is a Python wrapper over OpenGL, designed to simplify the creation of high-performance, modern graphics applications. It provides an intuitive API for rendering 2D and 3D graphics, making it accessible to both beginners and experienced developers. ModernGL is suitable for applications such as games, simulations, and data visualizations.
    Downloads: 6 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB