Showing 1471 open source projects for "python data analysis"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 1
    Orange Data Mining

    Orange Data Mining

    Orange: Interactive data analysis

    ...Interactive data exploration for rapid qualitative analysis with clean visualizations. Graphic user interface allows you to focus on exploratory data analysis instead of coding, while clever defaults make fast prototyping of a data analysis workflow extremely easy. Place widgets on the canvas, connect them, load your datasets and harvest the insight!
    Downloads: 48 This Week
    Last Update:
    See Project
  • 2
    pandas

    pandas

    Fast, flexible and powerful Python data analysis toolkit

    pandas is a Python data analysis library that provides high-performance, user friendly data structures and data analysis tools for the Python programming language. It enables you to carry out entire data analysis workflows in Python without having to switch to a more domain specific language. With pandas, performance, productivity and collaboration in doing data analysis in Python can significantly increase. ...
    Downloads: 119 This Week
    Last Update:
    See Project
  • 3
    Cookiecutter Data Science

    Cookiecutter Data Science

    Project structure for doing and sharing data science work

    A logical, reasonably standardized, but flexible project structure for doing and sharing data science work. When we think about data analysis, we often think just about the resulting reports, insights, or visualizations. While these end products are generally the main event, it's easy to focus on making the products look nice and ignore the quality of the code that generates them. Because these end products are created programmatically, code quality is still important! ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Intelligent stock analysis system

    Intelligent stock analysis system

    LLM-driven A/H/US stock intelligent analyzer

    Intelligent stock analysis system is a Python-based smart stock analysis system that leverages large language models to automatically analyze selected equities across A-shares, Hong Kong stocks, and U.S. markets. It’s designed to produce a daily “decision dashboard” summarizing key insights such as core conclusions, precise entry/exit points, and checklists for potential trades, combining multi-dimensional technical analysis, market sentiment, chip distribution, and real-time price data.
    Downloads: 12 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    AI Data Science Team

    AI Data Science Team

    An AI-powered data science team of agents

    AI Data Science Team is a Python library and agent ecosystem designed to accelerate and automate common data science workflows by modeling them as specialized AI “agents” that can be orchestrated to perform tasks like data cleaning, transformation, analysis, visualization, and machine learning. It provides a modular agent framework where each agent focuses on a step in the typical data science pipeline — for example, loading data from CSV/Excel files, cleaning and wrangling messy datasets, engineering predictive features, building models with AutoML, connecting to SQL databases, and producing visual outputs — all driven by natural language or programmatic instructions. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    Best-of Python

    Best-of Python

    A ranked list of awesome Python open-source libraries

    ...If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome! Ranked list of awesome python libraries for web development. Correctly generate plurals, ordinals, indefinite articles; convert numbers. Libraries for loading, collecting, and extracting data from a variety of data sources and formats. Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    geemap

    geemap

    A Python package for interactive geospaital analysis and visualization

    A Python package for interactive geospatial analysis and visualization with Google Earth Engine. Geemap is a Python package for geospatial analysis and visualization with Google Earth Engine (GEE), which is a cloud computing platform with a multi-petabyte catalog of satellite imagery and geospatial datasets. During the past few years, GEE has become very popular in the geospatial community and it has empowered numerous environmental applications at local, regional, and global scales. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    scikit-learn

    scikit-learn

    Machine learning in Python

    scikit-learn is an open source Python module for machine learning built on NumPy, SciPy and matplotlib. It offers simple and efficient tools for predictive data analysis and is reusable in various contexts.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 9
    AWS Data Wrangler

    AWS Data Wrangler

    Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.

    An AWS Professional Service open-source python initiative that extends the power of Pandas library to AWS connecting DataFrames and AWS data-related services. Easy integration with Athena, Glue, Redshift, Timestream, OpenSearch, Neptune, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON, and EXCEL). Built on top of other open-source projects like Pandas, Apache Arrow and Boto3, it offers abstracted functions to execute usual ETL tasks like load/unload data from Data Lakes, Data Warehouses, and Databases. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    ydata-profiling

    ydata-profiling

    Create HTML profiling reports from pandas DataFrame objects

    ydata-profiling primary goal is to provide a one-line Exploratory Data Analysis (EDA) experience in a consistent and fast solution. Like pandas df.describe() function, that is so handy, ydata-profiling delivers an extended analysis of a DataFrame while allowing the data analysis to be exported in different formats such as html and json.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    LinkdAPI Python

    LinkdAPI Python

    Powerful, lightweight B2B Data API for enriching professional profiles

    LinkdAPI Python is a software development kit designed to simplify integration with the LinkdAPI platform, enabling developers to connect applications with external services through a unified API layer. It provides structured tools, libraries, and documentation that streamline authentication, request handling, and data exchange between systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    airda

    airda

    airda(Air Data Agent

    airda(Air Data Agent) is a multi-smart body for data analysis, capable of understanding data development and data analysis needs, understanding data, generating data-oriented queries, data visualization, machine learning and other tasks of SQL and Python codes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    ArviZ.jl

    ArviZ.jl

    Exploratory analysis of Bayesian models with Julia

    ArviZ.jl (pronounced "AR-vees") is a Julia package for exploratory analysis of Bayesian models. It includes functions for posterior analysis, model checking, comparison and diagnostics.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    PyVista

    PyVista

    3D plotting and mesh analysis through a streamlined interface

    ...This module can be used for scientific plotting for presentations and research papers as well as a supporting module for other mesh-dependent Python modules. Easily integrate with NumPy and create a variety of geometries and plot them. You could use any geometry to create your glyphs, or even plot the points directly. Direct access to mesh analysis and transformation routines. Intuitive plotting routines with matplotlib similar syntax.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 15
    leafmap

    leafmap

    A Python package for interactive mapping and geospatial analysis

    A Python package for geospatial analysis and interactive mapping in a Jupyter environment. Leafmap is a Python package for interactive mapping and geospatial analysis with minimal coding in a Jupyter environment. It is a spin-off project of the geemap Python package, which was designed specifically to work with Google Earth Engine (GEE).
    Downloads: 7 This Week
    Last Update:
    See Project
  • 16
    Positron

    Positron

    Positron, a next-generation data science IDE

    Positron is a next-generation integrated development environment (IDE) created by Posit PBC (formerly RStudio Inc) specifically tailored for data science workflows in Python, R, and multi-language ecosystems. It aims to unify exploratory data analysis, production code, and data-app authoring in a single environment so that data scientists move from “question → insight → application” without switching tools. Built on the open-source Code-OSS foundation, Positron provides a familiar coding experience along with specialized panes and tooling for variable inspection, data-frame viewing, plotting previews, and interactive consoles designed for analytical work. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17
    Sweetviz

    Sweetviz

    Visualize and compare datasets, target values and associations

    Sweetviz is an open-source Python library that generates beautiful, high-density visualizations to kickstart EDA (Exploratory Data Analysis) with just two lines of code. Output is a fully self-contained HTML application. The system is built around quickly visualizing target values and comparing datasets. Its goal is to help quick analysis of target characteristics, training vs testing data, and other such data characterization tasks. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Astropy

    Astropy

    Repository for the Astropy core package

    ...The Anaconda Python Distribution includes Astropy and is the recommended way to install both Python and the Astropy package. The astropy package contains key functionality and common tools needed for performing astronomy and astrophysics with Python. It is at the core of the Astropy Project, which aims to enable the community to develop a robust ecosystem of affiliated packages covering a broad range of needs for astronomical research, data processing, and data analysis.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    reticulate

    reticulate

    R Interface to Python

    reticulate is an R package from Posit that creates seamless interoperability between R and Python. It lets you call Python modules, classes, and functions from within R, automatically translating between R and Python data structures. Useful for combining Python tooling with R projects, data analysis, and RMarkdown reports.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    ValueCell

    ValueCell

    Community-driven, multi-agent platform for financial applications

    ValueCell is a community-driven multi-agent AI platform focused on financial research, analysis, and decision-making that lets users leverage multiple specialized AI agents for tasks like data retrieval, investment research, strategy execution, and market tracking. The system brings together a suite of collaborative agents—such as research agents that gather and interpret fundamentals, strategy agents that implement trading logic, and news agents that deliver personalized updates—to help users make more informed financial decisions across stocks, crypto, and other markets. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 21
    UCP Python SDK

    UCP Python SDK

    The official Python SDK for UCP

    ...This SDK provides Pydantic models for UCP schemas, making it easy for Python developers to construct, validate, and serialize protocol messages and data structures according to the UCP specification. By adhering to the official protocol standards, applications built on this SDK can participate in tasks like capability discovery, checkout flows, order management, and more, while remaining interoperable across different UCP implementations and surfaces.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    Google Spreadsheets Python

    Google Spreadsheets Python

    Google Sheets Python API

    gspread is a Python API for Google Sheets. A service account is a special type of Google account intended to represent a non-human user that needs to authenticate and be authorized to access data in Google APIs [sic]. Since it’s a separate account, by default it does not have access to any spreadsheet until you share it with this account. Just like any other Google account.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 23
    FinMind

    FinMind

    Open Data, more than 50 financial data

    In the era of big data, data is the foundation of everything. We collect more than 50 kinds of Taiwan stock related information and provide download, online analysis, and backtesting. Regardless of the program, you can download data through the api provided by FinMind, or you can download data directly from the website. After data is available, statistical analysis, regression analysis, time series analysis, machine learning, and deep learning can be performed. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 24
    MultivariateStats.jl

    MultivariateStats.jl

    A Julia package for multivariate statistics and data analysis

    A Julia package for multivariate statistics and data analysis (e.g. dimensionality reduction).
    Downloads: 4 This Week
    Last Update:
    See Project
  • 25
    Panda-Helper

    Panda-Helper

    Panda-Helper: Data profiling utility for Pandas DataFrames and Series

    Panda-Helper is a simple data-profiling utility for Pandas DataFrames and Series. Assess data quality and usefulness with minimal effort. Quickly perform initial data exploration, so you can move on to more in-depth analysis.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB