Showing 1471 open source projects for "python data analysis"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    Data Preprocessing Automate

    Data Preprocessing Automate

    Data Preprocessing Automation: A GUI for easy data cleaning & visualiz

    Data Preprocessing Automation is a Python-based GUI application designed to simplify and automate data preprocessing tasks. It allows users to upload Excel files, automatically handle missing values, remove duplicates, and detect and remove outliers using statistical methods. The application provides data visualization tools, including box plots for distribution analysis and scatter plots for exploring relationships between variables.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    OpenCL.jl

    OpenCL.jl

    OpenCL Julia bindings

    Julia interface for the OpenCL parallel computation API. This package aims to be a complete solution for OpenCL programming in Julia, similar in scope to PyOpenCL for Python. It provides a high level API for OpenCL to make programing hardware accelerators, such as GPUs, FPGAs, and DSPs, as well as multicore CPUs much less onerous.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3
    city-roads

    city-roads

    Visualization of all roads within any city

    city-roads is a data visualization and mapping project that renders street networks of cities as vector paths, offering an interactive, zoomable experience that highlights how cities are stitched together by their road infrastructure. It typically fetches open map data (such as from OpenStreetMap) and then processes that data into geometry suited for rendering in the browser, allowing users to explore intricate road layouts at various scales. The visual style often abstracts away labels and...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Data Envelopment Analysis using Stata

    Data Envelopment Analysis using Stata

    Develop a user written Data Envelopment Analysis package in Stata.

    .... * Book "A Handbook of Data Envelopment Analysis using Stata" published in Amazon : https://www.amazon.com/-/ko/dp/B0FH6N3168/ref=tmm_hrd_swatch_0
    Downloads: 7 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 5
    CUDOS Framework

    CUDOS Framework

    Command Line Interface tool for Cloud Intelligence Dashboards

    The AWS Cloud Intelligence Dashboards Framework is a set of open-source tools and templates designed to help organizations deploy and manage advanced data visualization dashboards that offer insights into cost, usage, governance, and operational health across AWS environments. It is part of the AWS Solutions Library and includes CloudFormation templates, CLI commands, and pre-built dashboards that collect, process, and visualize data from AWS billing, cost management, budgets, and usage...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 6
    XGBoost

    XGBoost

    Scalable and Flexible Gradient Boosting

    ...XGBoost works by implementing machine learning algorithms under the Gradient Boosting framework. It also offers parallel tree boosting (GBDT, GBRT or GBM) that can quickly and accurately solve many data science problems. XGBoost can be used for Python, Java, Scala, R, C++ and more. It can run on a single machine, Hadoop, Spark, Dask, Flink and most other distributed environments, and is capable of solving problems beyond billions of examples.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 7
    JDF.jl

    JDF.jl

    Julia DataFrames serialization format

    ...JDF.jl is a pure-Julia solution and there are a lot of ways to do nifty things like compression and encapsulating the underlying struture of the arrays that's hard to do in R and Python. E.g. Python's numpy arrays are C objects, but all the vector types used in JDF are Julia data types.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 8
    DataHub

    DataHub

    The Metadata Platform for your Data and AI Stack

    DataHub is an open source metadata platform that helps organizations discover, understand, and trust their data assets at scale. It models data as a richly connected graph spanning datasets, dashboards, pipelines, ML features, and services, so users can explore relationships like lineage and ownership across tools and domains. The platform focuses on continuous metadata ingestion from many sources, treating metadata as a stream that stays fresh as systems change. A modern web UI and search...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    Beads Viewer

    Beads Viewer

    Task management system for coding agents

    Beads Viewer is a visualization tool designed to display, inspect, and interact with “bead plots,” graphical representations often used in statistical analysis, time series exploration, or clustering visualizations where data points are represented as beads on strands or axes to emphasize patterns and relationships. The viewer provides an intuitive graphical interface that automatically renders bead plots from structured data inputs, enabling users to observe trends at a glance, highlight clusters, or drill down into individual sequences. ...
    Downloads: 18 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10
    Azure Cost CLI

    Azure Cost CLI

    CLI tool to perform cost analysis on your Azure subscription

    azure-cost-cli is a command-line tool that simplifies cost analysis for Microsoft Azure subscriptions by retrieving, summarizing, and visualizing spending data directly from the CLI, helping cloud operators and finance teams understand where resources are generating costs. Users can run the tool against their Azure account to output cost breakdowns by service, resource group, tag, or timeframe, enabling quick identification of costly services or budget overruns without needing to navigate the Azure portal. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Sundials.jl

    Sundials.jl

    Julia interface to Sundials, including a nonlinear solver

    This is a suite for numerically solving differential equations written in Julia and available for use in Julia, Python, and R. The purpose of this package is to supply efficient Julia implementations of solvers for various differential equations.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 12
    D-Tale

    D-Tale

    Visualizer for pandas data structures

    D-Tale is the combination of a Flask backend and a React front-end to bring you an easy way to view & analyze Pandas data structures. It integrates seamlessly with ipython notebooks & python/ipython terminals. Currently, this tool supports such Pandas objects as DataFrame, Series, MultiIndex, DatetimeIndex & RangeIndex. D-Tale was the product of a SAS to Python conversion. What was originally a perl script wrapper on top of SAS's insight function is now a lightweight web client on top of Pandas data structures. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    borb

    borb

    borb is a library for reading, creating and manipulating PDF files

    borb is a library for creating and manipulating PDF files in python. borb is a pure python library to read, write, and manipulate PDF documents. It represents a PDF document as a JSON-like data structure of nested lists, dictionaries and primitives (numbers, string, booleans, etc) This is currently a one-man project, so the focus will always be to support those use-cases that are more common in favor of those that are rare.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 14
    Circuitscape.jl

    Circuitscape.jl

    Algorithms from circuit theory to predict connectivity

    Circuitscape is an open-source program that uses circuit theory to model connectivity in heterogeneous landscapes. Its most common applications include modeling the movement and gene flow of plants and animals, as well as identifying areas important for connectivity conservation. The new Circuitscape is built entirely in the Julia language, a new programming language for technical computing. Julia is built from the ground up to be fast. As such, this offers a number of advantages over the...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 15
    Salt

    Salt

    Automate the management and configuration of infrastructures at scale

    Salt Bootstrap is a shell script that detects the target platform and selects the best installation method. (Supported Platforms) Your Salt master can manage itself, so a Salt minion is installed along with the Salt master. Before implementing Salt to manage any data center infrastructure it is important to understand a few things about Salt platform support including: What platforms will the Salt Master run on? What systems and infrastructure can be managed by a Salt Minion? Salt runs on...
    Downloads: 63 This Week
    Last Update:
    See Project
  • 16
    CocoIndex

    CocoIndex

    ETL framework to index data for AI, such as RAG

    CocoIndex is an open-source framework designed for building powerful, local-first semantic search systems. It lets users index and retrieve content based on meaning rather than keywords, making it ideal for modern AI-based search applications. CocoIndex leverages vector embeddings and integrates with various models and frameworks, including OpenAI and Hugging Face, to provide high-quality semantic understanding. It’s built for transparency, ease of use, and local control over your search...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 17
    JUDI.jl

    JUDI.jl

    Julia Devito inversion

    JUDI is a framework for large-scale seismic modeling and inversion and is designed to enable rapid translations of algorithms to fast and efficient code that scales to industry-size 3D problems. The focus of the package lies on seismic modeling as well as PDE-constrained optimization such as full-waveform inversion (FWI) and imaging (LS-RTM). Wave equations in JUDI are solved with Devito, a Python domain-specific language for automated finite-difference (FD) computations. JUDI's modeling...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 18
    Fondant

    Fondant

    Production-ready data processing made easy and shareable

    Fondant is a modular, pipeline-based framework designed to simplify the preparation of large-scale datasets for training machine learning models, especially foundation models. It offers an end-to-end system for ingesting raw data, applying transformations, filtering, and formatting outputs—all while remaining scalable and traceable. Fondant is designed with reproducibility in mind and supports containerized steps using Docker, making it easy to share and reuse data processing components....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Siddhi Core Libraries

    Siddhi Core Libraries

    Stream Processing and Complex Event Processing Engine

    Fully open source, cloud-native, scalable, micro streaming, and complex event processing system capable of building event-driven applications for use cases such as real-time analytics, data integration, notification management, and adaptive decision-making. Event processing logic can be written using Streaming SQL queries via graphical and source editors, to capture events from diverse data sources, process and analyze them, integrate with multiple services and data stores, and publish output to various endpoints in real time. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    Awesome-Quant

    Awesome-Quant

    A curated list of insanely awesome libraries, packages and resources

    awesome-quant is a curated list (“awesome list”) of libraries, packages, articles, and resources for quantitative finance (“quants”). It includes tools, frameworks, research papers, blogs, datasets, etc. It aims to help people working in algorithmic trading, quant investing, financial engineering, etc., find useful open source or educational resources. Licensed under typical “awesome” list standards.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    ImportExcel

    ImportExcel

    PowerShell module to import/export Excel spreadsheets, without Excel

    ...It exposes straightforward cmdlets like Import-Excel and Export-Excel that convert between Excel sheets and PowerShell objects, making it simple to pipeline tabular data into reporting and automation flows. Advanced features include adding and formatting tables, setting number/date formats, creating charts, and applying styling or conditional formatting programmatically. The module is optimized for performance (streaming where possible) and supports large datasets, making it useful for ETL tasks, automated reporting, and data analysis in pure PowerShell environments. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 22
    gramm

    gramm

    Gramm is a complete data visualization toolbox for Matlab

    Gramm is a MATLAB toolbox that enables the rapid creation of complex, publication-quality figures. Its design philosophy focuses on a declarative approach, where users specify the desired end result, as opposed to the traditional imperative method involving for loops, if/else statements, etc. The MATLAB implementation of gramm is inspired by the "grammar of graphics" principles (Wilkinson 1999) and the ggplot2 library for R by Hadley Wickham. As a reference to this inspiration, gramm stands...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    dlib

    dlib

    Toolkit for making machine learning and data analysis applications

    Dlib is a modern C++ toolkit containing machine learning algorithms and tools for creating complex software in C++ to solve real world problems. It is used in both industry and academia in a wide range of domains including robotics, embedded devices, mobile phones, and large high performance computing environments. Dlib's open source licensing allows you to use it in any application, free of charge. Good unit test coverage, the ratio of unit test lines of code to library lines of code is...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 24
    JavaParser

    JavaParser

    Java 1-17 Parser and Abstract Syntax Tree for Java

    This project contains a set of libraries implementing a Java 1.0 - Java 17 Parser with advanced analysis functionalities. The project binaries are available in Maven Central. We strongly advise users to adopt Maven, Gradle or another build system for their projects. If you are not familiar with them we suggest taking a look at the maven quickstart projects. Since Version 3.5.10, the JavaParser project includes the JavaSymbolSolver. While JavaParser generates an Abstract Syntax Tree,...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 25
    FiniteDifferences.jl

    FiniteDifferences.jl

    High accuracy derivatives, estimated via numerical finite differences

    FiniteDifferences.jl estimates derivatives with finite differences. See also the Python package FDM. FiniteDiff.jl and FiniteDifferences.jl are similar libraries: both calculate approximate derivatives numerically. You should definitely use one or the other, rather than the legacy Calculus.jl finite differencing, or reimplementing it yourself. At some point in the future, they might merge, or one might depend on the other.
    Downloads: 4 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB