Showing 1471 open source projects for "python data analysis"

  • 1

    DEPRECATED - KVFinder

    Cavity Detection PyMOL plugin

    ...Please read and cite the original paper ParKVFinder: A thread-level parallel approach in biomolecular cavity detection (10.1016/j.softx.2020.100606). [pyKVFinder] pyKVFinder is available in this Python Package Index (PyPI) repository, https://pypi.org/project/pyKVFinder and this GitHub repository, https://github.com/LBC-LNBio/pyKVFinder. Please read and cite the original paper pyKVFinder: an efficient and integrable Python package for biomolecular cavity detection and characterization in data science (10.1186/s12859-021-04519-4).
    Downloads: 0 This Week
  • 2
    OpenFrames

    Real-time interactive 3D graphics API for scientific simulations

    OpenFrames has moved its primary development repository to GitHub! Everything else will follow. Get it at https://github.com/ravidavi/OpenFrames/wiki OpenFrames is an Application Programming Interface (API) that allows developers to add interactive 3D graphics to any scientific simulation. A simulation developer can use OpenFrames to specify what they want to visualize, without having to know any details of computer graphics programming. OpenFrames is currently...
    Downloads: 0 This Week
  • 3
    Davinci

    Davinci is a DVsaaS (Data Visualization as a Service) Platform

    Davinci is oriented towards product managers, business people, data engineers, data analysts, data scientists, etc. It aims to provide a one-stop data visualization solution, which can be used both independently as a public cloud/private cloud and integrated into third-party systems as a plugin. A simple configuration on Davinci UI can meet multiple visualization requirements. It also supports other visualization features like advanced interaction, industry analysis, pattern searching, social intelligence, etc. ...
    Downloads: 5 This Week
  • 4
    Reproducible-research

    A Reproducible Data Analysis Workflow with R Markdown, Git, Make, etc.

    ...The workflow ensures meeting the primary goals that 1) the reporting of statistical results is consistent with the actual statistical results (dynamic report generation), 2) the analysis exactly reproduces at a later point in time even if the computing platform or software is changed (computational reproducibility), and 3) changes at any time (during development and post-publication) are tracked, tagged, and documented while earlier versions of both data and code remain accessible.
    Downloads: 0 This Week
  • 5
    Zipline

    Zipline, a Pythonic algorithmic trading library

    ...Zipline is currently used in production as the backtesting and live-trading engine powering Quantopian -- a free, community-centered, hosted platform for building and executing trading strategies. Quantopian also offers a fully managed service for professionals that includes Zipline, Alphalens, Pyfolio, FactSet data, and more. Installing Zipline is slightly more involved than the average Python package. For a development installation (used to develop Zipline itself), create and activate a virtualenv, then run the etc/dev-install script. Please note that Zipline is not a community-led project. Zipline is maintained by the Quantopian engineering team, and we are quite small and often busy.
    Downloads: 0 This Week
  • 6
    Amazon SageMaker Examples

    Jupyter notebooks that demonstrate how to build models using SageMaker

    ...They have the familiar Jupyter and JupyterLab interfaces that work well for single users, or small teams where users are also administrators. Advanced users also use SageMaker solely with the AWS CLI and Python scripts using boto3 and/or the SageMaker Python SDK.
    Downloads: 0 This Week
  • 7
    Jupytab

    Display in Tableau data from Jupyter notebooks

    Jupytab allows you to explore in Tableau data which is generated dynamically by a Jupyter Notebook. You can thus create Tableau data sources in a very flexible way using all the power of Python. This is achieved by having Tableau access data through a web server created by Jupytab.
    Downloads: 2 This Week
  • 8
    WhyLogs Java Library

    Profile and monitor your ML data pipeline end-to-end

    This is a Java implementation of WhyLogs, with support for Apache Spark integration for large scale datasets. Understanding the properties of data as it moves through applications is essential to keeping your ML/AI pipeline stable and improving your user experience, whether your pipeline is built for production or experimentation. WhyLogs is an open source statistical logging library that allows data science and ML teams to effortlessly profile ML/AI pipelines and applications, producing log files that can be used for monitoring, alerts, analytics, and error analysis. ...
    Downloads: 0 This Week
  • 9
    covid-chestxray-dataset

    We are building an open database of COVID-19 cases with chest X-ray

    To build a public open dataset of chest X-ray and CT images of patients who are positive or suspected of COVID-19 or other viral and bacterial pneumonia (MERS, SARS, and ARDS). Data will be collected from public sources as well as through indirect collection from hospitals and physicians. All images and data will be released publicly in this GitHub repo. This project is approved by the University of Montreal's Ethics Committee #CERSES-20-058-D. We can extract images from publications. Help...
    Downloads: 0 This Week
  • 10
    Pretty Damn Quick (PDQ) analytically solves queueing network models of computer and manufacturing systems, data networks, etc., written in conventional programming languages. Generic or customized reports of predicted performance measures are output.
    Downloads: 4 This Week
  • 11
    Helpdezk

    HelpDEZk is powerful software that manages requests/incidents.

    HelpDEZk is powerful software that manages requests/incidents. It has everything needed for efficient workflow management of all processes involved in service execution. This control covers internal demands as well as outsourced services. HelpDEZk can be used in any area of a company, serving as a support to the shared service center concept; beyond the ability to log all the processes and maintain the request’s history, it can pass it through many approval...
    Downloads: 0 This Week
  • 12
    GPlates

    Interactive visualization of plate tectonics.

    GPlates is a plate-tectonics program. Manipulate reconstructions of geological and paleo-geographic features through geological time. Interactively visualize vector, raster and volume data. PyGPlates is the GPlates Python library. Get fine-grained access to GPlates functionality in your Python scripts.
    Downloads: 22 This Week
  • 13

    QuiclickRNAseq

    An integrated pipeline for RNA-seq analysis

    Quiclick is a Python framework for RNA-seq data analysis, including data trimming, mapping, counting and RNA editing analysis.
    Downloads: 0 This Week
  • 14
    OzGIS free open-source mapping system

    Analysis and display of Census, business, government attribute data

    OzGIS is an extensive mapping system for the analysis and display of geographically referenced data. Map data are downloaded from Census bureaux and map agencies or extracted from your own databases. The system can be used to support management decisions associated with, for example, government planning, marketing, sales, site and personnel location, advertising and research.
    Downloads: 0 This Week
  • 15
    TimothyDocs

    Timothy is a cloud-based storage system designed to document your work

    Timothy is a cloud-based documentation system. Timothy will document any endeavor because it stores not only the documents created during the project but also information about those files. Like most storage schemes, Timothy creates a hierarchy of categories through which one may browse. Timothy displays information about the document or category as well as its name. This use of metadata explains the structure and content of the project to the user as he browses. Users...
    Downloads: 2 This Week
  • 16
    Limada
    Limada (Linked Marked Data) is a framework for managing relationships of data in non-hierarchical structures, where the individual elements are interconnected in complex ways. The philosophy behind it: information is connection, the relationship of data.
    Downloads: 0 This Week
  • 17
    AlphaPy

    Python AutoML for Trading Systems and Sports Betting

    AlphaPy is a Python-based AutoML framework tailored for trading systems and sports betting applications. Built on popular libraries like scikit-learn and pandas, it enables data scientists and speculators to craft predictive models, ensemble strategies, and automated forecasting systems with minimal setup. Run machine learning models using scikit-learn, Keras, xgboost, LightGBM, and CatBoost.
    Downloads: 0 This Week
  • 18
    Optimus

    Agile Data Preparation Workflows made easy with Pandas

    Easily write code to clean, transform, explore and visualize data using Python. Process using a simple API, making it easy to use for newcomers. More than 100 functions to handle strings, process dates, URLs and emails. Easily plot data of any size. Out-of-box functions to explore and fix data quality. Use the same code to process your data on your laptop or in a remote cluster of GPUs.
    Downloads: 0 This Week
  • 19
    StellarGraph

    Machine Learning on Graphs

    StellarGraph is a Python library for machine learning on graphs and networks. The StellarGraph library offers state-of-the-art algorithms for graph machine learning, making it easy to discover patterns and answer questions about graph-structured data. It can solve many machine learning tasks. Graph-structured data represent entities as nodes (or vertices) and relationships between them as edges (or links), and can include data associated with either as attributes. ...
    Downloads: 1 This Week
  • 20
    DBSolveOptimum

    The software for systems biology/pharmacology modeling

    DBSolveOptimum is a stand-alone software tool for the construction and analysis of mathematical models of biological systems. It is a succession and extension of DBSolve 5 and DBSolve 7, which were popular tools for systems biology. In addition to the capabilities of these software packages, DBSolveOptimum is implemented with new tools for extended data analysis and multiple simulations, which are important for simulation of virtual clinical trials and application of modern modeling techniques, like quantitative systems pharmacology, to problems arising in drug research and development. ...
    Downloads: 0 This Week
  • 21

    Spectral Python

    A Python module for hyperspectral image processing

    Spectral Python (SPy) is a Python package for reading, viewing, manipulating, and classifying hyperspectral image (HSI) data. SPy includes functions for clustering, dimensionality reduction, supervised classification, and more.
    Downloads: 0 This Week
  • 22
    Forecasting Best Practices

    Time Series Forecasting Best Practices & Examples

    ...Rather than creating implementations from scratch, we draw from existing state-of-the-art libraries and build additional utilities around processing and featurizing the data, optimizing and evaluating models, and scaling up to the cloud. The examples and best practices are provided as Python Jupyter notebooks and R Markdown files and a library of utility functions.
    Downloads: 0 This Week
  • 23
    repo2docker GitHub Action

    A GitHub action to build data science environment images

    Trigger repo2docker to build a Jupyter-enabled Docker image from your GitHub repository and push this image to a Docker registry of your choice. This will automatically attempt to build an environment from configuration files found in your repository. Images generated by this action are automatically tagged with both latest and <SHA> corresponding to the relevant commit SHA on GitHub. Both tags are pushed to the Docker registry specified by the user. If an existing image with the latest tag...
    Downloads: 0 This Week
  • 24
    RapidMiner -- Data Mining, ETL, OLAP, BI
    ETL, data warehousing, data mining, OLAP, business intelligence (BI) in Java. 500+ modules: extract, transform, load (ETL), data mining, data analysis + Weka, statistical forecasting, preprocessing, validation, visualization, OLAP, business intelligence.
    Downloads: 16 This Week
  • 25
    Poli

    An easy-to-use BI server built for SQL lovers. Power data analysis

    An easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights. Platform independent web application. Single JAR file + Single SQLite DB file. Get up and running in 5 minutes. PostgreSQL, Oracle, SQL Server, MySQL, Elasticsearch... You name it. No ETLs, no generated SQL, polish your own SQL query to transform data. Pixel-perfect positioning + Drag'n'Drop support to customize the reports and charts in your own way.
    Downloads: 1 This Week