Showing 541 open source projects for "data"

View related business solutions
  • Cut Data Warehouse Costs up to 54% with BigQuery Icon
    Cut Data Warehouse Costs up to 54% with BigQuery

    Migrate from Snowflake, Databricks, or Redshift with free migration tools. Exabyte scale without the Exabyte price.

    BigQuery delivers up to 54% lower TCO than cloud alternatives. Migrate from legacy or competing warehouses using free BigQuery Migration Service with automated SQL translation. Get serverless scale with no infrastructure to manage, compressed storage, and flexible pricing—pay per query or commit for deeper discounts. New customers get $300 in free credit.
    Try BigQuery Free
  • 99.99% Uptime for MySQL and PostgreSQL on Google Cloud Icon
    99.99% Uptime for MySQL and PostgreSQL on Google Cloud

    Enterprise Plus edition delivers sub-second maintenance downtime and 2x read/write performance. Built for critical apps.

    Cloud SQL Enterprise Plus gives you a 99.99% availability SLA with near-zero downtime maintenance—typically under 10 seconds. Get 2x better read/write performance, intelligent data caching, and 35 days of point-in-time recovery. Supports MySQL, PostgreSQL, and SQL Server with built-in vector search for gen AI apps. New customers get $300 in free credit.
    Try Cloud SQL Free
  • 1
    Jupytab

    Jupytab

    Display in Tableau data from Jupyter notebooks

    Jupytab allows you to explore in Tableau data which is generated dynamically by a Jupyter Notebook. You can thus create Tableau data sources in a very flexible way using all the power of Python. This is achieved by having Tableau access data through a web server created by Jupytab.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    GPlates

    GPlates

    Interactive visualization of plate tectonics.

    GPlates is a plate-tectonics program. Manipulate reconstructions of geological and paleo-geographic features through geological time. Interactively visualize vector, raster and volume data. PyGPlates is the GPlates Python library. Get fine-grained access to GPlates functionality in your Python scripts.
    Downloads: 21 This Week
    Last Update:
    See Project
  • 3
    Optimus

    Optimus

    Agile Data Preparation Workflows made easy with Pandas

    Easily write code to clean, transform, explore and visualize data using Python. Process using a simple API, making it easy to use for newcomers. More than 100 functions to handle strings, process dates, urls and emails. Easily plot data from any size. Out-of-box functions to explore and fix data quality. Use the same code to process your data in your laptop or in a remote cluster of GPUs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    repo2docker GitHub Action

    repo2docker GitHub Action

    A GitHub action to build data science environment images

    Trigger repo2docker to build a Jupyter enabled Docker image from your GitHub repository and push this image to a Docker registry of your choice. This will automatically attempt to build an environment from configuration files found in your repository. Images generated by this action are automatically tagged with both latest and <SHA> corresponding to the relevant commit SHA on GitHub. Both tags are pushed to the Docker registry specified by the user. If an existing image with the latest tag...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Run Any Workload on Compute Engine VMs Icon
    Run Any Workload on Compute Engine VMs

    From dev environments to AI training, choose preset or custom VMs with 1–96 vCPUs and industry-leading 99.95% uptime SLA.

    Compute Engine delivers high-performance virtual machines for web apps, databases, containers, and AI workloads. Choose from general-purpose, compute-optimized, or GPU/TPU-accelerated machine types—or build custom VMs to match your exact specs. With live migration and automatic failover, your workloads stay online. New customers get $300 in free credits.
    Try Compute Engine
  • 5
    MMdnn

    MMdnn

    Tools to help users inter-operate among deep learning frameworks

    MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, PyTorch Onnx and CoreML. MMdnn is a comprehensive and cross-framework tool to convert, visualize and diagnose deep learning (DL) models. The "MM" stands for model management, and "dnn" is the acronym of deep neural network. We implement a universal converter to convert DL models between frameworks,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Forecasting Best Practices

    Forecasting Best Practices

    Time Series Forecasting Best Practices & Examples

    ...Rather than creating implementations from scratch, we draw from existing state-of-the-art libraries and build additional utilities around processing and featuring the data, optimizing and evaluating models, and scaling up to the cloud. The examples and best practices are provided as Python Jupyter notebooks and R markdown files and a library of utility functions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    StellarGraph

    StellarGraph

    Machine Learning on Graphs

    ...The StellarGraph library offers state-of-the-art algorithms for graph machine learning, making it easy to discover patterns and answer questions about graph-structured data. It can solve many machine learning tasks. Graph-structured data represent entities as nodes (or vertices) and relationships between them as edges (or links), and can include data associated with either as attributes. For example, a graph can contain people as nodes and friendships between them as links, with data like a person’s age and the date a friendship was established. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    SageMaker Containers

    SageMaker Containers

    Create SageMaker-compatible Docker containers

    Amazon SageMaker is a fully managed service for data science and machine learning (ML) workflows. You can use Amazon SageMaker to simplify the process of building, training, and deploying ML models. To train a model, you can include your training script and dependencies in a Docker container that runs your training code. A container provides an effectively isolated environment, ensuring a consistent runtime and reliable training process.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    Spectral Python

    A python module for hyperspectral image processing

    Spectral Python (SPy) is a python package for reading, viewing, manipulating, and classifying hyperspectral image (HSI) data. SPy includes functions for clustering, dimensionality reduction, supervised classification, and more.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build on Google Cloud with $300 in Free Credit Icon
    Build on Google Cloud with $300 in Free Credit

    New to Google Cloud? Get $300 in free credit to explore Compute Engine, BigQuery, Cloud Run, Vertex AI, and 150+ other products.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query exabytes in BigQuery, or build AI apps with Vertex AI and Gemini. Once your credits are used, keep building with 20+ products with free monthly usage, including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. Sign up to start building right away.
    Start Free Trial
  • 10
    Crystalsim -  XRD hkl simulation

    Crystalsim - XRD hkl simulation

    X-ray diffraction (XRD) analysis for hkl simulation of any crystal.

    Crystalsim is a simple freeware program with a neat graphical user interface for X-ray diffraction (XRD) data analysis . It can simulates all possible {hkl} planes data for the selected crystal. Crystallographic Information File (.cif) can also be used. Analyze both powder diffraction and single crystal data . Indexed at International Union of Crystallography (IUCR). Crystalline lattice parameters such as ‘a’, ‘b’, ‘c’ as well as interfacial angles such as alpha, beta, gamma can also be entered manually. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 11
    QtiPlot
    QtiPlot is a user-friendly, platform independent data analysis and visualization application similar to the non-free Windows program Origin.
    Downloads: 56 This Week
    Last Update:
    See Project
  • 12
    An Open Source IEC 61131-3 Integrated Development Environment, providing PLCOpen SoftPLC programming, CanOpen IO's, and SVG based HMI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    SCiDA Pro

    SCiDA Pro

    Program for solar cell production data analysis

    The purpose of the SCiDA Pro program is to help with processing solar cell production data. It has the following features: - Able to handle large data sets in a fast way (e.g. plotting 100k cell data takes a few seconds) - Easy data filtering - Easy generation of a data summary report - Extensive data plotting features - Cross-platform (Windows/Linux/MacOS) - Supports multiple languages
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    istSOS

    istSOS

    Free and Open Source Sensor Observation Service Data Management System

    istSOS is an OGC SOS server implementation written in Python. istSOS allows for managing and dispatch observations from monitoring sensors according to the Sensor Observation Service standard. The project provides also a Graphical user Interface that allows for easing the daily operations and a RESTful Web api for automatizing administration procedures. istSOS is released under the GPL License, and runs on all major platforms (Windows, Linux, Mac OS X), even though tests were conducted...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    The TRANSIMS Studio application is an integrated development environment for the TRansportation ANalysis and SIMulation System (TRANSIMS). Components include a run time environment to execute TRANSIMS in parallel, as well as a full featured GUI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    NYCOpenData-Profiling-Analysis

    NYCOpenData-Profiling-Analysis

    Open Data Profiling, Quality and Analysis on NYC OpenData dataset

    Open data often comes with little or no metadata. You will profile a large collection of open data sets and derive metadata that can be used for data discovery, querying, and identification of data quality problems. For each column, identify and summarize the semantic types present in the column. These can be generic types (e.g., city, state) or collection-specific types (NYU school names, NYC agency).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    MCNPydE

    MCNPydE

    MCNP data extraction and display software library

    MCNPydE is a Python library for extracting data from MCNP output file. It requires Python, Matplotlib and Numpy. It is a data reduction tool for MCNP output for ease of results analysis and viewing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    NCVTK: A VTK-based tool to visualize data stored in the NetCDF file format.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    abu

    abu

    Abu quantitative trading system (stocks, options, futures, bitcoin)

    Abu Quantitative Integrated AI Big Data System, K-Line Pattern System, Classic Indicator System, Trend Analysis System, Time Series Dimension System, Statistical Probability System, and Traditional Moving Average System conduct in-depth quantitative analysis of investment varieties, completely crossing the user's complex code quantification stage, more suitable for ordinary people to use, towards the era of vectorization 2.0.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    nonechucks

    nonechucks

    Deal with bad samples in your dataset dynamically

    ...Or what if your dataset is a folder full of scanned PDFs that you have to OCRize, and then run a language detector on the resulting text, because you want only the ones that are in English? Or maybe you have an AlternateIndexSampler, and you want to be able to move to dataset[6] after dataset[4] fails while attempting to load! PyTorch's data processing module expects you to rid your dataset of any unwanted or invalid samples before you feed them into its pipeline, and provides no easy way to define a "fallback policy" in case such samples are encountered during dataset iteration.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Olex2 is visualisation software for small-molecule crystallography developed at Durham University/EPSRC. It provides comprehensive tools for crystallographic model manipulation for the end user and an extensible development framework for programmers. The project has been supported by Olexsys Ltd since 2010.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    GEOMS2

    GEOMS2

    Geostatistics and geosciences modeling software

    GEOMS2 is a geostatistics and geosciences modeling software. Provides interface for grid (mesh), point, surface and data (non-spatial) objects. It has a 3D viewer and 2D plots using the well known Python engines Mayavi and Matplotlib. It has several functions to manipulate your data as well as provide univariate and multivariate analysis. NOTE: The software is still an early beta. Please tell us if you found a bug. Download datasets for students of Geostatistics 2017 (Petroleum Engineering): https://sourceforge.net/projects/geoms2/files/Geostatistics_Petroleum.zip/download Download datasets for students of Geostatistics 2017 (Geology and Mining Engineering): https://sourceforge.net/projects/geoms2/files/Geostatistics_Mining.zip/download Old: http://sourceforge.net/projects/geoms2/files/SETS_geoestatistica2_2015.7z/download https://sites.google.com/site/cmrpsoftware/downloads/Quarry_sets.7z?...
    Leader badge
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    LaueTools

    LaueTools

    open source python packages for X-ray MicroLaue Diffraction analysis

    LaueTools is an open-source project for white beam Laue x-ray microdiffraction data analysis including tools in image processing, peaks searching & indexing, crystal structure solving (orientation & strain) and data & grain mapping visualisation. Python 3 Code and new features are now at: https://gitlab.esrf.fr/micha/lauetools
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    PyMOL Molecular Graphics System

    PyMOL Molecular Graphics System

    PyMOL is an OpenGL based molecular visualization system

    The Open-Source PyMOL repository has been moved to github: https://github.com/schrodinger/pymol-open-source We still use the pymol-users mailing list here on sourceforge. Please subscribe for community support: https://pymol.org/maillist (Note: SourceForge email newsletter and special offers are optional and can be unchecked) The PyMOL community wiki has its own home: https://pymolwiki.org/
    Downloads: 56 This Week
    Last Update:
    See Project
  • 25
    GeoNotebook

    GeoNotebook

    A Jupyter notebook extension for geospatial visualization and analysis

    GeoNotebook is an open-source extension to the Jupyter Notebook ecosystem that equips users with powerful geospatial visualization and analysis capabilities directly within the notebook interface. It integrates with GeoJS and other geospatial services to enable rich, interactive map rendering, layer control, and GIS data manipulation alongside traditional code and markdown cells in a Jupyter environment. Users can execute Python geospatial analysis and immediately visualize results on slippy web maps, allowing them to explore, annotate, and interpret large spatial datasets without leaving the notebook. GeoNotebook bridges the gap between data science workflows and GIS exploration by combining the flexibility of interactive notebooks with browser-based map display driven by a Python backend and WebGL/Canvas tools. ...
    Downloads: 1 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB