Showing 9 open source projects for "jpk data processing"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Managed MySQL, PostgreSQL, and SQL Databases on Google Cloud Icon
    Managed MySQL, PostgreSQL, and SQL Databases on Google Cloud

    Get back to your application and leave the database to us. Cloud SQL automatically handles backups, replication, and scaling.

    Cloud SQL is a fully managed relational database for MySQL, PostgreSQL, and SQL Server. We handle patching, backups, replication, encryption, and failover—so you can focus on your app. Migrate from on-prem or other clouds with free Database Migration Service. IDC found customers achieved 246% ROI. New customers get $300 in credits plus a 30-day free trial.
    Try Cloud SQL Free
  • 1
    NYC Taxi Data

    NYC Taxi Data

    Import public NYC taxi and for-hire vehicle (Uber, Lyft)

    The nyc-taxi-data repository is a rich dataset and exploratory project around New York City taxi trip records. It collects and preprocesses large-scale trip datasets (fares, pickup/dropoff, timestamps, locations, passenger counts) to enable data analysis, modeling, and visualization efforts. The project includes scripts and notebooks for cleaning and filtering the raw data, memory-efficient processing for large CSV/Parquet files, and aggregation workflows (e.g. trips per hour, heatmaps of pickups/dropoffs). ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    tidytext

    tidytext

    Text mining using tidy tools

    tidytext brings tidy data principles to text mining by converting text into a tidy data frame format. It provides tools for tokenization, sentiment analysis, n‑gram creation, and term‑document matrices, enabling interoperability with dplyr, ggplot2, and other tidyverse workflows.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    gtsummary

    gtsummary

    Presentation-Ready Data Summary and Analytic Result Tables

    gtsummary is an R package for creating elegant, customizable, publication-ready summary tables of datasets and statistical models. It provides concise code to produce demographic tables (tbl_summary()), regression result tables, and more, with flexible styling options for reporting.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    mlr

    mlr

    Machine Learning in R

    R does not define a standardized interface for its machine-learning algorithms. Therefore, for any non-trivial experiments, you need to write lengthy, tedious, and error-prone wrappers to call the different algorithms and unify their respective output. {mlr} provides this infrastructure so that you can focus on your experiments! The framework provides supervised methods like classification, regression, and survival analysis along with their corresponding evaluation and optimization methods,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 5
    TOFSIMS

    TOFSIMS

    R/Bioconductor toolkit for mass spectrometry data

    The tofsims project is an R/Bioconductor toolkit designed for processing, analyzing, and visualizing imaging mass spectrometry data from Time-of-Flight Secondary Ion Mass Spectrometry (ToF-SIMS) instruments. It supports importing raw and preprocessed data from popular instrument platforms (e.g. IONTOF, Ulvac-Phi) and provides methods for mass calibration, peak picking, and peak integration. The package allows transformation of spectra into 2D image structures (mass images), with operations such as binning, scaling, subsetting, and visual rendering. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    RNAseq Tutorial

    RNAseq Tutorial

    Informatics for RNA-seq: A web resource for analysis on the cloud

    ...The version in that repo is deprecated, but still maintains content for those wishing to follow the original published workflow. Includes instruction on cloud computing basics and using cloud environments for large data processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    RStan

    RStan

    RStan, the R interface to Stan

    RStan is the R interface to Stan, a C++ library for statistical modeling and high-performance statistical computation. It lets users specify models in the Stan modeling language (for Bayesian inference), compile them, and perform inference from R. Key inference approaches include full Bayesian inference via Hamiltonian Monte Carlo (specifically the No-U-Turn Sampler, NUTS), approximate Bayesian inference via variational methods, and optimization (penalized likelihood). RStan integrates with...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    AnomalyDetection

    AnomalyDetection

    Anomaly Detection with R

    AnomalyDetection is an R package developed by Twitter for detecting anomalies in seasonal univariate time series. It implements the Seasonal Hybrid Extreme Studentized Deviate (S‑H‑ESD) test, which reliably identifies both global and local outliers in data with trends and seasonality—commonly applied to system metrics, engagement data, and business KPIs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    ExData Plotting1

    ExData Plotting1

    Plotting Assignment 1 for Exploratory Data Analysis

    ...The dataset covers nearly four years of minute-level measurements, including power consumption, voltage, current intensity, and detailed sub-metering values for different household areas. For analysis, focus is placed on a two-day period in February 2007, highlighting short-term consumption trends. The data requires careful handling due to its size of more than 2 million rows and coded missing values. By processing the date and time fields into proper formats, it becomes possible to generate clear time-series plots of energy usage. The repository demonstrates effective exploratory data analysis practices in R with a reproducible workflow for transforming raw data into visual insights.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Build on Google Cloud with $300 in Free Credit Icon
    Build on Google Cloud with $300 in Free Credit

    New to Google Cloud? Get $300 in free credit to explore Compute Engine, BigQuery, Cloud Run, Vertex AI, and 150+ other products.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query exabytes in BigQuery, or build AI apps with Vertex AI and Gemini. Once your credits are used, keep building with 20+ products with free monthly usage, including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. Sign up to start building right away.
    Start Free Trial
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB
Gen AI apps are built with MongoDB Atlas
Atlas offers built-in vector search and global availability across 125+ regions. Start building AI apps faster, all in one place.