Showing 22 open source projects for "data"

View related business solutions
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • Easily Host LLMs and Web Apps on Cloud Run Icon
    Easily Host LLMs and Web Apps on Cloud Run

    Run everything from popular models with on-demand NVIDIA L4 GPUs to web apps without infrastructure management.

    Run frontend and backend services, batch jobs, host LLMs, and queue processing workloads without the need to manage infrastructure. Cloud Run gives you on-demand GPU access for hosting LLMs and running real-time AI—with 5-second cold starts and automatic scale-to-zero so you only pay for actual usage. New customers get $300 in free credit to start.
    Try Cloud Run Free
  • 1
    data.table

    data.table

    Extends base R’s data for high-performance data manipulation

    data.table is an R package that extends base R’s data.frame for high-performance data manipulation. It offers concise syntax, blazing speed, and memory-efficient operations. It supports fast file reading/writing, joins, grouping, reshaping, and updates by reference. It is heavily used in large data workflows, big data in R, production pipelines, etc. Extremely efficient grouping/aggregation/summarization; can handle very large datasets (hundreds of millions to billions of rows) in memory (if available). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    NYC Taxi Data

    NYC Taxi Data

    Import public NYC taxi and for-hire vehicle (Uber, Lyft)

    The nyc-taxi-data repository is a rich dataset and exploratory project around New York City taxi trip records. It collects and preprocesses large-scale trip datasets (fares, pickup/dropoff, timestamps, locations, passenger counts) to enable data analysis, modeling, and visualization efforts. The project includes scripts and notebooks for cleaning and filtering the raw data, memory-efficient processing for large CSV/Parquet files, and aggregation workflows (e.g. trips per hour, heatmaps of pickups/dropoffs). ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    dplyr

    dplyr

    dplyr: A grammar of data manipulation

    dplyr is an R package that provides a consistent and intuitive grammar for data manipulation, enabling users to filter, arrange, summarize, and transform data efficiently. Part of the tidyverse ecosystem, dplyr simplifies complex data operations through a clear and readable syntax, whether working with data frames, tibbles, or databases. It is widely used in data science and statistical analysis workflows.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    RStudio Cheatsheets

    RStudio Cheatsheets

    Curated collection of official cheat sheets for data science tools

    ...It covers topics such as data wrangling, data import, modeling, visualization, RStudio IDE shortcuts, Shiny development, and the tidyverse suite (dplyr, ggplot2, tidyr, purrr). These cheat sheets are widely used by R learners, educators, and practitioners as quick reference tools, and they often ship with RStudio by default or are linked from RStudio’s help/documentation pages.
    Downloads: 1 This Week
    Last Update:
    See Project
  • $300 in Free Credit for Your Google Cloud Projects Icon
    $300 in Free Credit for Your Google Cloud Projects

    Build, test, and explore on Google Cloud with $300 in free credit. No hidden charges. No surprise bills.

    Launch your next project with $300 in free Google Cloud credit—no hidden charges. Test, build, and deploy without risk. Use your credit across the Google Cloud platform to find what works best for your needs. After your credits are used, continue building with free monthly usage products. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 5
    Shiny

    Shiny

    Build interactive web apps directly from R with Shiny framework

    Shiny is an R package from RStudio that enables users to build interactive web applications using R without requiring knowledge of JavaScript, HTML, or CSS. It allows statisticians and data scientists to turn their analyses into fully functional web dashboards with reactive elements, data inputs, visualizations, and controls, making data communication more effective and dynamic.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    plotly

    plotly

    An interactive graphing library for R

    This part of the book teaches you how to leverage the plotly R package to create a variety of interactive graphics. There are two main ways to creating a plotly object: either by transforming a ggplot2 object (via ggplotly()) into a plotly object or by directly initializing a plotly object with plot_ly()/plot_geo()/plot_mapbox(). Both approaches have somewhat complementary strengths and weaknesses, so it can pay off to learn both approaches. Moreover, both approaches are an implementation of...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    R Color Palettes

    R Color Palettes

    Comprehensive list of color palettes available in R

    This repository is a curated collection of color palettes crafted or curated for data visualization in R. The goal is to provide designers, data scientists, and R users with aesthetically pleasing, perceptually consistent color schemes that work well for plots, maps, and graphics. The repo contains static files listing palette definitions (e.g. hex codes, named hues), sample visualizations showing how each palette performs under different contexts (categorical, sequential, diverging), and helper functions/scripts to import or use the palettes in R. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    magrittr

    magrittr

    Improve the readability of R code with the pipe

    magrittr introduces the pipe operator (%>%) and related functional utilities into R. It underlies the powerful piped syntax widely adopted in tidyverse workflows by enabling left-hand argument passing and providing helpers like compound assignment pipes and exposition pipes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    purrr

    purrr

    A functional programming toolkit for R

    purrr enhances R’s functional programming capabilities by providing a consistent set of tools for working with lists and vectors, enabling safer and more expressive iteration compared to base R’s loop functions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    Awesome Network Analysis

    Awesome Network Analysis

    A curated list of awesome network analysis resources

    ...It covers multiple programming languages and domains like sociology, biology, and computer science. This repository serves as a central reference for researchers, analysts, and developers working with network data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    blogdown

    blogdown

    Create Blogs and Websites with R Markdown

    ...Developed by Yihui Xie and team, it provides functions to initialize sites, write posts, manage themes, and deploy with minimal fuss. It seamlessly blends R code chunks and web content, ideal for data storytellers and technical bloggers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Statistical Rethinking 2024

    Statistical Rethinking 2024

    This course teaches data analysis

    The 2024 repository is the most recent version of the course, reflecting ongoing refinements in pedagogy, statistical modeling techniques, and coding practices. It provides updated notebooks, R scripts, and model examples, some streamlined and restructured compared to previous years. The 2024 repo also highlights the transition toward more robust Stan models and integration with newer Bayesian workflow practices, continuing to emphasize accessibility for learners while modernizing the tools....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    ggthemes

    ggthemes

    Additional themes, scales, and geoms for ggplot2

    ...It is often used to make ggplot2 plots adhere to aesthetic styles from famous news outlets, scientific journals, or presentation decks. Additional color scales and palettes for discrete and continuous data to match theme aesthetics. Extensive documentation and examples for each theme / scale so users can see how plots look and tweak them.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Statistical Rethinking 2023

    Statistical Rethinking 2023

    Statistical Rethinking Course for Jan-Mar 2023

    ...It continues to provide scripts for lectures and tutorials, while integrating refinements to examples, notation, and computational workflows introduced that year. Compared with 2022, some models are rewritten for clarity, and teaching materials reflect refinements in McElreath’s evolving presentation of Bayesian data analysis. Students following the 2023 lecture videos use this repository as their coding reference. There are 10 weeks of instruction. Links to lecture recordings will appear in this table. Weekly problem sets are assigned on Fridays and due the next Friday, when we discuss the solutions in the weekly online meeting.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    AI-Agent-Host

    AI-Agent-Host

    The AI Agent Host is a module-based development environment.

    ...Being data-aware involves connecting a language model to other sources of data, enabling a comprehensive understanding and analysis of information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Statistical Rethinking 2022

    Statistical Rethinking 2022

    Statistical Rethinking course winter 2022

    This repository hosts the 2022 version of the Statistical Rethinking course. It contains course materials such as R scripts, notebooks, and worked examples aligned with McElreath’s textbook. The code emphasizes Bayesian data analysis using R, the rethinking package, and Stan models. It includes lecture code files, example datasets, and structured exercises that parallel the topics covered in the lectures (probability, regression, model comparison, Bayesian updating). The repo functions as a direct hands-on reference for students following the 2022 recorded lecture series. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Reproducible-research

    Reproducible-research

    A Reproducible Data Analysis Workflow with R Markdown, Git, Make, etc.

    In this tutorial, we describe a workflow to ensure long-term reproducibility of R-based data analyses. The workflow leverages established tools and practices from software engineering. It combines the benefits of various open-source software tools including R Markdown, Git, Make, and Docker, whose interplay ensures seamless integration of version management, dynamic report generation conforming to various journal styles, and full cross-platform and long-term computational reproducibility. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    benchm-ml

    benchm-ml

    A benchmark of commonly used open source implementations

    This repository is designed to provide a minimal benchmark framework comparing commonly used machine learning libraries in terms of scalability, speed, and classification accuracy. The focus is on binary classification tasks without missing data, where inputs can be numeric or categorical (after one-hot encoding). It targets large scale settings by varying the number of observations (n) up to millions and the number of features (after expansion) to about a thousand, to stress test different implementations. The benchmarks cover algorithms like logistic regression, random forest, gradient boosting, and deep neural networks, and they compare across toolkits such as scikit-learn, R packages, xgboost, H2O, Spark MLlib, etc. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    bbplot

    bbplot

    R package that helps create and export ggplot2 charts

    ...It offers templates and defaults that reduce styling overhead so users can focus on data and storytelling rather than aesthetic minutiae. Because visual consistency is important in media, bbplot helps non-designers build plots that align with professional publication standards. The repository includes documentation, vignettes, example plots, and guidelines for customization (e.g. switching colors, modifying typography).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    DataScienceR

    DataScienceR

    a curated list of R tutorials for Data Science, NLP

    The DataScienceR repository is a curated collection of tutorials, sample code, and project templates for learning data science using the R programming language. It includes an assortment of exercises, sample datasets, and instructional code that cover the core steps of a data science project: data ingestion, cleaning, exploratory analysis, modeling, evaluation, and visualization. Many of the modules demonstrate best practices in R, such as using the tidyverse, R Markdown, modular scripting, and reproducible workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Investing

    Investing

    Investing Returns on the Market as a Whole

    This repository, owned by the user zonination (Zoni Nation), presents a data visualization and analysis project on long-term returns from broad stock market indexes, especially the S&P 500. The author gathers historical price data (adjusted for inflation and dividends) and computes growth trajectories under a “buy and hold” strategy over decades. The key insight illustrated is that over sufficiently long holding periods (e.g. 40 years), the stock market stabilizes and nearly always yields positive returns, even accounting for extreme market crashes and recessions. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    RStan

    RStan

    RStan, the R interface to Stan

    RStan is the R interface to Stan, a C++ library for statistical modeling and high-performance statistical computation. It lets users specify models in the Stan modeling language (for Bayesian inference), compile them, and perform inference from R. Key inference approaches include full Bayesian inference via Hamiltonian Monte Carlo (specifically the No-U-Turn Sampler, NUTS), approximate Bayesian inference via variational methods, and optimization (penalized likelihood). RStan integrates with...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB