Search Results for "data modeling" - Page 10

Showing 471 open source projects for "data modeling"

  • 1
    MLOps Course

    Learn how to design, develop, deploy and iterate on ML apps

    The MLOps Course by Goku Mohandas is an open-source curriculum that teaches how to combine machine learning with solid software engineering to build production-grade ML applications. It is structured around the full lifecycle: data pipelines, modeling, experiment tracking, deployment, testing, monitoring, and iteration. The repository itself contains configuration, code examples, and links to accompanying lessons hosted on the Made With ML site, which provide detailed narrative explanations and diagrams. Instead of focusing only on model training, the course emphasizes best practices like modular code design, CI/CD, containerization, reproducibility, and responsible ML (including monitoring and feedback loops). ...
    Downloads: 0 This Week
  • 2
    BMC

    Notes on Scientific Computing for Biomechanics

    This repository is a collection of lecture notes and code on scientific computing and data analysis for Biomechanics and Motor Control.
    Downloads: 0 This Week
  • 3
    XLM (Cross-lingual Language Model)

    PyTorch original implementation of Cross-lingual Language Model

    XLM (Cross-lingual Language Model) is a family of multilingual pretraining methods that align representations across languages to enable strong zero-shot transfer. It popularized objectives like Masked Language Modeling (MLM) across many languages and Translation Language Modeling (TLM) that jointly trains on parallel sentence pairs to tighten cross-lingual alignment. Using a shared subword vocabulary, XLM learns language-agnostic features that work well for classification and sequence...
    Downloads: 0 This Week
  • 4
    Stats With Julia Book

    Collection of runnable Julia code examples for a statistics book

    ...Readers can explore how Julia supports statistical modeling, simulation, and computational methods in data science workflows. The included initialization script simplifies package setup, ensuring that learners can focus on running and modifying the code examples. This project bridges the gap between textbook learning and hands-on coding, making it a valuable educational tool for students, researchers, and practitioners.
    Downloads: 6 This Week
  • 5
    RESTful API Node Server Boilerplate

    A boilerplate for building production-ready RESTful APIs using Node.js

    A boilerplate/starter project for quickly building RESTful APIs using Node.js, Express, and Mongoose. By running a single command, you will get a production-ready Node.js app installed and fully configured on your machine. The app comes with many built-in features, such as authentication using JWT, request validation, unit and integration tests, continuous integration, docker support, API documentation, pagination, etc. The app has a utility ApiError class to which you can attach a response...
    Downloads: 0 This Week
  • 6
    covid19model

    Code for modelling estimated deaths and cases for COVID19

    Code for modeling estimated deaths and infections for COVID-19 from "Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe", Flaxman, Mishra, Gandy et al, Nature, 2020, the published version of our original Report 13. This is the release related to our Tiers paper, where we use the latent factor model to estimate the effectiveness of tiers systems in England. Peer-reviewed version is to be out soon. All other code is still the same for previous releases. The code...
    Downloads: 0 This Week
  • 7
    Objectron

    A dataset of short, object-centric video clips

    The Objectron dataset is a collection of short, object-centric video clips, which are accompanied by AR session metadata that includes camera poses, sparse point-clouds and characterization of the planar surfaces in the surrounding environment. In each video, the camera moves around the object, capturing it from different angles. The data also contain manually annotated 3D bounding boxes for each object, which describe the object’s position, orientation, and dimensions. The dataset consists...
    Downloads: 0 This Week
  • 8
    Pipeline for training Language Models

    Pipeline for training Language Models using PyTorch.

    Pipeline for training Language Models using PyTorch. Inspired by Yandex Data School NLP Course (week 03: Language Modeling). Input is a prepared text file with space-separated words on each line.
    Downloads: 0 This Week
  • 9
    This data set contains data for virtual experiments for an undergraduate course from the Pontificia Universidad Católica del Perú. It constitutes the Supporting Information for a scientific article entitled "Application of blended physical-virtual experiments and flipped instruction in a fundamental fluid mechanics course: a design-based research study" submitted to Computer Applications in Engineering Education. The educational material contained in this dataset is shared under the GNU...
    Downloads: 0 This Week
  • 10
    Prisma 1

    Database Tools incl. ORM, migrations and admin UI

    ...Prisma replaces traditional ORMs and simplifies database workflows. Access: type-safe database access with the auto-generated Prisma client (in JavaScript, TypeScript, Go). Migrate: declarative data modeling and migrations (optional). Manage: visual data management with Prisma Admin. It is used to build GraphQL, REST, and gRPC APIs and more. Prisma currently supports MySQL, PostgreSQL, and MongoDB. Prisma is a great fit for building REST and gRPC APIs, where it can be used in place of traditional ORMs. It provides benefits such as type safety, a modern API, and flexible ways of reading and writing relational data.
    Downloads: 0 This Week
  • 11
    OpenAI Glow

    Code for reproducing results in "Glow: Generative Flow with Invertible 1x1 Convolutions"

    Glow is an open source generative model released by OpenAI that demonstrates flow-based generative modeling techniques. Unlike models that rely on approximate inference, Glow uses invertible transformations to directly learn the data distribution, allowing for exact likelihood computation and efficient sampling. The model is capable of producing high-quality synthetic images while maintaining interpretable latent spaces that enable meaningful manipulation of generated outputs. ...
    Downloads: 1 This Week
  • 12
    Image GPT

    Large-scale autoregressive pixel model for image generation by OpenAI

    Image-GPT is the official research code and models from OpenAI’s paper Generative Pretraining from Pixels. The project adapts GPT-2 to the image domain, showing that the same transformer architecture can model sequences of pixels without altering its fundamental structure. It provides scripts to download pretrained checkpoints of different model sizes (small, medium, large) trained on large-scale datasets and includes utilities for handling color quantization with a 9-bit palette....
    Downloads: 8 This Week
  • 13
    Makani

    Makani developed a commercial-scale airborne wind turbine

    Makani was an ambitious Google X project that sought to harness wind energy using airborne wind turbines — autonomous kites capable of generating power while flying in crosswind patterns. This open-source repository contains the complete software stack that powered Makani’s research and flight systems, including the flight simulator, autopilot controller, avionics firmware, visualization tools, and ground control software. The software enables simulation, control, and analysis of the Makani...
    Downloads: 0 This Week
  • 14
    LIFETIMES

    Lifetime value in Python

    LIFETIMES is a Python library for customer lifetime value and repeat purchase behavior modeling. It helps analysts estimate how frequently customers may return, how long they may remain active, and how much value they may generate over time. The library is built around probabilistic models commonly used in customer analytics, including transaction frequency and monetary value modeling. It is useful for ecommerce, subscription-adjacent businesses, retail analytics, and retention analysis. The...
    Downloads: 0 This Week
  • 15
    EliteQuant

    A list of online resources for quantitative modeling, trading, etc.

    EliteQuant is a curated directory of online resources for quantitative finance: trading, portfolio management, quantitative modeling, data sources, libraries, platforms, and communities. It is not a software library per se, but a “list of things” - i.e., an aggregator of open source projects, blogs, tools etc., intended to help practitioners find useful resources. It is licensed under Apache-2.0, and maintained by volunteers. A list of online resources for quantitative modeling, trading, and portfolio management. ...
    Downloads: 0 This Week
  • 16
    We are developing data standards and software tools that implement these standards to develop a systemic approach to modeling, capturing, analyzing and disseminating flow cytometry data.
    Downloads: 3 This Week
  • 17
    Quantitative-Notebooks

    Educational notebooks on quantitative finance, algorithmic trading

    ...Because quantitative analysis often requires visualization, statistics, and time series processing, these notebooks also serve as templates for real financial research and strategy prototyping. Users can adapt the examples to their own data sources, financial instruments, and modeling techniques.
    Downloads: 0 This Week
  • 18
    The Object-Role Modeling (ORM) standard version 2, associated schemas and generation tools, and a reference implementation in the form of the Natural Object-Role Modeling Architect for Visual Studio (NORMA) product.
    Downloads: 0 This Week
  • 19
    Albedo

    A recommender system for discovering GitHub repos

    Albedo is an open-source recommender system aimed at helping developers discover GitHub repositories by learning from activity signals. It treats repositories and developers as a graph of interactions and applies large-scale matrix factorization to model affinities, with Apache Spark providing the distributed data processing. The project focuses on implicit feedback—stars, watches, and other engagement metrics—so it can build useful recommendations without explicit ratings. A reproducible...
    Downloads: 0 This Week
  • 20
    The latest ESMF development is happening on GitHub (https://github.com/esmf-org/esmf); see also https://earthsystemmodeling.org. The Earth System Modeling Framework provides high-performance software infrastructure and superstructure for the construction and coupling of climate, weather, and data assimilation applications.
    Downloads: 0 This Week
  • 21

    Newsvendor Model Simulation Spreadsheet

    Excel Spreadsheet Model for Single Period Inventory Problems

    The spreadsheet (Excel) of a single-period inventory model with stochastic demand can be used as a simulation tool for engineering education or Decision Support System. Based on spreadsheet techniques and examples described in the following sources: Albright S. C., & Winston W. L. (2005). Spreadsheet modeling and applications: essentials of practical management science, South-Western Pub. Albright, S. C. W. C., Winston, W., & Zappe, C. (2010). Data analysis and decision making. Cengage Learning. Hill, A. V. (2011). The newsvendor problem. White Paper, 57-23. Lawrence, J. A., & Pasternack, B. A. (2002). Applied management science. ...
    Downloads: 5 This Week
  • 22

    PBTK Optimizer

    Application for optimization of parameters in PBTK models

    ...Other parameters can be determined through in-vitro experiments or through extrapolation using published equations. When it is impractical to use these methods to estimate a parameter, techniques can be used to optimize parameters so that model results best fit validation data. This tool was designed to optimize a user-specified list of parameters to a user-specified PBTK model. The user also controls validation data and optimization algorithms. In addition to optimized parameters, the tool outputs statistical information about the fit of the optimized model.
    Downloads: 0 This Week
  • 23
    swirl

    Learn R, in R

    swirl is an R package that allows interactive, in-R learning of statistics, data science, R programming etc. The idea is that you load swirl in R, and it presents you with lessons (within R’s console or RStudio) that ask you to type commands, check results, and progress through tutorial material—without leaving the R environment. It is used for teaching R, especially for beginners, as well as for self-paced learning of packages, data manipulation, visualization, etc. Lessons and content are...
    Downloads: 0 This Week
  • 24
    YouTube-8M

    Starter code for working with the YouTube-8M dataset

    youtube-8m is Google’s open source starter code and reference implementation for training and evaluating machine learning models on the YouTube-8M dataset, one of the largest video understanding datasets publicly released. The repository provides a complete pipeline for video-level and frame-level modeling using TensorFlow, including data reading, model training, evaluation, and inference. It was developed to support the YouTube-8M Video Understanding Challenge (hosted on Kaggle and featured at ICCV 2019), enabling researchers and practitioners to benchmark video classification models on a large-scale dataset with millions of labeled videos. ...
    Downloads: 1 This Week
  • 25
    Scikit-learn Tutorial

    An introductory tutorial for scikit-learn

    Scikit-learn Tutorial contains the materials for Jake VanderPlas’s introductory scikit-learn tutorial, originally used at major Python conferences. It provides a collection of notebooks that walk attendees from basic machine-learning concepts into practical modeling using the scikit-learn library. The tutorial covers data preparation, model fitting, evaluation, and common algorithms such as classification, regression, clustering, and dimensionality reduction. It is designed for people who already have a working Python environment and some familiarity with NumPy, SciPy, and Matplotlib. The repository specifies a clear list of dependencies so that participants can reproduce the environment used in the tutorial, and many downstream forks keep the content updated for newer versions of scikit-learn. ...
    Downloads: 0 This Week