Showing 471 open source projects for "data modeling"

View related business solutions
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 1
    BuildingAI

    BuildingAI

    Build your own AI application system for free

    BuildingAI is an open-source project focused on applying artificial intelligence techniques to architectural design and building information modeling workflows. The platform aims to bridge the gap between natural language interfaces and building design tools by allowing AI systems to interpret user instructions and convert them into structured architectural operations. By combining generative AI capabilities with building data models, the system can assist with tasks such as design generation, spatial reasoning, and building component creation. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Glamorous Toolkit

    Glamorous Toolkit

    Glamorous Toolkit is the Moldable Development environment

    Programming, exploring data, browsing APIs, knowledge management, log investigations, domain modeling are all part of the same continuum. They require dedicated tools, but those tools can come to you in an integrated experience that is specific to your context. This is the essence of Moldable Development. And this is what Glamorous Toolkit makes practical. Glamorous Toolkit is the Moldable Development environment.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    brms

    brms

    brms R package for Bayesian generalized multivariate models using Stan

    brms is an R package by Paul Bürkner which provides a high-level interface for fitting Bayesian multilevel (i.e. mixed effects) models, generalized linear / non-linear / multivariate models using Stan as the backend. It allows R users to specify complex Bayesian models using formula syntax similar to lme4 but with far more flexibility (distributions, link functions, hierarchical structure, nonlinear terms, etc.). It supports model diagnostics, posterior predictive checking, model comparison,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Catalyst.jl

    Catalyst.jl

    Chemical reaction network and systems biology interface

    Catalyst.jl is a symbolic modeling package for analysis and high-performance simulation of chemical reaction networks. Catalyst defines symbolic ReactionSystems, which can be created programmatically or easily specified using Catalyst's domain-specific language (DSL). Leveraging ModelingToolkit and Symbolics.jl, Catalyst enables large-scale simulations through auto-vectorization and parallelism. Symbolic ReactionSystems can be used to generate ModelingToolkit-based models, allowing the easy...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 5
    PySINDy

    PySINDy

    A package for the sparse identification of nonlinear dynamical systems

    PySINDy is a Python library that implements the Sparse Identification of Nonlinear Dynamics (SINDy) method for discovering mathematical models of dynamical systems from data. The framework focuses on identifying governing equations that describe the behavior of complex physical systems by selecting sparse combinations of candidate functions. Instead of fitting a purely predictive machine learning model, PySINDy attempts to recover interpretable differential equations that explain how a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    EconML

    EconML

    Python Package for ML-Based Heterogeneous Treatment Effects Estimation

    EconML is a Python package for estimating heterogeneous treatment effects from observational data via machine learning. This package was designed and built as part of the ALICE project at Microsoft Research with the goal of combining state-of-the-art machine learning techniques with econometrics to bring automation to complex causal inference problems. One of the biggest promises of machine learning is to automate decision-making in a multitude of domains. At the core of many data-driven...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Machine Learning Study

    Machine Learning Study

    This repository is for helping those interested in machine learning

    ...It often demonstrates how to implement algorithms using widely used libraries such as NumPy, pandas, scikit-learn, and TensorFlow. Many examples include dataset preparation, visualization of results, and experimentation with different modeling approaches.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    NBA Sports Betting Machine Learning

    NBA Sports Betting Machine Learning

    NBA sports betting using machine learning

    NBA-Machine-Learning-Sports-Betting is an open-source Python project that applies machine learning techniques to predict outcomes of National Basketball Association games for analytical and betting-related research. The system gathers historical team statistics and game data spanning multiple seasons, beginning with the 2007–2008 NBA season and continuing through the present. Using this dataset, the project constructs matchup features that represent team performance trends and contextual...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    CBIG

    CBIG

    Computational Brain Imaging Group tools

    CBIG is a comprehensive toolkit maintained by Thomas Yeo’s Computational Brain Imaging Group containing tools for processing and analyzing neuroimaging data—including fMRI preprocessing pipelines, brain parcellation algorithms, mental disorder subtyping models, fMRI dynamic models, registrations between brain spaces, and phenotypic prediction algorithms. After cloning/downloading this repository, please see README inside setup directory to see instructions on how to set up your local...
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    Perceval

    Perceval

    An open source framework for programming photonic quantum computers

    An open-source framework for programming photonic quantum computers. Through a simple object-oriented Python API, Perceval provides tools for composing circuits from linear optical components, defining single-photon sources, manipulating Fock states, running simulations, reproducing published experimental papers and experimenting with a new generation of quantum algorithms. It aims to be a companion tool for developing photonic circuits – for simulating and optimizing their design, modeling...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    Skytable

    Skytable

    Skytable is a fast, secure and reliable realtime NoSQL database

    SkytableTM is an insanely fast, free, and open-source, real-time NoSQL database that aims to provide flexible data modeling without compromising on performance or query ability, at scale. Skytable has got exciting features which are ready to deploy and more amazing features on the way. Scale to millions of queries per second per node with no optimizations left on the table. Automated background saving, snapshots and remote snapshots are there when you need them.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    S2 Geometry

    S2 Geometry

    Computational geometry and spatial indexing on the sphere

    s2geometry is Google’s open source geometry library designed for representing, analyzing, and manipulating geometric shapes on a sphere rather than a flat plane. This makes it particularly suited for applications involving geospatial data, such as mapping, spatial indexing, and geographic information systems (GIS). The library provides a robust mathematical framework for spherical geometry, allowing developers to work with polygons, points, and regions on the Earth’s surface using consistent and precise algorithms. Unlike traditional 2D geometry libraries, S2 ensures accuracy over large scales by modeling the globe directly, avoiding distortions caused by map projections. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    UNO

    UNO

    A Universal Customization Method for Single and Multi Conditioning

    UNO is a project by ByteDance introduced in 2025, titled “A Universal Customization Method for Both Single and Multi-Subject Conditioning.” It suggests a framework for image (or more general generative) modeling where the model can be conditioned either on a single subject or multiple subjects — which may correspond to generating or customizing images featuring specific people, styles, or objects, possibly with fine-grained control over subject identity or composition. Because the project is...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    mlforecast

    mlforecast

    Scalable machine learning for time series forecasting

    mlforecast is a time-series forecasting framework built around machine-learning models, designed to make forecasting both efficient and scalable. It lets you apply any regressor that follows the typical scikit-learn API, for example, gradient-boosted trees or linear models, to time-series data by automating much of the messy feature engineering and data preparation. Instead of writing custom code to build lagged features, rolling statistics, and date-based predictors, mlforecast generates...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Claude for Financial Services

    Claude for Financial Services

    Reference agents, skills, and data for the financial-services

    ...It supports deployment either as Claude Cowork plugins or through the Claude Managed Agents API, allowing organizations to integrate the same logic into internal systems and automation pipelines. The repository includes tools for competitive analysis, financial modeling, market research, data-pack generation, and strategic synthesis. Its architecture emphasizes modularity, enabling firms to customize workflows and extend functionality for proprietary use cases. Overall, the project serves as a foundation for building AI-enhanced financial research and decision-support systems.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Kaldi

    Kaldi

    kaldi-asr/kaldi is the official location of the Kaldi project

    ...Kaldi is designed for researchers who need a highly customizable environment to experiment with new algorithms, as well as for practitioners who want robust, production-ready ASR pipelines. It includes extensive tools for data preparation, feature extraction, acoustic and language modeling, decoding, and evaluation. With its modular design, Kaldi allows users to adapt the system to a wide range of languages and domains. As one of the most influential projects in speech recognition, it has become a foundation for much of the modern work in ASR.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    Complete Node Bootcamp

    Complete Node Bootcamp

    Starter files, final projects and FAQ for my Complete Node.js Bootcamp

    ...It contains starter files, finished project files, and course support material for building backend applications with JavaScript. The repository is centered on practical server-side development, including Node.js fundamentals, Express APIs, MongoDB data modeling, authentication, security, payments, deployment, and real-world backend architecture. Learners can use the starter files to follow the lessons and compare their code with the final versions when something breaks. It also includes slides and FAQ-style guidance to make the course easier to navigate. The project is best understood as a hands-on educational workspace for learning production-minded Node.js development.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    NVIDIA PhysicsNeMo

    NVIDIA PhysicsNeMo

    Open-source deep-learning framework for building and training

    ...The framework focuses on the emerging field of physics-informed machine learning, where neural networks are used alongside physical equations to model complex scientific systems. PhysicsNeMo provides modular Python components that allow developers to create scalable training and inference pipelines for models that combine data-driven learning with physics-based constraints. It is built on top of the PyTorch ecosystem and integrates with GPU-accelerated computing environments to handle computationally demanding simulations and datasets. The framework supports a wide range of scientific applications, including computational fluid dynamics, climate modeling, weather prediction, and engineering simulations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Remult

    Remult

    Full-stack CRUD, simplified, with SSOT TypeScript entities

    Remult is a full-stack CRUD framework for building type-safe web applications using a single shared TypeScript model. It automatically exposes backend APIs based on your entities and provides real-time synchronization, role-based access control, and deep integration with front-end frameworks like React, Angular, and Vue. Remult simplifies full-stack development by unifying API and model definitions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Stock prediction deep neural learning

    Stock prediction deep neural learning

    Predicting stock prices using a TensorFlow LSTM

    Predicting stock prices can be a challenging task as it often does not follow any specific pattern. However, deep neural learning can be used to identify patterns through machine learning. One of the most effective techniques for series forecasting is using LSTM (long short-term memory) networks, which are a type of recurrent neural network (RNN) capable of remembering information over a long period of time. This makes them extremely useful for predicting stock prices. Predicting stock...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Large Concept Model

    Large Concept Model

    Language modeling in a sentence representation space

    Large Concept Model is a research codebase centered on concept-centric representation learning at scale, aiming to capture shared structure across many categories and modalities. It organizes training around concepts (rather than just raw labels), encouraging models to understand attributes, relations, and compositional structure that transfer across tasks. The repository provides training loops, data tooling, and evaluation routines to learn and probe these concept embeddings, typically...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Kalshi Trading Bot CLI

    Kalshi Trading Bot CLI

    AI-native CLI for trading Kalshi prediction markets

    Kalshi Trading Bot CLI is an AI-driven command-line tool designed to automate trading strategies on Kalshi prediction markets by combining quantitative modeling with real-time market data. It operates by conducting deep research on events, generating independent probability estimates, and comparing those estimates against current market prices to identify trading opportunities. The system incorporates advanced decision-making logic, including Kelly criterion-based position sizing and a structured multi-step risk evaluation process before executing trades. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 23
    statsmodels

    statsmodels

    Statsmodels, statistical modeling and econometrics in Python

    statsmodels is a Python module that provides classes and functions for the estimation of many different statistical models, as well as for conducting statistical tests, and statistical data exploration. An extensive list of result statistics are available for each estimator. The results are tested against existing statistical packages to ensure that they are correct. The package is released under the open source Modified BSD (3-clause) license. Generalized linear models with support for all...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    Sanity

    Sanity

    Rapidly configure content workspaces powered by structured content

    Sanity is an open-source real-time headless content management system that allows developers to manage structured content for websites, applications, and digital platforms. At the core of the system is Sanity Studio, a customizable editing environment built with React that can be configured to match the workflows and content models of different teams. Instead of using predefined content templates, Sanity allows developers to define schemas in code that determine how content is structured and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    deepjazz

    deepjazz

    Deep learning driven jazz generation using Keras & Theano

    deepjazz is a deep learning project that generates jazz music using recurrent neural networks trained on MIDI files. The repository demonstrates how machine learning can learn musical structure and produce original compositions. It uses the Keras and Theano libraries to build a two-layer Long Short-Term Memory network capable of learning temporal patterns in music. The system analyzes musical sequences from an input MIDI file and then generates new musical notes that follow similar stylistic...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB