Showing 471 open source projects for "data modeling"

  • 1
    Hello SQL

    Spanish-language course repository that teaches fundamentals of SQL

    ...The materials emphasize real-world query writing, schema design basics, and the mental model behind SELECT, JOIN, GROUP BY, and subqueries. Learners progress from setup and connection to hands-on exercises that build confidence with CRUD operations and data modeling. The repository’s structure favors incremental learning, with clear folders, references, and exercises you can run locally. It targets absolute beginners as well as developers from other stacks who want a clean, project-based path into SQL.
    Downloads: 1 This Week
  • 2
    JiT

    PyTorch implementation of JiT

    JiT is an open-source PyTorch implementation of a state-of-the-art image diffusion model designed around a minimalist yet powerful architecture for pixel-level generative modeling, based on the paper Back to Basics: Let Denoising Generative Models Denoise. Rather than predicting noise, JiT models directly predict clean image data, which the research suggests aligns better with the manifold structure of natural images and leads to stronger generative performance at high resolution. This implementation supports training on large datasets like ImageNet with configurable model variants, and practical scripts for setup, training, and evaluation on GPUs are included, leveraging PyTorch’s ecosystem for real-world experimentation. ...
    Downloads: 0 This Week
  • 3
    GLM.jl

    Generalized linear models in Julia

    GLM.jl is a Julia package for fitting linear and generalized linear models (GLMs) with a syntax and functionality familiar to users of R or other statistical environments. It is part of the JuliaStats ecosystem and is tightly integrated with StatsModels.jl for formula handling, and Distributions.jl for specifying error families. The package supports modeling through both formula-based (e.g. @formula) and matrix-based interfaces, allowing both high-level convenience and low-level control....
    Downloads: 0 This Week
  • 4
    Perceval

    An open source framework for programming photonic quantum computers

    An open-source framework for programming photonic quantum computers. Through a simple object-oriented Python API, Perceval provides tools for composing circuits from linear optical components, defining single-photon sources, manipulating Fock states, running simulations, reproducing published experimental papers and experimenting with a new generation of quantum algorithms. It aims to be a companion tool for developing photonic circuits – for simulating and optimizing their design, modeling...
    Downloads: 0 This Week
  • 5
    Tokenize.jl

    Tokenization for Julia source code

    Tokenize is a Julia package with a purpose and API similar to the tokenize module in Python, but for Julia. It takes a string or buffer containing Julia code, performs lexical analysis, and returns a stream of tokens.
    Downloads: 0 This Week
  • 6
    OpenFGA
    ...OpenFGA takes the best ideas from Google's Zanzibar paper for Relationship-Based Access Control, and also solves problems for Role-based Access Control and Attribute-Based Access Control use cases. The modeling language is powerful enough for engineers, but friendly enough for other stakeholders on your team as well.
    Downloads: 1 This Week
  • 7
    Featuretools

    An open source python library for automated feature engineering

    ...Featuretools automatically creates features from temporal and relational datasets. Featuretools uses Deep Feature Synthesis (DFS) for automated feature engineering. You can combine your raw data with what you know about your data to build meaningful features for machine learning and predictive modeling. Featuretools provides APIs to ensure only valid data is used for calculations, keeping your feature vectors safe from common label leakage problems. You can specify prediction times row-by-row. Featuretools comes with a library of low-level functions that can be stacked to create features. ...
    Downloads: 0 This Week
  • 8
    webR

    The statistical language R compiled to WebAssembly via Emscripten

    ...The project includes a web-based integrated development environment that uses tools like xterm.js and CodeMirror, allowing users to interact with R through a familiar console and editor interface. It supports installing and running R packages, making it possible to perform data analysis, statistical modeling, and visualization entirely client-side. webR also provides distribution options such as npm packages, CDN hosting, and Docker images for flexible deployment. While it currently includes a minimal set of compiled libraries, it is designed to expand its ecosystem over time.
    Downloads: 0 This Week
  • 9
    AutoMLPipeline.jl

    Package that makes it trivial to create and evaluate machine learning

    ...To illustrate, here is a pipeline expression and evaluation of a typical machine learning workflow that extracts numerical features (numf) for ica (Independent Component Analysis) and pca (Principal Component Analysis) transformations, concatenated with the one-hot encoding (ohe) of the categorical features (catf) of a given dataset, for rf (Random Forest) modeling.
    Downloads: 0 This Week
  • 10
    BAML

    The AI framework that adds the engineering to prompt engineering

    BAML is an open-source framework and domain-specific language designed to bring structured engineering practices to prompt development for large language model applications. Instead of treating prompts as unstructured text, BAML introduces a schema-driven approach where prompts are defined as typed functions with explicit inputs and outputs. This design allows developers to treat language model interactions as predictable software components rather than ad-hoc prompt strings. The framework...
    Downloads: 0 This Week
  • 11
    StatsForecast

    Fast forecasting with statistical and econometric models

    StatsForecast is a Python library for time-series forecasting that delivers a suite of classical statistical and econometric forecasting models optimized for high performance and scalability. It is designed not just for academic experiments but for production-level time-series forecasting, meaning it handles forecasting for many series at once, efficiently, reliably, and with minimal overhead. The library implements a broad set of models, including AutoARIMA, ETS, CES, Theta, plus a battery...
    Downloads: 0 This Week
  • 12
    AudioCraft

    Audiocraft is a library for audio processing and generation

    ...It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides inference scripts, checkpoints, and simple Python APIs so you can generate clips from prompts or incorporate the models into applications. It also contains training code and recipes, so researchers can fine-tune on custom data or explore new objectives without building infrastructure from scratch. Example notebooks, CLI tools, and audio utilities help with prompt design, conditioning on reference audio, and post-processing to produce ready-to-share outputs.
    Downloads: 10 This Week
  • 13
    MetricFlow

    MetricFlow allows you to define, build, and maintain metrics in code

    MetricFlow is an open-source semantic layer engine designed to help organizations define, manage, and query business metrics in a consistent, governed way. It works alongside a data stack—typically built with dbt—and allows you to express metrics as YAML‐based definitions tied to semantic models and dimension tables, rather than embedding logic ad-hoc across many dashboards or scripts. When a user or tool requests a metric (e.g., “monthly revenue by region”), MetricFlow generates optimized,...
    Downloads: 0 This Week
  • 14
    OpenAI DALL·E AsyncImage SwiftUI

    Swift async text-to-image for SwiftUI apps using OpenAI

    ...In machine learning, diffusion models, also known as diffusion probabilistic models, are a class of latent variable models. They are Markov chains trained using variational inference. The goal of diffusion models is to learn the latent structure of a dataset by modeling the way in which data points diffuse through the latent space.
    Downloads: 0 This Week
  • 15
    R packages (maintained by YJLEE)

    R packages for PK/PD modeling, BE/BA, drug stability, IVIVC, etc.

    These R packages are developed for data analysis in PK/PD modeling and simulation, bioequivalence/bioavailability (BE/BA), drug stability, in vitro-in vivo correlation (IVIVC), and therapeutic drug monitoring (TDM).
    Downloads: 18 This Week
  • 16
    Blender

    Free and open source 3D creation software

    Blender is a free and open-source 3D creation suite used by artists, designers, and developers around the world to create everything from animations to visual effects. It supports the entire 3D pipeline, including modeling, sculpting, animation, rendering, compositing, and video editing. Blender 5.0 introduces major workflow and performance improvements, with enhanced Geometry Nodes and better volumetric data support. Its powerful Cycles render engine delivers high-quality, photorealistic results using both CPU and GPU rendering. Blender is backed by a strong global community and supported by major hardware and industry organizations. ...
    Downloads: 62 This Week
  • 17
    Transformers in Time Series

    A professionally curated list of awesome resources

    Transformers in Time Series is a curated research repository that collects academic papers, code implementations, datasets, and learning resources related to transformer models for time series analysis. The project was created to systematically organize the rapidly growing research field that applies transformer architectures to time series modeling tasks. It compiles literature from major conferences and journals and categorizes them by application domains such as forecasting, anomaly detection, and classification. The repository also provides a taxonomy that helps researchers understand different architectural variations of transformers designed for time series data. These models are particularly important because transformers can capture long-range dependencies in sequential data, which makes them well suited for complex temporal patterns in real-world datasets.
    Downloads: 0 This Week
  • 18
    AV3DNavigator

    A simple 3D navigation program, developed in Java.
    Downloads: 2 This Week
  • 19
    GMAT

    General Mission Analysis Tool

    The General Mission Analysis Tool (GMAT) is an open-source tool for space mission design and navigation. GMAT is developed by a team of NASA, private industry, and public and private contributors. The GMAT development team is pleased to announce the release of GMAT version R2026a. For a complete list of new features, compatibility changes, and bug fixes, see the R2026a Release Notes in the Users Guide.
    Downloads: 1,322 This Week
  • 20
    Hasktorch

    Tensors and neural networks in Haskell

    Hasktorch is a powerful Haskell library for tensor computation and neural network modeling, built on top of libtorch (the backend of PyTorch). It brings differentiable programming, automatic differentiation, and efficient tensor operations into Haskell’s strongly typed functional paradigm. This project is in active development, so expect changes to the library API as it evolves. We would like to invite new users to join our Hasktorch discord space for questions and discussions....
    Downloads: 0 This Week
  • 21
    ATOMKIT
    ATOMKIT aims to provide researchers and engineers with a comprehensive tool for conveniently handling crystal structure data and performing various simulations and analyses. Its key features are: (1) Crystal structure manipulation: Users can perform operations such as rotation, translation, and scaling to adjust or edit the crystal structure as needed. (2) Modeling capabilities: Users can quickly generate crystal structures from input data such as atomic coordinates and unit cell parameters, and further edit and modify them as required...
    Downloads: 22 This Week
  • 22
    Improved Diffusion

    Release for Improved Denoising Diffusion Probabilistic Models

    ...By making this code available, OpenAI provides a foundation for further experimentation and development in generative modeling research.
    Downloads: 2 This Week
  • 23
    Avogadro

    An intuitive molecular editor and visualization tool

    Avogadro is an advanced molecular editor designed for cross-platform use in computational chemistry, molecular modeling, bioinformatics, materials science and related areas. It offers a flexible rendering framework and a powerful plugin architecture.
    Downloads: 820 This Week
  • 24
    The GEM 3D engine is a portable, free, open-source, easy-to-use software development kit that lets you build high-performance 3D graphics applications such as games.
    Downloads: 0 This Week
  • 25
    torchtext

    Data loaders and abstractions for text and NLP

    We recommend Anaconda as a Python package management system. Please refer to pytorch.org for the details of PyTorch installation. LTS versions are distributed through a different channel than the other versioned releases. Alternatively, you might want to use the Moses tokenizer port in SacreMoses (split from NLTK); this requires installing SacreMoses. To build torchtext from source, you need git, CMake, and a C++11 compiler such as g++. When building from source, make sure that you have the same C++...
    Downloads: 0 This Week