Showing 28 open source projects for "data modeling"

View related business solutions
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    NYC Taxi Data

    NYC Taxi Data

    Import public NYC taxi and for-hire vehicle (Uber, Lyft)

    The nyc-taxi-data repository is a rich dataset and exploratory project around New York City taxi trip records. It collects and preprocesses large-scale trip datasets (fares, pickup/dropoff, timestamps, locations, passenger counts) to enable data analysis, modeling, and visualization efforts. The project includes scripts and notebooks for cleaning and filtering the raw data, memory-efficient processing for large CSV/Parquet files, and aggregation workflows (e.g. trips per hour, heatmaps of pickups/dropoffs). ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 2
    RStudio Cheatsheets

    RStudio Cheatsheets

    Curated collection of official cheat sheets for data science tools

    ...It covers topics such as data wrangling, data import, modeling, visualization, RStudio IDE shortcuts, Shiny development, and the tidyverse suite (dplyr, ggplot2, tidyr, purrr). These cheat sheets are widely used by R learners, educators, and practitioners as quick reference tools, and they often ship with RStudio by default or are linked from RStudio’s help/documentation pages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Easy3D

    Easy3D

    Efficient library for processing 3D data

    Easy3D is a lightweight, easy-to-use, and efficient library for processing and rendering 3D data, implemented in C++ with Python bindings. It is designed for tasks such as 3D modeling, geometry processing, and rendering, emphasizing simplicity and efficiency. Easy3D serves as a valuable tool for research, education, and the development of sophisticated 3D applications, providing a solid foundation for handling 3D data.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    XState

    XState

    State machines and statecharts for the modern web

    JavaScript and TypeScript finite state machines and statecharts for the modern web. Statecharts are a formalism for modeling stateful, reactive systems. This is useful for declaratively describing the behavior of your application, from the individual components to the overall application logic. XState is a library for creating, interpreting, and executing finite state machines and statecharts, as well as managing invocations of those machines as actors. The following fundamental computer...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 5
    AtomicServer

    AtomicServer

    An open source headless CMS / real-time database

    Create, share, fetch, and model Atomic Data! AtomicServer is a lightweight, yet powerful CMS / Graph Database. Atomic Data is a modular specification for sharing, modifying, and modeling graph data. It combines the ease of use of JSON, the connectivity of RDF (linked data), and the reliability of type-safety. Atomic Data uses links to connect pieces of data, and therefore makes it easier to connect datasets to each other - even when these datasets exist on separate machines.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Gen.jl

    Gen.jl

    A general-purpose probabilistic programming system

    ...Neural network inference is fast, but can be inaccurate on out-of-distribution data, and requires expensive training.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Cesium

    Cesium

    An open-source JavaScript library for world-class 3D globes and maps

    CesiumJS is an open source JavaScript library for creating world-class 3D globes and maps with the best possible performance, precision, visual quality, and ease of use. Developers across industries, from aerospace to smart cities to drones, use CesiumJS to create interactive web apps for sharing dynamic geospatial data. Built on open formats, CesiumJS is designed for robust interoperability and scaling for massive datasets. CesiumJS is released under the Apache 2.0 license and is free for...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Stock prediction deep neural learning

    Stock prediction deep neural learning

    Predicting stock prices using a TensorFlow LSTM

    Predicting stock prices can be a challenging task as it often does not follow any specific pattern. However, deep neural learning can be used to identify patterns through machine learning. One of the most effective techniques for series forecasting is using LSTM (long short-term memory) networks, which are a type of recurrent neural network (RNN) capable of remembering information over a long period of time. This makes them extremely useful for predicting stock prices. Predicting stock...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Complete Node Bootcamp

    Complete Node Bootcamp

    Starter files, final projects and FAQ for my Complete Node.js Bootcamp

    ...It contains starter files, finished project files, and course support material for building backend applications with JavaScript. The repository is centered on practical server-side development, including Node.js fundamentals, Express APIs, MongoDB data modeling, authentication, security, payments, deployment, and real-world backend architecture. Learners can use the starter files to follow the lessons and compare their code with the final versions when something breaks. It also includes slides and FAQ-style guidance to make the course easier to navigate. The project is best understood as a hands-on educational workspace for learning production-minded Node.js development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Auth0 B2B Essentials: SSO, MFA, and RBAC Built In Icon
    Auth0 B2B Essentials: SSO, MFA, and RBAC Built In

    Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

    Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.
    Sign Up Free
  • 10
    GLM.jl

    GLM.jl

    Generalized linear models in Julia

    GLM.jl is a Julia package for fitting linear and generalized linear models (GLMs) with a syntax and functionality familiar to users of R or other statistical environments. It is part of the JuliaStats ecosystem and is tightly integrated with StatsModels.jl for formula handling, and Distributions.jl for specifying error families. The package supports modeling through both formula-based (e.g. @formula) and matrix-based interfaces, allowing both high-level convenience and low-level control....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Tokenize.jl

    Tokenize.jl

    Tokenization for Julia source code

    Tokenize is a Julia package that serves a similar purpose and API as the tokenize module in Python but for Julia. This is to take a string or buffer containing Julia code, perform lexical analysis and return a stream of tokens.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    AudioCraft

    AudioCraft

    Audiocraft is a library for audio processing and generation

    ...It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides inference scripts, checkpoints, and simple Python APIs so you can generate clips from prompts or incorporate the models into applications. It also contains training code and recipes, so researchers can fine-tune on custom data or explore new objectives without building infrastructure from scratch. Example notebooks, CLI tools, and audio utilities help with prompt design, conditioning on reference audio, and post-processing to produce ready-to-share outputs.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 13
    Hasktorch

    Hasktorch

    Tensors and neural networks in Haskell

    Hasktorch is a powerful Haskell library for tensor computation and neural network modeling, built on top of libtorch (the backend of PyTorch). It brings differentiable programming, automatic differentiation, and efficient tensor operations into Haskell’s strongly typed functional paradigm. This project is in active development, so expect changes to the library API as it evolves. We would like to invite new users to join our Hasktorch discord space for questions and discussions....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Statistical Rethinking 2024

    Statistical Rethinking 2024

    This course teaches data analysis

    The 2024 repository is the most recent version of the course, reflecting ongoing refinements in pedagogy, statistical modeling techniques, and coding practices. It provides updated notebooks, R scripts, and model examples, some streamlined and restructured compared to previous years. The 2024 repo also highlights the transition toward more robust Stan models and integration with newer Bayesian workflow practices, continuing to emphasize accessibility for learners while modernizing the tools....
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Fairseq

    Fairseq

    Facebook AI Research Sequence-to-Sequence Toolkit written in Python

    Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling and other text generation tasks. We provide reference implementations of various sequence modeling papers. Recent work by Microsoft and Google has shown that data parallel training can be made significantly more efficient by sharding the model parameters and optimizer state across data parallel workers. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Statistical Rethinking 2022

    Statistical Rethinking 2022

    Statistical Rethinking course winter 2022

    This repository hosts the 2022 version of the Statistical Rethinking course. It contains course materials such as R scripts, notebooks, and worked examples aligned with McElreath’s textbook. The code emphasizes Bayesian data analysis using R, the rethinking package, and Stan models. It includes lecture code files, example datasets, and structured exercises that parallel the topics covered in the lectures (probability, regression, model comparison, Bayesian updating). The repo functions as a...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    BMC

    BMC

    Notes on Scientific Computing for Biomechanics

    This repository is a collection of lecture notes and code on scientific computing and data analysis for Biomechanics and Motor Control.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    TenorSpace.js

    TenorSpace.js

    Neural network 3D visualization framework

    TensorSpace is a neural network 3D visualization framework built using TensorFlow.js, Three.js and Tween.js. TensorSpace provides Keras-like APIs to build deep learning layers, load pre-trained models, and generate a 3D visualization in the browser. From TensorSpace, it is intuitive to learn what the model structure is, how the model is trained and how the model predicts the results based on the intermediate information. After preprocessing the model, TensorSpace supports the visualization...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    data-science-ipython-notebooks

    data-science-ipython-notebooks

    Data science Python notebooks: Deep learning

    Data Science IPython Notebooks is a broad, curated set of Jupyter notebooks covering Python, data wrangling, visualization, machine learning, deep learning, and big data tools. It aims to be a practical map of the ecosystem, showing hands-on examples with libraries such as NumPy, pandas, matplotlib, scikit-learn, and others. Many notebooks introduce concepts step by step, then apply them to real datasets so readers can see techniques in action. Advanced sections touch on neural networks and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    tcomb

    tcomb

    Type checking and DDD for JavaScript

    ...Its main value is bringing lightweight, expressive runtime type modeling to JavaScript applications.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 21
    Incanter

    Incanter

    Clojure-based, R-like statistical computing and graphics environment

    Incanter is a Clojure-based, R-like statistical computing and visualization library running on the JVM. It integrates core numerical libraries like Parallel Colt and JFreeChart to deliver data manipulation, modeling, statistical tests, and charting in a REPL-friendly environment. Start by visiting the Incanter website for an overview, check out the documentation page for a listing of HOW-TOs and examples, and then download either an Incanter executable or a pre-built version of the latest build of Incanter, which includes all the necessary dependencies, and unpack the file (if you would like to build it from source, read Building Incanter). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    DataScienceR

    DataScienceR

    a curated list of R tutorials for Data Science, NLP

    The DataScienceR repository is a curated collection of tutorials, sample code, and project templates for learning data science using the R programming language. It includes an assortment of exercises, sample datasets, and instructional code that cover the core steps of a data science project: data ingestion, cleaning, exploratory analysis, modeling, evaluation, and visualization. Many of the modules demonstrate best practices in R, such as using the tidyverse, R Markdown, modular scripting, and reproducible workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    ImatiSTL

    Fix STL files for 3D printing

    ImatiSTL is a C++ library for applications that need to finely post-process raw polygon meshes represented by STL files. It is particularly useful in 3D printing scenarios: the repairing functions provided by ImatiSTL bridge the gap between printable meshes and real-world meshes with diverse potential defects and flaws (e.g. degenerate triangles, self-intersections, surface holes, ...). ImatiSTL provides both functions for local mesh repairing and methods for global remeshing. Local...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    torch-rnn

    torch-rnn

    Efficient, reusable RNNs and LSTMs for torch

    The torch-rnn project is a lightweight and efficient implementation of recurrent neural networks built on the Torch framework, focusing on flexibility and reusability for sequence modeling tasks. It provides implementations of standard RNNs and long short-term memory networks, enabling users to train models for tasks such as text generation, language modeling, and sequence prediction. The repository emphasizes simplicity and performance, offering a streamlined pipeline for preprocessing data, training models, and sampling outputs. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Importer library to import assets from different common 3D file formats such as Collada, Blend, Obj, X, 3DS, LWO, MD5, MD2, MD3, MDL, MS3D and a lot of other formats. The data is stored in an own in-memory data-format, which can be easily processed. www.open3mod.com/ is a 3D model viewer and exporter based on Assimp that is also Open Source.
    Downloads: 26 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB