Showing 673 open source projects for "data modeling"

  • 1
    atk4/data

    Data Access PHP Framework for SQL & high-latency databases

    ATK Data is a data persistence and modeling framework for PHP, developed as part of the Agile Toolkit. It provides a high-level abstraction for working with databases, making it easier to define and manipulate data models with minimal boilerplate code. It supports various SQL and NoSQL databases and integrates seamlessly with Agile UI and other PHP frameworks.
    Downloads: 0 This Week
  • 2
    NYC Taxi Data

    Import public NYC taxi and for-hire vehicle (Uber, Lyft)

    The nyc-taxi-data repository is a rich dataset and exploratory project around New York City taxi trip records. It collects and preprocesses large-scale trip datasets (fares, pickup/dropoff, timestamps, locations, passenger counts) to enable data analysis, modeling, and visualization efforts. The project includes scripts and notebooks for cleaning and filtering the raw data, memory-efficient processing for large CSV/Parquet files, and aggregation workflows (e.g. trips per hour, heatmaps of pickups/dropoffs). ...
    Downloads: 4 This Week
  • 3
    YALMIP

    MATLAB toolbox for optimization modeling
    Downloads: 5 This Week
  • 4
    Orange Data Mining

    Orange: Interactive data analysis

    Open source machine learning and data visualization. Build data analysis workflows visually, with a large, diverse toolbox. Perform simple data analysis with clever data visualization. Explore statistical distributions, box plots and scatter plots, or dive deeper with decision trees, hierarchical clustering, heatmaps, MDS and linear projections. Even your multidimensional data can become sensible in 2D, especially with clever attribute ranking and selections. Interactive data exploration for...
    Downloads: 34 This Week
  • 5
    Synthetic Data Vault (SDV)

    Synthetic Data Generation for tabular, relational and time series data

    ...Additionally, it enables the testing of Machine Learning or other data dependent software systems without the risk of exposure that comes with data disclosure. Underneath the hood it uses several probabilistic graphical modeling and deep learning based techniques. To enable a variety of data storage structures, we employ unique hierarchical generative modeling and recursive sampling techniques.
    Downloads: 1 This Week
  • 6
    Python for Data Analysis

    Materials and IPython notebooks for "Python for Data Analysis"

    Python for Data Analysis is the official companion repository for Python for Data Analysis, 3rd Edition by Wes McKinney. It contains the datasets, examples, and IPython notebooks used throughout the book. The repository helps readers practice Python data analysis concepts directly in Jupyter Notebook. Its chapters cover Python basics, NumPy, pandas, data loading, cleaning, wrangling, visualization, time series, modeling libraries, and full analysis examples. ...
    Downloads: 1 This Week
  • 7
    Dynamoose

    Dynamoose is a modeling tool for Amazon's DynamoDB

    Dynamoose is a modeling tool for Amazon's DynamoDB, providing a simple and schema-based solution to interact with DynamoDB tables in Node.js applications.
    Downloads: 0 This Week
  • 8
    Agentic Data Scientist

    An end-to-end Data Scientist

    Agentic Data Scientist is an experimental AI-driven research framework that orchestrates data science workflows through autonomous agents that can reason, plan, and execute complex analytics tasks. Unlike traditional scripted pipelines, this project lets AI agents break down high-level research goals into sub-tasks such as data acquisition, cleaning, modeling, evaluation, and reporting, with minimal human direction.
    Downloads: 0 This Week
  • 9
    pgModeler

    Open-source data modeling tool designed for PostgreSQL

    pgModeler is an open-source data modeling tool for PostgreSQL, enabling visual creation and management of database schemas. It supports reverse engineering from existing databases, model validation, and SQL export, providing a full-featured GUI for database design. pgModeler is suited for developers, DBAs, and analysts who want to streamline schema creation and documentation.
    Downloads: 50 This Week
  • 10
    PowerSimulationsDynamics.jl

    Julia package to run Dynamic Power System simulations

    PowerSimulationsDynamics.jl is a Julia package for modeling and simulating power system dynamics.
    Downloads: 0 This Week
  • 11
    PowerSimulations.jl

    Julia for optimization simulation and modeling of PowerSystems

    PowerSimulations.jl is a Julia package for modeling and simulating power system operations. It aims to provide a flexible modeling framework that can accommodate problems of different complexity and at different time scales, to streamline the construction of large-scale optimization problems so that work is not repeated when model details are added or modified, and to exploit Julia's capabilities to improve the computational performance of large-scale quasi-static power system simulations. The flexible...
    Downloads: 0 This Week
  • 12
    Perfect Roadmap To Learn Data Science

    Basic To Intermediate Python data science guide

    ...What makes it particularly valuable is its holistic nature: rather than focusing only on modeling or theory, it also addresses the broader lifecycle of data-science work, including data ingestion, cleaning, EDA, feature engineering, model building, validation, and deployment.
    Downloads: 0 This Week
  • 13
    Blender GIS

    Blender addons to make the bridge between Blender and geographic data

    Import the most common GIS data formats into Blender: Shapefile vector, raster image, GeoTIFF DEM, and OpenStreetMap XML. BlenderGIS offers many ways to create a 3D terrain from geographic data; check the Flowchart for an overview. Display dynamic web maps inside the Blender 3D view, request OpenStreetMap data (buildings, roads, etc.), and get true elevation data from the NASA SRTM mission. Manage georeferencing information of a scene, compute a terrain mesh by Delaunay...
    Downloads: 78 This Week
  • 14
    ggstatsplot

    Enhancing {ggplot2} plots with statistical analysis

    {ggstatsplot} is an extension of the {ggplot2} package for creating graphics with details from statistical tests included in the information-rich plots themselves. In a typical exploratory data analysis workflow, data visualization and statistical modeling are two different phases: visualization informs modeling, and modeling in turn can suggest a different visualization method, and so on. Bayesian hypothesis-testing. The central idea of {ggstatsplot} is simple: combine these two phases into one in the form of graphics with statistical details, which makes data exploration simpler and faster. ...
    Downloads: 0 This Week
  • 15
    AI Data Science Team

    An AI-powered data science team of agents

    AI Data Science Team is a Python library and agent ecosystem designed to accelerate and automate common data science workflows by modeling them as specialized AI “agents” that can be orchestrated to perform tasks like data cleaning, transformation, analysis, visualization, and machine learning. It provides a modular agent framework where each agent focuses on a step in the typical data science pipeline — for example, loading data from CSV/Excel files, cleaning and wrangling messy datasets, engineering predictive features, building models with AutoML, connecting to SQL databases, and producing visual outputs — all driven by natural language or programmatic instructions. ...
    Downloads: 0 This Week
  • 16
    MolecularGraph.jl

    Graph-based molecule modeling toolkit for cheminformatics

    MolecularGraph.jl is a graph-based molecule modeling and chemoinformatics analysis toolkit fully implemented in Julia.
    Downloads: 0 This Week
  • 17
    ComponentArrays.jl

    Arrays with arbitrarily nested named components

    The main export of this package is the ComponentArray type. "Components" of ComponentArrays are really just array blocks that can be accessed through a named index. This will create a new ComponentArray whose data is a view into the original, allowing for standalone models to be composed together by simple function composition. In essence, ComponentArrays allow you to do the things you would usually need a modeling language for, but without actually needing a modeling language. The main targets are for use in DifferentialEquations.jl and Optim.jl, but anything that requires flat vectors is fair game.
    Downloads: 0 This Week
  • 18
    JUDI.jl

    Julia Devito inversion

    JUDI is a framework for large-scale seismic modeling and inversion and is designed to enable rapid translations of algorithms to fast and efficient code that scales to industry-size 3D problems. The focus of the package lies on seismic modeling as well as PDE-constrained optimization such as full-waveform inversion (FWI) and imaging (LS-RTM). Wave equations in JUDI are solved with Devito, a Python domain-specific language for automated finite-difference (FD) computations. JUDI's modeling...
    Downloads: 0 This Week
  • 19
    Mongoose

    Elegant MongoDB object modeling for Node.js

    Mongoose is a MongoDB object modeling tool that was built to answer the need for better ways to model your application data. It's designed to work in an asynchronous environment, providing a simple, straightforward approach to object modeling that skips out on the tedious tasks of writing MongoDB validation, casting and business logic boilerplate. Mongoose offers an uncomplicated schema-based solution, and comes with nifty features like type casting, validation, query building, and business logic hooks right out of the box. ...
    Downloads: 9 This Week
  • 20
    SuperSplat

    3D Gaussian Splat Editor

    SuperSplat is a free and open source tool for inspecting and editing 3D Gaussian Splats. It is built on web technologies and runs in the browser, so there's nothing to download or install.
    Downloads: 25 This Week
  • 21
    LIFELINES

    Survival analysis in Python

    ...Built-in plotting methods and datasets help users explore survival curves and compare groups visually. It is a practical tool for analysts, researchers, and data scientists who need event-time modeling without leaving Python.
    Downloads: 0 This Week
  • 22
    PyVista

    3D plotting and mesh analysis through a streamlined interface

    3D plotting and mesh analysis through a streamlined interface for the Visualization Toolkit (VTK). PyVista is a helper module for VTK that takes a different approach to interfacing with it, through NumPy and direct array access. This package provides a Pythonic, well-documented interface exposing VTK’s powerful visualization backend to facilitate rapid prototyping, analysis, and visual integration of spatially referenced datasets. This module can be used for...
    Downloads: 2 This Week
  • 23
    AI Hedge Fund

    An AI Hedge Fund Team

    ...The project underlines AI’s potential in investment strategies but also carries disclaimers that it is for research and not financial advice. The implementation is designed so developers can study the pipeline end-to-end: from data ingestion through modeling to simulated portfolio management.
    Downloads: 0 This Week
  • 24
    Robyn

    Experimental, AI/ML-powered and open sourced Marketing Mix Modeling

    Robyn is an open-source, AI/ML-powered Marketing Mix Modeling (MMM) toolkit developed by Meta Marketing Science under the “facebookexperimental” GitHub umbrella. Its goal is to democratize rigorous MMM: what traditionally required expert statisticians and expensive consulting becomes accessible to any company with data. Robyn takes in historical data (spends on different marketing channels, conversions, or revenue, and optional context or organic-media variables) and uses a combination of techniques, regularized regression (Ridge), time-series decomposition (trend, seasonality, holiday effects), and hyperparameter optimization (via evolutionary algorithms), to estimate the incremental impact of each marketing channel. ...
    Downloads: 0 This Week
  • 25
    PyMC

    Bayesian Modeling and Probabilistic Programming in Python

    PyMC is a Python library for probabilistic programming focused on Bayesian statistical modeling and machine learning. Built on top of computational tools like Aesara and NumPy, PyMC allows users to define models using intuitive syntax and perform inference using MCMC, variational inference, and other advanced algorithms. It’s widely used in scientific research, data science, and decision modeling.
    Downloads: 0 This Week