Showing 27 open source projects for "data modeling"

View related business solutions
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 1
    Synthetic Data Vault (SDV)

    Synthetic Data Vault (SDV)

    Synthetic Data Generation for tabular, relational and time series data

    ...Additionally, it enables the testing of Machine Learning or other data dependent software systems without the risk of exposure that comes with data disclosure. Underneath the hood it uses several probabilistic graphical modeling and deep learning based techniques. To enable a variety of data storage structures, we employ unique hierarchical generative modeling and recursive sampling techniques.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    SDGym

    SDGym

    Benchmarking synthetic data generation methods

    The Synthetic Data Gym (SDGym) is a benchmarking framework for modeling and generating synthetic data. Measure performance and memory usage across different synthetic data modeling techniques – classical statistics, deep learning and more! The SDGym library integrates with the Synthetic Data Vault ecosystem. You can use any of its synthesizers, datasets or metrics for benchmarking.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Frappe

    Frappe

    Low code web framework for real world applications

    Frappe is a full-stack, low-code web framework written in Python and JavaScript, used to build scalable and modular enterprise applications. It powers ERPNext and includes tools for REST APIs, user management, document modeling, workflows, and real-time updates. Frappe uses a "model-view-controller" approach with its own ORM and frontend system, enabling rapid development without sacrificing control or performance.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Meridian

    Meridian

    Meridian is an MMM framework

    ...The framework provides a robust foundation for constructing in-house MMM pipelines capable of handling both national and geo-level data, with built-in support for calibration using experimental data or prior knowledge. Meridian uses the No-U-Turn Sampler (NUTS) for Markov Chain Monte Carlo (MCMC) sampling to produce statistically rigorous results, and it includes GPU acceleration to significantly reduce computation time.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 5
    Copulas

    Copulas

    A library to model multivariate data using copulas

    Copulas is a Python library for modeling multivariate distributions and sampling from them using copula functions. Given a table of numerical data, use Copulas to learn the distribution and generate new synthetic data following the same statistical properties. Choose from a variety of univariate distributions and copulas – including Archimedian Copulas, Gaussian Copulas and Vine Copulas.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    CTGAN

    CTGAN

    Conditional GAN for generating synthetic tabular data

    CTGAN is a collection of Deep Learning based synthetic data generators for single table data, which are able to learn from real data and generate synthetic data with high fidelity. If you're just getting started with synthetic data, we recommend installing the SDV library which provides user-friendly APIs for accessing CTGAN. The SDV library provides wrappers for preprocessing your data as well as additional usability features like constraints. When using the CTGAN library directly, you may...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    statsmodels

    statsmodels

    Statsmodels, statistical modeling and econometrics in Python

    statsmodels is a Python module that provides classes and functions for the estimation of many different statistical models, as well as for conducting statistical tests, and statistical data exploration. An extensive list of result statistics are available for each estimator. The results are tested against existing statistical packages to ensure that they are correct. The package is released under the open source Modified BSD (3-clause) license. Generalized linear models with support for all...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    Stock prediction deep neural learning

    Stock prediction deep neural learning

    Predicting stock prices using a TensorFlow LSTM

    Predicting stock prices can be a challenging task as it often does not follow any specific pattern. However, deep neural learning can be used to identify patterns through machine learning. One of the most effective techniques for series forecasting is using LSTM (long short-term memory) networks, which are a type of recurrent neural network (RNN) capable of remembering information over a long period of time. This makes them extremely useful for predicting stock prices. Predicting stock...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    RAG Anything

    RAG Anything

    RAG-Anything: All-in-One RAG Framework

    RAG-Anything is an open-source unified framework that extends the Retrieval-Augmented Generation (RAG) paradigm to fully multimodal document and knowledge retrieval, enabling systems to ingest, parse, represent, and query rich content that includes text, images, tables, formulas, and other structured or visual elements. Traditional RAG systems are typically limited to text and cannot effectively work across heterogeneous document layouts, but RAG-Anything addresses this by modeling multimodal content in ways that preserve cross-modal relationships and semantic context, often treating content elements as interconnected knowledge entities rather than separate data silos. The system uses a multi-stage pipeline (e.g., document parsing, content analysis, knowledge graph construction, intelligent retrieval) so queries can navigate across modalities with deeper understanding and relevance.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 10
    AudioCraft

    AudioCraft

    Audiocraft is a library for audio processing and generation

    ...It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides inference scripts, checkpoints, and simple Python APIs so you can generate clips from prompts or incorporate the models into applications. It also contains training code and recipes, so researchers can fine-tune on custom data or explore new objectives without building infrastructure from scratch. Example notebooks, CLI tools, and audio utilities help with prompt design, conditioning on reference audio, and post-processing to produce ready-to-share outputs.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 11
    MetricFlow

    MetricFlow

    MetricFlow allows you to define, build, and maintain metrics in code

    MetricFlow is an open-source semantic layer engine designed to help organizations define, manage, and query business metrics in a consistent, governed way. It works alongside a data stack—typically built with dbt—and allows you to express metrics as YAML‐based definitions tied to semantic models and dimension tables, rather than embedding logic ad-hoc across many dashboards or scripts. When a user or tool requests a metric (e.g., “monthly revenue by region”), MetricFlow generates optimized,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    PoJamas aims to provide a Python and tools for loading, processing, and producing .cr2, pz3 (crz, pzz) files compatible with the SmithMicro (e-frontier) Poser character animation application. PoJamas is composed of: - Python library - Python Wavefront (.obj) 3D viewer based on GLFW - LibreOffice/Python Application (to ease the library and the viewer usage) As of 2020, the project is ported in Python3 As of 2021 this project proposes a 3D viewer for Wavefront files...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Twinify

    Twinify

    Privacy-preserving generation of a synthetic twin to a data set

    ...For the latter, twinify also offers automatic modeling for easy building of models fitting the data. If you have existing experience with NumPyro you can also implement your own model directly. Often data that would be very useful for the scientific community is subject to privacy regulations and concerns and cannot be shared. Differentially private data sharing allows generating of synthetic data that is statistically similar to the original data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Fairseq

    Fairseq

    Facebook AI Research Sequence-to-Sequence Toolkit written in Python

    Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling and other text generation tasks. We provide reference implementations of various sequence modeling papers. Recent work by Microsoft and Google has shown that data parallel training can be made significantly more efficient by sharding the model parameters and optimizer state across data parallel workers. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    itamm

    itamm

    Tool to design and share enterprise solutions, services and processes

    The tool is for people who design, analyze, optimize and develop processes, services and solution architectures. IT(A)-MM is a tool to design models of solutions, services and enterprise processes. It allows you to visualize data using popular BPMN and ArchiMate visualization notation. It also has its own extensible notation for visualizing enterprise environment objects. IT(A)-MM is easy to use and allows you to use it wherever you are. Using IT(A)-MM can be the first step towards deploy...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    BMC

    BMC

    Notes on Scientific Computing for Biomechanics

    This repository is a collection of lecture notes and code on scientific computing and data analysis for Biomechanics and Motor Control.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Makani

    Makani

    Makani was developed a commercial-scale airborne wind turbine

    Makani was an ambitious Google X project that sought to harness wind energy using airborne wind turbines — autonomous kites capable of generating power while flying in crosswind patterns. This open-source repository contains the complete software stack that powered Makani’s research and flight systems, including the flight simulator, autopilot controller, avionics firmware, visualization tools, and ground control software. The software enables simulation, control, and analysis of the Makani...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Albedo

    Albedo

    A recommender system for discovering GitHub repos

    Albedo is an open-source recommender system aimed at helping developers discover GitHub repositories by learning from activity signals. It treats repositories and developers as a graph of interactions and applies large-scale matrix factorization to model affinities, with Apache Spark providing the distributed data processing. The project focuses on implicit feedback—stars, watches, and other engagement metrics—so it can build useful recommendations without explicit ratings. A reproducible...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    data-science-ipython-notebooks

    data-science-ipython-notebooks

    Data science Python notebooks: Deep learning

    Data Science IPython Notebooks is a broad, curated set of Jupyter notebooks covering Python, data wrangling, visualization, machine learning, deep learning, and big data tools. It aims to be a practical map of the ecosystem, showing hands-on examples with libraries such as NumPy, pandas, matplotlib, scikit-learn, and others. Many notebooks introduce concepts step by step, then apply them to real datasets so readers can see techniques in action. Advanced sections touch on neural networks and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Importer library to import assets from different common 3D file formats such as Collada, Blend, Obj, X, 3DS, LWO, MD5, MD2, MD3, MDL, MS3D and a lot of other formats. The data is stored in an own in-memory data-format, which can be easily processed. www.open3mod.com/ is a 3D model viewer and exporter based on Assimp that is also Open Source.
    Downloads: 25 This Week
    Last Update:
    See Project
  • 21
    Math tools in Python to tackle down problems in Operational Research fields. Comes with a Django based web interface to allow remote access to complex simulation means.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    aim is to provide support for monitoring system behaviour through dynamic analysis, and for analysing and modeling this data for a better understanding of system behaviour and for test automation
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    This project examines techniques to model three-dimensional rigid body motion using the geometric algebra of Dual Quaternions and how such models compare to more traditional models when used in underconstrained filtering applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Gaphor is a UML modeling environment written in Python. Gaphor is small and very extensible. The repository is located at http://github.com/gaphor/gaphor.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    This program generates customizable hyper-surfaces (multi-dimensional input and output) and samples data from them to be used further as benchmark for response surface modeling tasks or optimization algorithms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB