Showing 35 open source projects for "data modeling"

  • 1
    Orange Data Mining

    Orange: Interactive data analysis

    Open source machine learning and data visualization. Build data analysis workflows visually, with a large, diverse toolbox. Perform simple data analysis with clever data visualization. Explore statistical distributions, box plots and scatter plots, or dive deeper with decision trees, hierarchical clustering, heatmaps, MDS and linear projections. Even your multidimensional data can become sensible in 2D, especially with clever attribute ranking and selections. Interactive data exploration for...
    Downloads: 34 This Week
  • 2
    Python for Data Analysis

    Materials and IPython notebooks for "Python for Data Analysis"

    Python for Data Analysis is the official companion repository for Python for Data Analysis, 3rd Edition by Wes McKinney. It contains the datasets, examples, and IPython notebooks used throughout the book. The repository helps readers practice Python data analysis concepts directly in Jupyter Notebook. Its chapters cover Python basics, NumPy, pandas, data loading, cleaning, wrangling, visualization, time series, modeling libraries, and full analysis examples. ...
    Downloads: 1 This Week
  • 3
    AI Data Science Team

    An AI-powered data science team of agents

    AI Data Science Team is a Python library and agent ecosystem designed to accelerate and automate common data science workflows by modeling them as specialized AI “agents” that can be orchestrated to perform tasks like data cleaning, transformation, analysis, visualization, and machine learning. It provides a modular agent framework where each agent focuses on a step in the typical data science pipeline — for example, loading data from CSV/Excel files, cleaning and wrangling messy datasets, engineering predictive features, building models with AutoML, connecting to SQL databases, and producing visual outputs — all driven by natural language or programmatic instructions. ...
    Downloads: 0 This Week
  • 4
    PyVista

    3D plotting and mesh analysis through a streamlined interface

    3D plotting and mesh analysis through a streamlined interface for the Visualization Toolkit (VTK). PyVista is a helper module for VTK that takes a different approach to interfacing with VTK, through NumPy and direct array access. This package provides a Pythonic, well-documented interface exposing VTK’s powerful visualization backend to facilitate rapid prototyping, analysis, and visual integration of spatially referenced datasets. This module can be used for...
    Downloads: 2 This Week
  • 5
    LIFELINES

    Survival analysis in Python

    ...Built-in plotting methods and datasets help users explore survival curves and compare groups visually. It is a practical tool for analysts, researchers, and data scientists who need event-time modeling without leaving Python.
    Downloads: 0 This Week
  • 6
    SDGym

    Benchmarking synthetic data generation methods

    The Synthetic Data Gym (SDGym) is a benchmarking framework for modeling and generating synthetic data. Measure performance and memory usage across different synthetic data modeling techniques – classical statistics, deep learning and more! The SDGym library integrates with the Synthetic Data Vault ecosystem. You can use any of its synthesizers, datasets or metrics for benchmarking.
    Downloads: 1 This Week
  • 7
    Elementary

    Open-source data observability for analytics engineers

    Elementary data monitors are configured and executed like native tests in your dbt project. Elementary uploads and models dbt artifacts, run results, and test results into tables as part of your runs. Get informative notifications on data issues, schema changes, and model and test failures. Inspect upstream and downstream dependencies to understand the impact and root cause of data issues.
    Downloads: 0 This Week
  • 8
    CleanVision

    Automatically find issues in image datasets

    ...CleanVision helps you automatically identify common types of data issues lurking in image datasets. This package currently detects issues in the raw images themselves, making it a useful tool for any computer vision task such as: classification, segmentation, object detection, pose estimation, keypoint detection, generative modeling, etc.
    Downloads: 2 This Week
  • 9
    GMAT

    General Mission Analysis Tool

    The General Mission Analysis Tool (GMAT) is an open-source tool for space mission design and navigation. GMAT is developed by a team of NASA, private industry, and public and private contributors. The GMAT development team is pleased to announce the release of GMAT version R2026a. For a complete list of new features, compatibility changes, and bug fixes, see the R2026a Release Notes in the Users Guide.
    Leader badge
    Downloads: 1,320 This Week
  • 10
    Avogadro

    An intuitive molecular editor and visualization tool

    Avogadro is an advanced molecular editor designed for cross-platform use in computational chemistry, molecular modeling, bioinformatics, materials science and related areas. It offers a flexible rendering framework and a powerful plugin architecture.
    Leader badge
    Downloads: 774 This Week
  • 11

    DataPrep

    Python-based data preprocessing tool

    DataPrep v0.2 is a Tkinter-based GUI tool designed to assist users with data preprocessing, multicollinearity removal, and feature selection for a wide range of applications in cheminformatics, bioinformatics, data analysis, molecular modeling, machine learning, and quantitative structure-property relationship (QSPR) studies. It can load, process, and save datasets, and it supports different preprocessing and multicollinearity removal strategies with customizable parameter settings.
    Downloads: 0 This Week
  • 12
    CSVSplitter
    # CSV Splitter A tool for splitting CSV files into multiple files based on a specified number of records, preserving data integrity and allowing configuration of charset, separator, and formatting. Ideal for handling large CSV files that need to be broken up for easier handling and processing. ## Features - **CSV splitting**: Splits the original file into multiple CSV files, with the number of records per file defined by the...
    Downloads: 0 This Week
  • 13

    FreeSEM

    Free and open-source desktop application designed for SEM

    ...It also enables exporting analysis results and reports to formats like Word, Excel, CSV, and PDF, making it useful for academic research and data analysis workflows.
    Downloads: 0 This Week
  • 14
    Uranie

    Uranie is CEA's uncertainty analysis platform, based on ROOT

    Uranie is a sensitivity and uncertainty analysis platform based on the ROOT framework (http://root.cern.ch). It is developed at CEA, the French Atomic Energy Commission (http://www.cea.fr). It provides various tools for data analysis, sampling, statistical modeling, optimisation, sensitivity analysis, uncertainty analysis, running code on high performance computers, etc. Thanks to ROOT, it is easily scriptable in CINT (C++-like syntax) and Python. It is available for both Unix and Windows platforms (a dedicated platform archive is available on request). ...
    Downloads: 0 This Week
  • 15
    The Python Computer Graphics Kit is a collection of Python modules that contain the basic types and functions to be able to create 3D computer graphics images (focusing on Pixar's RenderMan interface).
    Downloads: 1 This Week
  • 16
    LIFETIMES

    Lifetime value in Python

    LIFETIMES is a Python library for customer lifetime value and repeat purchase behavior modeling. It helps analysts estimate how frequently customers may return, how long they may remain active, and how much value they may generate over time. The library is built around probabilistic models commonly used in customer analytics, including transaction frequency and monetary value modeling. It is useful for ecommerce, subscription-adjacent businesses, retail analytics, and retention analysis. The...
    Downloads: 0 This Week
  • 17
    GEOMS2

    Geostatistics and geosciences modeling software

    GEOMS2 is geostatistics and geosciences modeling software. It provides an interface for grid (mesh), point, surface, and data (non-spatial) objects. It has a 3D viewer and 2D plots built on the well-known Python engines Mayavi and Matplotlib. It offers several functions for manipulating your data, as well as univariate and multivariate analysis. NOTE: The software is still in early beta.
    Leader badge
    Downloads: 15 This Week
  • 18
    PyMOL Molecular Graphics System

    PyMOL is an OpenGL based molecular visualization system

    The Open-Source PyMOL repository has moved to GitHub: https://github.com/schrodinger/pymol-open-source. We still use the pymol-users mailing list here on SourceForge. Please subscribe for community support: https://pymol.org/maillist (note: the SourceForge email newsletter and special offers are optional and can be unchecked). The PyMOL community wiki has its own home: https://pymolwiki.org/
    Downloads: 53 This Week
  • 19
    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit for All of Us

    ...Of course you may specify JASP for advanced data editing and RapidMiner for advanced prediction modeling. DSTK is written in C#, Java and Python to interface with R, NLTK, and Weka. It can be expanded with plugins using R Scripts. We have also created plugins for more statistical functions, and Big Data Analytics with Microsoft Azure HDInsights (Spark Server) with Livy.
    Downloads: 0 This Week
  • 20
    K-3D is the free (as in freedom) 3D modeling, animation, and rendering system.
    Leader badge
    Downloads: 82 This Week
  • 21
    PROJECT MOVED: see http://www.vmtk.org for up-to-date information. Libraries and tools for 3D reconstruction, geometric analysis, mesh generation and surface data analysis in image-based blood vessel modeling.
    Downloads: 0 This Week
  • 22
    pyFormex is a program for generating, transforming and manipulating large geometrical models of 3D structures by sequences of mathematical operations.
    Downloads: 0 This Week
  • 23
    This project examines techniques to model three-dimensional rigid body motion using the geometric algebra of Dual Quaternions and how such models compare to more traditional models when used in underconstrained filtering applications.
    Downloads: 0 This Week
  • 24
    Gaphor is a UML modeling environment written in Python. Gaphor is small and very extensible. The repository is located at http://github.com/gaphor/gaphor.
    Downloads: 0 This Week
  • 25
    Python library for parsing, processing and linearising VRML97 files
    Downloads: 0 This Week