Showing 55 open source projects for "data modeling"

View related business solutions
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 1
    GNNPCSAFT

    GNNPCSAFT

    Smart Thermodynamic Modeling with Graph Neural Networks

    The GNNPCSAFT app is an implementation of our project that focuses on using Graph Neural Networks (GNN) to estimate the pure-component parameters of the Equation of State PC-SAFT. We developed this app so the scientific community can access the model's results easily. In this app, the estimated pure-component parameters can be used to calculate thermodynamic properties and compare them with experimental data from the ThermoML Archive. To install the GNNPCSAFT app, download the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    UnBBayes

    UnBBayes

    Framework & GUI for Bayes Nets and other probabilistic models.

    UnBBayes is a probabilistic network framework written in Java. It has both a GUI and an API with inference, sampling, learning and evaluation. It supports Bayesian networks, influence diagrams, MSBN, OOBN, HBN, MEBN/PR-OWL, PRM, structure, parameter and incremental learning. Please, visit our wiki (https://sourceforge.net/p/unbbayes/wiki/Home/) for more information. Check out the license section (https://sourceforge.net/p/unbbayes/wiki/License/) for our licensing policy.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 3
    GNNPCSAFT Web App

    GNNPCSAFT Web App

    Smart Thermodynamic Modeling with Graph Neural Networks

    The GNNPCSAFT Web App is an implementation of our project that focuses on using Graph Neural Networks (GNN) to estimate the pure-component parameters of the Equation of State PC-SAFT. We developed this app so the scientific community can access the model's results easily. In this app, the estimated pure-component parameters can be used to calculate thermodynamic properties and compare them with experimental data from the ThermoML Archive. More info on github repository.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Complete Machine Learning Package

    Complete Machine Learning Package

    A comprehensive machine learning repository containing 30+ notebooks

    Complete Machine Learning Package repository is a comprehensive educational collection of machine learning notebooks designed to teach core data science and AI concepts through practical coding examples. The project includes more than thirty notebooks that cover a wide range of topics including data analysis, statistical modeling, neural networks, and deep learning. Each notebook introduces theoretical ideas and then demonstrates how to implement them using Python libraries commonly used in data science, such as NumPy, pandas, scikit-learn, and TensorFlow. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    QuantResearch

    QuantResearch

    Quantitative analysis, strategies and backtests

    ...These include implementations of factor models, statistical arbitrage strategies, portfolio optimization methods, and reinforcement learning approaches to trading. The repository also explores financial modeling topics such as vector autoregression, Gaussian mixture models, and option pricing techniques. Many notebooks demonstrate backtesting pipelines that allow users to evaluate trading strategies using historical market data. The project integrates machine learning methods with traditional quantitative finance models, illustrating how statistical techniques can be applied to asset management and trading.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    SGX-Full-OrderBook-Tick-Data-Trading

    SGX-Full-OrderBook-Tick-Data-Trading

    Providing the solutions for high-frequency trading (HFT) strategies

    SGX-Full-OrderBook-Tick-Data-Trading-Strategy is an open-source research project focused on modeling high-frequency financial market behavior using machine learning techniques. The repository analyzes tick-level order book data from the Singapore Exchange and attempts to capture the dynamics of limit order book movements. By extracting features such as order depth ratios and price movement indicators, the system trains machine learning models to predict short-term market changes. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    OmicSelector

    OmicSelector

    Feature selection and deep learning modeling for omic biomarker study

    OmicSelector is an environment, Docker-based web application, and R package for biomarker signature selection (feature selection) from high-throughput experiments and others. It was initially developed for miRNA-seq (small RNA, smRNA-seq; hence the name was miRNAselector), RNA-seq and qPCR, but can be applied for every problem where numeric features should be selected to counteract overfitting of the models. Using our tool, you can choose features, like miRNAs, with the most significant...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Pattern

    Pattern

    Web mining module for Python, with tools for scraping

    Pattern is an open-source Python library that provides tools for web mining, natural language processing, machine learning, and network analysis. The project integrates multiple capabilities into a single framework that allows developers to collect, process, and analyze textual data from the web. It includes modules for web scraping and crawling that can retrieve information from sources such as social media platforms, search engines, and online knowledge bases. In addition to data mining features, the library offers natural language processing functionality including part-of-speech tagging, sentiment analysis, and n-gram extraction. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    pyntcloud

    pyntcloud

    pyntcloud is a Python library for working with 3D point clouds

    This page will introduce the general concept of point clouds and illustrate the capabilities of pyntcloud as a point cloud processing tool. Point clouds are one of the most relevant entities for representing three dimensional data these days, along with polygonal meshes (which are just a special case of point clouds with connectivity graph attached). In its simplest form, a point cloud is a set of points in a cartesian coordinate system. Accurate 3D point clouds can nowadays be (easily and...
    Downloads: 1 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    Guia do Cientista de Dados das Galáxias

    Guia do Cientista de Dados das Galáxias

    Repository for gathering information on study materials

    Guia do Cientista de Dados das Galáxias is an open-source community repository that aggregates educational resources, tools, and references related to data science, machine learning, and analytics. The project was created by the Pizza de Dados community with the goal of organizing useful materials for people interested in learning or working in the data science ecosystem. The repository collects links to books, podcasts, tutorials, datasets, communities, and study groups that can help...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Machine Learning in Asset Management

    Machine Learning in Asset Management

    Machine Learning in Asset Management

    ...The project collects educational materials, code implementations, and experiments related to applying artificial intelligence methods in financial markets. It covers topics such as predictive modeling for asset prices, portfolio optimization strategies, and risk management using machine learning algorithms. The repository also includes references to academic research, tutorials, and datasets that help users understand how machine learning can enhance traditional investment strategies. Many of the experiments focus on applying supervised learning, reinforcement learning, and statistical modeling techniques to financial data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    scikit-learn tips

    scikit-learn tips

    50 scikit-learn tips

    scikit-learn-tips is an educational repository that collects practical advice and best practices for using the scikit-learn machine learning library effectively. The project consists of short explanations and examples that highlight common patterns, pitfalls, and techniques used when building machine learning workflows in Python. Each tip typically demonstrates how specific components of scikit-learn, such as pipelines, preprocessing utilities, or model evaluation tools, should be applied in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    SparrowRecSys

    SparrowRecSys

    A Deep Learning Recommender System

    SparrowRecSys is an open-source deep learning recommendation system framework designed to demonstrate the architecture and implementation of modern industrial-scale recommender systems. The project integrates multiple machine learning models and data processing pipelines to simulate how real-world recommendation platforms operate. It includes components for offline data processing, feature engineering, model training, real-time data updates, and online recommendation services. SparrowRecSys supports a wide range of state-of-the-art recommendation algorithms, including models for click-through rate prediction and user behavior modeling that are widely used in advertising and content recommendation systems. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Texthero

    Texthero

    Text preprocessing, representation and visualization from zero to hero

    Texthero is a python package to work with text data efficiently. It empowers NLP developers with a tool to quickly understand any text-based dataset and it provides a solid pipeline to clean and represent text data, from zero to hero.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    MLOps Course

    MLOps Course

    Learn how to design, develop, deploy and iterate on ML apps

    The MLOps Course by Goku Mohandas is an open-source curriculum that teaches how to combine machine learning with solid software engineering to build production-grade ML applications. It is structured around the full lifecycle: data pipelines, modeling, experiment tracking, deployment, testing, monitoring, and iteration. The repository itself contains configuration, code examples, and links to accompanying lessons hosted on the Made With ML site, which provide detailed narrative explanations and diagrams. Instead of focusing only on model training, the course emphasizes best practices like modular code design, CI/CD, containerization, reproducibility, and responsible ML (including monitoring and feedback loops). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    RoboSat

    RoboSat

    Semantic segmentation on aerial and satellite imagery

    RoboSat is an end-to-end pipeline written in Python 3 for feature extraction from aerial and satellite imagery. Features can be anything visually distinguishable in the imagery for example: buildings, parking lots, roads, or cars.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Machine Learning Mindmap

    Machine Learning Mindmap

    A mindmap summarising Machine Learning concepts

    ...The project organizes a wide range of machine learning topics into an interconnected diagram that helps learners understand how concepts relate to one another across the broader field of artificial intelligence. The mind map covers fundamental areas such as data preprocessing, statistical analysis, supervised learning, unsupervised learning, reinforcement learning, and deep learning architectures. By arranging these concepts visually, the repository allows students and practitioners to quickly explore the relationships between algorithms, techniques, and modeling approaches used in modern machine learning workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    TenorSpace.js

    TenorSpace.js

    Neural network 3D visualization framework

    TensorSpace is a neural network 3D visualization framework built using TensorFlow.js, Three.js and Tween.js. TensorSpace provides Keras-like APIs to build deep learning layers, load pre-trained models, and generate a 3D visualization in the browser. From TensorSpace, it is intuitive to learn what the model structure is, how the model is trained and how the model predicts the results based on the intermediate information. After preprocessing the model, TensorSpace supports the visualization...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    DS-Take-Home

    DS-Take-Home

    Solution to the book A Collection of Data Science Take-Home Challenge

    ...The problems cover a broad set of applied data science topics including conversion rate analysis, fraud detection, employee retention modeling, marketing campaign evaluation, and recommendation-style problems.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Learn_Data_Science_in_3_Months

    Learn_Data_Science_in_3_Months

    This is the Curriculum for "Learn Data Science in 3 Months"

    This project lays out a 12-week plan to go from basics to a portfolio-ready understanding of data science. It breaks the journey into clear stages: Python fundamentals, data wrangling, visualization, statistics, machine learning, and end-to-end projects. The schedule mixes learning and doing, encouraging you to build small deliverables each week—like notebooks, dashboards, and model demos—to reinforce skills. It also includes suggestions for datasets and problem domains so you aren’t stuck...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Convolutional Recurrent Neural Network

    Convolutional Recurrent Neural Network

    Convolutional Recurrent Neural Network (CRNN) for image-based sequence

    Convolutional Recurrent Neural Network provides an implementation of the Convolutional Recurrent Neural Network (CRNN) architecture, a deep learning model designed for image-based sequence recognition tasks such as optical character recognition and scene text recognition. The architecture combines convolutional neural networks for extracting visual features from images with recurrent neural networks that model sequential dependencies in the extracted features. This hybrid approach allows the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Edward

    Edward

    A probabilistic programming language in TensorFlow

    A library for probabilistic modeling, inference, and criticism. Edward is a Python library for probabilistic modeling, inference, and criticism. It is a testbed for fast experimentation and research with probabilistic models, ranging from classical hierarchical models on small data sets to complex deep probabilistic models on large data sets. Edward fuses three fields, Bayesian statistics and machine learning, deep learning, and probabilistic programming. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit for All of Us

    ...Of course you may specify JASP for advanced data editing and RapidMiner for advanced prediction modeling. DSTK is written in C#, Java and Python to interface with R, NLTK, and Weka. It can be expanded with plugins using R Scripts. We have also created plugins for more statistical functions, and Big Data Analytics with Microsoft Azure HDInsights (Spark Server) with Livy.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Apache PredictionIO

    Apache PredictionIO

    Machine learning server for developers and ML engineers

    Apache PredictionIO® is an open source Machine Learning Server built on top of a state-of-the-art open source stack for developers and data scientists to create predictive engines for any machine learning task. Quickly build and deploy an engine as a web service on production with customizable templates; respond to dynamic queries in real-time once deployed as a web service; evaluate and tune multiple engine variants systematically; unify data from multiple platforms in batch or in real-time for comprehensive predictive analytics; speed up machine learning modeling with systematic processes and pre-built evaluation measures; support machine learning and data processing libraries such as Spark MLLib and OpenNLP; implement your own machine learning models and seamlessly incorporate them into your engine; simplify data infrastructure management.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    PredictionIO

    PredictionIO

    Machine learning server for building predictive applications

    Apache PredictionIO is an open-source machine learning server designed to simplify the process of building and deploying predictive engines. It offers a scalable infrastructure with support for multiple ML algorithms, event data collection, and deployment workflows. Developers can use templates or build custom engines, making it a flexible solution for integrating machine learning into applications.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB