Showing 106 open source projects for "statistical"

View related business solutions
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    AutoViz

    AutoViz

    Automatically Visualize any dataset, any size

    ...The system also includes built-in tools for evaluating data quality and identifying potential issues such as missing values or unusual distributions. By automating the visualization process, AutoViz allows users to rapidly explore datasets before applying machine learning models or statistical analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    NeuralForecast

    NeuralForecast

    Scalable and user friendly neural forecasting algorithms.

    ...Unfortunately, available implementations and published research are yet to realize neural networks' potential. They are hard to use and continuously fail to improve over statistical methods while being computationally prohibitive. For this reason, we created NeuralForecast, a library favoring proven accurate and efficient models focusing on their usability.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    YData Synthetic

    YData Synthetic

    Synthetic data generators for tabular and time-series data

    A package to generate synthetic tabular and time-series data leveraging state-of-the-art generative models. Synthetic data is artificially generated data that is not collected from real-world events. It replicates the statistical components of real data without containing any identifiable information, ensuring individuals' privacy. This repository contains material related to Generative Adversarial Networks for synthetic data generation, in particular regular tabular data and time-series. It consists a set of different GANs architectures developed using Tensorflow 2.0. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Synthetic Data Vault (SDV)

    Synthetic Data Vault (SDV)

    Synthetic Data Generation for tabular, relational and time series data

    The Synthetic Data Vault (SDV) is a Synthetic Data Generation ecosystem of libraries that allows users to easily learn single-table, multi-table and timeseries datasets to later on generate new Synthetic Data that has the same format and statistical properties as the original dataset. Synthetic data can then be used to supplement, augment and in some cases replace real data when training Machine Learning models. Additionally, it enables the testing of Machine Learning or other data dependent software systems without the risk of exposure that comes with data disclosure. Underneath the hood it uses several probabilistic graphical modeling and deep learning based techniques. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    mlforecast

    mlforecast

    Scalable machine learning for time series forecasting

    mlforecast is a time-series forecasting framework built around machine-learning models, designed to make forecasting both efficient and scalable. It lets you apply any regressor that follows the typical scikit-learn API, for example, gradient-boosted trees or linear models, to time-series data by automating much of the messy feature engineering and data preparation. Instead of writing custom code to build lagged features, rolling statistics, and date-based predictors, mlforecast generates...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Darts

    Darts

    A python library for easy manipulation and forecasting of time series

    darts is a Python library for easy manipulation and forecasting of time series. It contains a variety of models, from classics such as ARIMA to deep neural networks. The models can all be used in the same way, using fit() and predict() functions, similar to scikit-learn. The library also makes it easy to backtest models, combine the predictions of several models, and take external data into account. Darts supports both univariate and multivariate time series and models. The ML-based models...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Stake Crash Predictor

    Stake Crash Predictor

    Stake Crash Predictor is a toolkit for stake mines predictor & Plinko.

    The Stake Crash Predictor is a focused toolkit that combines statistical analysis, optional server fairness seed hash decrypt helpers, and AI-assisted summaries to help you study rounds on Stake.us. This project centers on the stake mines predictor and stake predictor workflows Demo-focused stake crash predictor app — seed-inspection helpers (SHA-512 / SHA-256), AI-assisted summaries, and demo bot templates for stake mines predictor too, Start in demo mode to test safely. ...
    Leader badge
    Downloads: 144 This Week
    Last Update:
    See Project
  • 8
    SPPAS

    SPPAS

    SPPAS - the automatic annotation and analyses of speech

    ...SPPAS is able to produce automatically speech annotations from a recorded speech sound and its orthographic transcription. SPPAS is helpful for the analysis of any annotated data: estimate statistical distributions, make requests, manage files, visualize annotations. SPPAS offers a file converter from/to a wide range of formats: xra, TextGrid, eaf, trs... <https://sppas.org>
    Downloads: 36 This Week
    Last Update:
    See Project
  • 9

    WhisperJAV

    A subtitle generator for Japanese Adult Videos.

    ...Transformer-based ASR architectures like Whisper suffer significant performance degradation when applied to the spontaneous and noisy domain of JAV. This degradation is driven by specific acoustic and temporal characteristics that defy the statistical distributions of standard training data.
    Leader badge
    Downloads: 52 This Week
    Last Update:
    See Project
  • 10
    Dodge gpt

    Dodge gpt

    Bypass Ai content for GPTZero and others making text Undetectable

    *New Update* ╔════════════════════════════════════════════════════════════════╗ ║ DODGE V10 - STEALTH EDITION ║ The Only AI Text Humanizer That Defeats GPTZero ╚════════════════════════════════════════════════════════════════╝ █████████████████████████████████████████████████████████████████ █ █ █ 🛡️ CURRENT STATUS: GPTZERO RESISTANT - VERIFIED 2026 █ █ 📊 SUCCESS RATE: 60.7% AGAINST ALL DETECTORS █ █ 🔬 BASED ON: REAL HUMAN CORPUS ANALYSIS █ █ █ █████████████████████████████████████████████████████████████████ Dodge V10 isn't just another "synonym replacer" or "typo adder". It is a sophisticated neural text transformation engine that rewrites content to match the EXACT statistical fingerprint of human writing. -----100% Free-----
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    Uranie

    Uranie

    Uranie is CEA's uncertainty analysis platform, based on ROOT

    Uranie is a sensitivity and uncertainty analysis plateform based on the ROOT framework (http://root.cern.ch) . It is developed at CEA, the French Atomic Energy Commission (http://www.cea.fr). It provides various tools for: - data analysis - sampling - statistical modeling - optimisation - sensitivity analysis - uncertainty analysis - running code on high performance computers - etc. Thanks to ROOT, it is easily scriptable in CINT (c++ like syntax) and Python. Is is available both for Unix and Windows platforms (a dedicated platform archive is available on request). Note : if you have downloaded version 3.12 before the 8th of february, a patch exists for a minor bug on TOutputFileKey file, don't hesitate to ask us.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 12
    stkpp

    stkpp

    C++ Statistical ToolKit

    STK++ (http://www.stkpp.org) is a versatile, fast, reliable and elegant collection of C++ classes for statistics, clustering, linear algebra, arrays (with an Eigen-like API), regression, dimension reduction, etc. Some functionalities provided by the library are available in the R environment as R functions (http://cran.at.r-project.org/web/packages/rtkore/index.html). At a convenience, we propose the source packages on sourceforge. The library offers a dense set of (mostly) template...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    sketch

    sketch

    AI code-writing assistant that understands data content

    Sketch is an open-source AI-powered data analysis assistant designed specifically for pandas users, enabling natural language interaction with tabular datasets to generate code, insights, and transformations. It works by summarizing the structure and statistical properties of a dataset and providing that context to a language model, allowing it to generate highly relevant and accurate responses tailored to the data. The tool integrates directly into pandas dataframes through an extension, making it easy to use within existing Python workflows without requiring additional IDE plugins. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    pattern_classification

    pattern_classification

    A collection of tutorials and examples for solving machine learning

    The pattern_classification repository is an educational project that provides tutorials, examples, and reference materials related to machine learning and statistical pattern recognition. The project aims to help learners understand the process of building predictive models by presenting structured explanations and practical examples. It includes notebooks and guides that demonstrate data preprocessing, feature extraction, model training, and evaluation techniques used in machine learning workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    tdsft

    tdsft

    TDSFT (Two-Dimensional Segmentation Fusion Tool)

    Given multiple segmentations related to the same tumor, statistical metrics could be exploited to determine the most reliable one, from there identified as ground truth. Numerous algorithms have been developed over time, but to date, there is no validated method for this procedure. Therefore, research is still active in this area. Two-Dimensional Segmentation Fusion Tool (TDSFT) is an open-source tool developed in MATLAB and distributed as a standalone application for MAC, Linux, and Windows, which offers a simple and extensible interface where numerous algorithms are proposed to "mediate" (e.g., process and fuse) multiple segmentations. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    QuantResearch

    QuantResearch

    Quantitative analysis, strategies and backtests

    ...The project integrates machine learning methods with traditional quantitative finance models, illustrating how statistical techniques can be applied to asset management and trading.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Complete Machine Learning Package

    Complete Machine Learning Package

    A comprehensive machine learning repository containing 30+ notebooks

    Complete Machine Learning Package repository is a comprehensive educational collection of machine learning notebooks designed to teach core data science and AI concepts through practical coding examples. The project includes more than thirty notebooks that cover a wide range of topics including data analysis, statistical modeling, neural networks, and deep learning. Each notebook introduces theoretical ideas and then demonstrates how to implement them using Python libraries commonly used in data science, such as NumPy, pandas, scikit-learn, and TensorFlow. The repository also includes examples related to natural language processing, computer vision, and data visualization, giving learners exposure to several subfields of machine learning. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Lingua-Go

    Lingua-Go

    The most accurate natural language detection library for Go

    Lingua-Go is a Golang implementation of the Lingua language detection library, providing efficient and accurate language identification for Go-based applications. Its task is simple: It tells you which language some text is written in. This is very useful as a preprocessing step for linguistic data in natural language processing applications such as text classification and spell checking. Other use cases, for instance, might include routing e-mails to the right geographically located...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    fe4ml-zh

    fe4ml-zh

    Feature Engineering for Machine Learning

    ...The project explains techniques for creating, selecting, and transforming features in ways that improve model accuracy and robustness. It also discusses the role of domain knowledge, data preprocessing, and statistical reasoning in building effective machine learning models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    CausalNex

    CausalNex

    A Python library that helps data scientists to infer causation

    CausalNex is a Python library that uses Bayesian Networks to combine machine learning and domain expertise for causal reasoning. You can use CausalNex to uncover structural relationships in your data, learn complex distributions, and observe the effect of potential interventions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Merlion

    Merlion

    A Machine Learning Framework for Time Series Intelligence

    Merlion is a Python library for time series intelligence. It provides an end-to-end machine learning framework that includes loading and transforming data, building and training models, post-processing model outputs, and evaluating model performance. It supports various time series learning tasks, including forecasting, anomaly detection, and change point detection for both univariate and multivariate time series. This library aims to provide engineers and researchers a one-stop solution to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Machine Learning Git Codebook

    Machine Learning Git Codebook

    For extensive instructor led learning

    ...The repository organizes these topics into sequential notebooks that explain theoretical concepts while allowing users to experiment directly with code. Many lessons emphasize hands-on exercises where learners analyze datasets, implement algorithms, and evaluate results through visualizations and statistical metrics.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    ISLR-python

    ISLR-python

    An Introduction to Statistical Learning

    ISLR-python is an educational repository that provides Python implementations and notebooks corresponding to examples and exercises from the book An Introduction to Statistical Learning. The project recreates tables, figures, and laboratory exercises originally presented in the book so that readers can explore the concepts using Python rather than the original R environment. The repository includes Jupyter notebooks demonstrating statistical learning methods such as linear regression, classification algorithms, resampling methods, and model evaluation techniques. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Pattern Recognition and Machine Learning

    Pattern Recognition and Machine Learning

    Repository of notes, code and notebooks in Python

    ...The project recreates many of the mathematical concepts and diagrams from the book using executable Jupyter notebooks, allowing readers to experiment directly with the algorithms described in the text. Each section of the repository corresponds to chapters in the book and includes code examples that demonstrate statistical modeling, machine learning methods, and Bayesian inference techniques. These notebooks provide visualizations and computational demonstrations that help clarify complex topics such as probabilistic models, neural networks, kernel methods, and graphical models. The repository also includes implementations of sampling methods, clustering algorithms, and dimensionality reduction techniques used throughout machine learning research.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Machine Learning in Asset Management

    Machine Learning in Asset Management

    Machine Learning in Asset Management

    ...The repository also includes references to academic research, tutorials, and datasets that help users understand how machine learning can enhance traditional investment strategies. Many of the experiments focus on applying supervised learning, reinforcement learning, and statistical modeling techniques to financial data. By combining theory, research papers, and practical implementations, the repository functions as both a learning platform and a research resource for quantitative finance.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB