Search Results for "statistical distributions"

Showing 45 open source projects for "statistical distributions"

View related business solutions
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    Book5_Essentials-Probability-Statistics

    Book5_Essentials-Probability-Statistics

    The book 5 of statistics in simplicity

    Book5_Essentials-of-Probability-and-Statistics is a Visualize-ML educational volume that introduces the statistical and probabilistic concepts underpinning modern data analysis and machine learning. The repository explains topics such as distributions, sampling, inference, and uncertainty using visual demonstrations and intuitive narratives. Its teaching philosophy prioritizes conceptual clarity over heavy formalism, making statistical thinking more approachable for beginners. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    ggstatsplot

    ggstatsplot

    Enhancing {ggplot2} plots with statistical analysis

    {ggstatsplot} is an extension of {ggplot2} package for creating graphics with details from statistical tests included in the information-rich plots themselves. In a typical exploratory data analysis workflow, data visualization and statistical modeling are two different phases: visualization informs modeling, and modeling in its turn can suggest a different visualization method, and so on and so forth. Bayesian hypothesis-testing. The central idea of {ggstatsplot} is simple: combine these...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    statsmodels

    statsmodels

    Statsmodels, statistical modeling and econometrics in Python

    statsmodels is a Python module that provides classes and functions for the estimation of many different statistical models, as well as for conducting statistical tests, and statistical data exploration. An extensive list of result statistics are available for each estimator. The results are tested against existing statistical packages to ensure that they are correct. The package is released under the open source Modified BSD (3-clause) license.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    TensorFlow Probability

    TensorFlow Probability

    Probabilistic reasoning and statistical analysis in TensorFlow

    ...Tools to build deep probabilistic models, including probabilistic layers and a `JointDistribution` abstraction. Variational inference and Markov chain Monte Carlo. A wide selection of probability distributions and bijectors. Optimizers such as Nelder-Mead, BFGS, and SGLD.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Copulas

    Copulas

    A library to model multivariate data using copulas

    Copulas is a Python library for modeling multivariate distributions and sampling from them using copula functions. Given a table of numerical data, use Copulas to learn the distribution and generate new synthetic data following the same statistical properties. Choose from a variety of univariate distributions and copulas – including Archimedian Copulas, Gaussian Copulas and Vine Copulas. Compare real and synthetic data visually after building your model.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Smile

    Smile

    Statistical machine intelligence and learning engine

    Smile is a fast and comprehensive machine learning engine. With advanced data structures and algorithms, Smile delivers the state-of-art performance. Compared to this third-party benchmark, Smile outperforms R, Python, Spark, H2O, xgboost significantly. Smile is a couple of times faster than the closest competitor. The memory usage is also very efficient. If we can train advanced machine learning models on a PC, why buy a cluster? Write applications quickly in Java, Scala, or any JVM...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 7
    MathPHP

    MathPHP

    Powerful modern math library for PHP

    Math PHP is a library that brings advanced mathematical functions and data analysis capabilities to PHP applications. It covers a wide range of topics, including linear algebra, calculus, statistics, probability, and numerical analysis. Math PHP is designed for developers and data scientists who require precise and efficient mathematical computations in PHP, making it suitable for scientific computing and data processing.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    PyMC

    PyMC

    Bayesian Modeling and Probabilistic Programming in Python

    PyMC is a Python library for probabilistic programming focused on Bayesian statistical modeling and machine learning. Built on top of computational tools like Aesara and NumPy, PyMC allows users to define models using intuitive syntax and perform inference using MCMC, variational inference, and other advanced algorithms. It’s widely used in scientific research, data science, and decision modeling.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Orange Data Mining

    Orange Data Mining

    Orange: Interactive data analysis

    Open source machine learning and data visualization. Build data analysis workflows visually, with a large, diverse toolbox. Perform simple data analysis with clever data visualization. Explore statistical distributions, box plots and scatter plots, or dive deeper with decision trees, hierarchical clustering, heatmaps, MDS and linear projections. Even your multidimensional data can become sensible in 2D, especially with clever attribute ranking and selections. Interactive data exploration for rapid qualitative analysis with clean visualizations. ...
    Downloads: 61 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    DataProfiler

    DataProfiler

    Extract schema, statistics and entities from datasets

    DataProfiler is an AI-powered tool for automatic data analysis and profiling, designed to detect patterns, anomalies, and schema inconsistencies in structured and unstructured datasets. The DataProfiler is a Python library designed to make data analysis, monitoring, and sensitive data detection easy. Loading Data with a single command, the library automatically formats & loads files into a DataFrame. Profiling the Data, the library identifies the schema, statistics, entities (PII / NPI), and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Bayesian Statistics

    Bayesian Statistics

    This repository holds slides and code for a full Bayesian statistics

    This repository holds slides and code for a full Bayesian statistics graduate course. Bayesian statistics is an approach to inferential statistics based on Bayes' theorem, where available knowledge about parameters in a statistical model is updated with the information in observed data. The background knowledge is expressed as a prior distribution and combined with observational data in the form of a likelihood function to determine the posterior distribution. The posterior can also be used...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Rspamd

    Rspamd

    Rapid spam filtering system

    Rspamd is an advanced spam filtering system and email processing framework that allows evaluation of messages by a number of rules including regular expressions, statistical analysis and custom services such as URL black lists. Each message is analysed by Rspamd and given a verdict that might be used by MTA for further processing (e.g. to reject a message, or add a special header indicating spam) along with other information, such as possible DKIM signature or modifications suggested for a...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    Population Shift Monitoring

    Population Shift Monitoring

    Monitor the stability of a Pandas or Spark dataframe

    popmon is a package that allows one to check the stability of a dataset. popmon works with both pandas and spark datasets. popmon creates histograms of features binned in time-slices, and compares the stability of the profiles and distributions of those histograms using statistical tests, both over time and with respect to a reference. It works with numerical, ordinal, categorical features, and the histograms can be higher-dimensional, e.g. it can also track correlations between any two features. popmon can automatically flag and alert on changes observed over time, such as trends, shifts, peaks, outliers, anomalies, changing correlations, etc, using monitoring business rules. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    whylogs

    whylogs

    The open standard for data logging

    whylogs is an open-source library for logging any kind of data. With whylogs, users are able to generate summaries of their datasets (called whylogs profiles) which they can use to track changes in their dataset Create data constraints to know whether their data looks the way it should. Quickly visualize key summary statistics about their datasets. whylogs profiles are the core of the whylogs library. They capture key statistical properties of data, such as the distribution (far beyond...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    AutoViz

    AutoViz

    Automatically Visualize any dataset, any size

    ...AutoViz supports a wide range of visualization types including scatter plots, histograms, bar charts, and correlation plots, making it suitable for analyzing both structured and large datasets. The system also includes built-in tools for evaluating data quality and identifying potential issues such as missing values or unusual distributions. By automating the visualization process, AutoViz allows users to rapidly explore datasets before applying machine learning models or statistical analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    see

    see

    Visualisation toolbox for beautiful and publication-ready figures

    see is an R package that serves as the visualization component of the easystats ecosystem, providing plotting utilities to produce publication-ready visualizations of statistical model parameters, diagnostics, predictions, and performance metrics. It works in conjunction with other easystats packages (such as parameters, performance, modelbased, bayestestR, etc.) to convert model outputs or summary objects into visual forms (dot-and-whisker plots, diagnostic plots, residual plots, etc.). It...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    NeuralForecast

    NeuralForecast

    Scalable and user friendly neural forecasting algorithms.

    NeuralForecast offers a large collection of neural forecasting models focusing on their performance, usability, and robustness. The models range from classic networks like RNNs to the latest transformers: MLP, LSTM, GRU, RNN, TCN, TimesNet, BiTCN, DeepAR, NBEATS, NBEATSx, NHITS, TiDE, DeepNPTS, TSMixer, TSMixerx, MLPMultivariate, DLinear, NLinear, TFT, Informer, AutoFormer, FedFormer, PatchTST, iTransformer, StemGNN, and TimeLLM. There is a shared belief in Neural forecasting methods'...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    PyMC3

    PyMC3

    Probabilistic programming in Python

    ...Fit your model using gradient-based MCMC algorithms like NUTS, using ADVI for fast approximate inference — including minibatch-ADVI for scaling to large datasets, or using Gaussian processes to build Bayesian nonparametric models. PyMC3 includes a comprehensive set of pre-defined statistical distributions that can be used as model building blocks. Sometimes an unknown parameter or variable in a model is not a scalar value or a fixed-length vector, but a function. A Gaussian process (GP) can be used as a prior probability distribution whose support is over the space of continuous functions. PyMC3 provides rich support for defining and using GPs. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    SPPAS

    SPPAS

    SPPAS - the automatic annotation and analyses of speech

    ...SPPAS is able to produce automatically speech annotations from a recorded speech sound and its orthographic transcription. SPPAS is helpful for the analysis of any annotated data: estimate statistical distributions, make requests, manage files, visualize annotations. SPPAS offers a file converter from/to a wide range of formats: xra, TextGrid, eaf, trs... <https://sppas.org>
    Downloads: 30 This Week
    Last Update:
    See Project
  • 20
    GrainSizeTools script

    GrainSizeTools script

    A Python script for estimating the grain size from thin sections

    Homepage & docs: http://marcoalopez.github.io/GrainSizeTools/ GrainSizeTools is a free, open-source, cross-platform script written in Python that provides several tools for (1) estimating average grain size in polycrystalline materials, (2) characterizing the nature of the distribution of grain sizes (either from apparent distributions or approximating 3D grain size distributions via stereology), and estimating differential stress via paleopizometers. The script requires as the input the...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    Statistics101 - Resampling Statistics

    Statistics101 - Resampling Statistics

    Use simulation to perform statistical analyses.

    Statistics101 is an Integrated Development Environment (IDE) that uses a simple, powerful language called “Resampling Stats” to develop Monte Carlo programs to analyze and solve statistical problems. The original Resampling Stats language and computer program were developed by Dr. Julian Simon and Peter Bruce as a new way to teach Statistics to social science students. Of course, social science students aren't the only ones who can benefit. Anyone who wants to learn statistics will find that...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22

    WhisperJAV

    A subtitle generator for Japanese Adult Videos.

    ...Transformer-based ASR architectures like Whisper suffer significant performance degradation when applied to the spontaneous and noisy domain of JAV. This degradation is driven by specific acoustic and temporal characteristics that defy the statistical distributions of standard training data.
    Leader badge
    Downloads: 39 This Week
    Last Update:
    See Project
  • 23
    WP 34s

    WP 34s

    Scientific/engineering firmware repurposing HP business calculators!

    This project has created scientific firmware for the HP-20b and HP-30b business calculators. WP 34S turns either of these calculators into a powerful keystroke programmable scientific device. According to our customers, it's the most powerful and fastest RPN scientific pocket calculator ever built. WP 34S is alive and stable since 2011. We have succeeded in satisfying the most picky users - read about their experiences on http://www.hpmuseum.org/forum/forum-8.html. Since 2014, WP...
    Leader badge
    Downloads: 35 This Week
    Last Update:
    See Project
  • 24
    ME.com

    ME.com

    Mathematical & engineering components for software development

    The truth can't be complicated. Mathematical & engineering components (abbreviation ME . com) are the comprehensive collection of algorithms for scientific computing. Registered 2003-03-21 with the Federal Service for Intellectual Property. Completely free. No donation required. Comes with absolutely no warranty. Not for clinical use. This software is not certified as a medical device and should not be used for diagnostic or treatment purposes. Rating: Everyone (All ages). NOTICE FOR EU...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Parallel and Distributed Process System

    Parallel and Distributed Process System

    NOTICE OF CONSOLIDATION & PARTNERSHIP PENDING As of April 2026, the 20

    NOTICE OF CONSOLIDATION & PARTNERSHIP PENDING As of April 2026, the 20 pipelines of the QCAUS/PDPBioGen suites are undergoing consolidation for high-scale institutional research. Core 'Ford 2026' algorithms remain the proprietary IP of the Ford Peace and Justice Foundation. Academic users at partner institutions are currently performing validation; all other commercial inquiries must contact the author Computational Neuroscience: Large-scale neural population dynamics, brain-inspired...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB