Showing 909 open source projects for "statistical"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Statistical Rethinking 2024

    Statistical Rethinking 2024

    This course teaches data analysis

    ...This version is designed for students following the 2024 lecture series, offering the most current set of examples, exercises, and teaching material aligned with the Statistical Rethinking framework. Online, flipped instruction. I will pre-record the lectures each week. We'll meet online once a week for an hour to discuss the material. The discussion time (3-4pm Berlin Time) should allow people in the Americas to join in their morning.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    ggstatsplot

    ggstatsplot

    Enhancing {ggplot2} plots with statistical analysis

    ...The central idea of {ggstatsplot} is simple: combine these two phases into one in the form of graphics with statistical details, which makes data exploration simpler and faster. Summary of statistical tests and effect sizes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    statsmodels

    statsmodels

    Statsmodels, statistical modeling and econometrics in Python

    statsmodels is a Python module that provides classes and functions for the estimation of many different statistical models, as well as for conducting statistical tests, and statistical data exploration. An extensive list of result statistics are available for each estimator. The results are tested against existing statistical packages to ensure that they are correct. The package is released under the open source Modified BSD (3-clause) license.
    Downloads: 8 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    stdlib

    stdlib

    Standard library for JavaScript and Node.js

    A standard library for javascript and node.js. High performance, rigorous, and robust mathematical and statistical functions. Build advanced statistical models and machine learning libraries. Plotting and graphics functionality for data visualization and exploratory data analysis. Analyze and understand your data. Comprehensively tested utilities for application and library development. Functions to assert, group, filter, map, pluck, and transform your data both in browsers and on the server. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 6
    Criterium

    Criterium

    Benchmarking library for clojure

    Criterium is a robust benchmarking library for Clojure that addresses common statistical and JIT-related issues. It provides accurate timings through warm-up, garbage collection control, and statistical summaries—making microbenchmarking more reliable than using time. Statistical processing of multiple evaluations. Inclusion of a warm-up period, designed to allow the JIT compiler to optimise its code. Purging of gc before testing, to isolate timings from GC state prior to testing. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Mixed-effects models in Julia

    Mixed-effects models in Julia

    A Julia package for fitting (statistical) mixed-effects models

    This package defines linear mixed models (LinearMixedModel) and generalized linear mixed models (GeneralizedLinearMixedModel). Users can use the abstraction for statistical model API to build, fit (fit/fit!), and query the fitted models. A mixed-effects model is a statistical model for a response variable as a function of one or more covariates. For a categorical covariate the coefficients associated with the levels of the covariate are sometimes called effects, as in "the effect of using Treatment 1 versus the placebo". ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    Book5_Essentials-Probability-Statistics

    Book5_Essentials-Probability-Statistics

    The book 5 of statistics in simplicity

    Book5_Essentials-of-Probability-and-Statistics is a Visualize-ML educational volume that introduces the statistical and probabilistic concepts underpinning modern data analysis and machine learning. The repository explains topics such as distributions, sampling, inference, and uncertainty using visual demonstrations and intuitive narratives. Its teaching philosophy prioritizes conceptual clarity over heavy formalism, making statistical thinking more approachable for beginners. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    GLM.jl

    GLM.jl

    Generalized linear models in Julia

    GLM.jl is a Julia package for fitting linear and generalized linear models (GLMs) with a syntax and functionality familiar to users of R or other statistical environments. It is part of the JuliaStats ecosystem and is tightly integrated with StatsModels.jl for formula handling, and Distributions.jl for specifying error families. The package supports modeling through both formula-based (e.g. @formula) and matrix-based interfaces, allowing both high-level convenience and low-level control. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 10
    webR

    webR

    The statistical language R compiled to WebAssembly via Emscripten

    ...It supports installing and running R packages, making it possible to perform data analysis, statistical modeling, and visualization entirely client-side. webR also provides distribution options such as npm packages, CDN hosting, and Docker images for flexible deployment. While it currently includes a minimal set of compiled libraries, it is designed to expand its ecosystem over time.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    pmdarima

    pmdarima

    Statistical library designed to fill the void in Python's time series

    A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Synthetic Data Generator

    Synthetic Data Generator

    SDG is a specialized framework

    ...The system supports multiple generation methods including statistical models, generative adversarial networks, and large language model–based synthesis. It also includes a data processing module capable of handling different data types, preprocessing columns, managing missing values, and converting formats automatically before model training.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 13
    spaCy

    spaCy

    Industrial-strength Natural Language Processing (NLP)

    spaCy is a library built on the very latest research for advanced Natural Language Processing (NLP) in Python and Cython. Since its inception it was designed to be used for real world applications-- for building real products and gathering real insights. It comes with pretrained statistical models and word vectors, convolutional neural network models, easy deep learning integration and so much more. spaCy is the fastest syntactic parser in the world according to independent benchmarks, with an accuracy within 1% of the best available. It's blazing fast, easy to install and comes with a simple and productive API.
    Downloads: 74 This Week
    Last Update:
    See Project
  • 14
    DataFrame

    DataFrame

    C++ DataFrame for statistical, Financial, and ML analysis

    ...For example, you would compare this to Pandas, R data.frame, or Polars. You can slice the data in many different ways. You can join, merge, and group-by the data. You can run various statistical, summarization, financial, and ML algorithms on the data. You can add your custom algorithms easily. You can multi-column sort, custom pick, and delete the data. DataFrame also includes a large collection of analytical algorithms in the form of visitors. These are from basic stats such as Mean, and Std Deviation and return, … to more involved analysis such as Affinity Propagation, Polynomial Fit, and Fast Fourier transform of arbitrary length … including a good collection of trading indicators. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 15
    i-Educar

    i-Educar

    Launching the most free educational software in Brazil

    Accessible from anywhere and with single student registration available for the entire education network. Time-saving for everyone. Get current quantitative, financial and statistical data on all processes, at the time and place you want. Evaluation system and reports adapted to the different realities of the country, with numerical, conceptual or descriptive evaluation notes. Management of allocations, removals, substitutions, absences and delays, offering an integrated view of all professionals. Time frame management for analysis of demands and availability of professionals in the education network in each school period. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 16
    hyperfine

    hyperfine

    A command-line benchmarking tool

    A command-line benchmarking tool. Statistical analysis across multiple runs. Support for arbitrary shell commands. Constant feedback about the benchmark progress and current estimates. Warmup runs can be executed before the actual benchmark. Cache-clearing commands can be set up before each timing run. Statistical outlier detection to detect interference from other programs and caching effects.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 17
    broom

    broom

    Convert statistical analysis objects from R into tidy format

    broom is part of the tidymodels ecosystem that converts statistical model outputs (e.g. from lm, glm, t.test, lme4, etc.) into tidy tibbles — standardized data frames — using functions tidy(), glance(), and augment(). These are easier to manipulate, visualize, and report programmatically.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    RCall.jl

    RCall.jl

    Call R from Julia

    R is a language for statistical computing and graphics that has been around for a couple of decades and it has one of the most impressive collections of scientific and statistical packages of any environment. Recently, the Julia language has become an attractive alternative because it provides the remarkable performance of a low-level language without sacrificing the readability and ease of use of high-level languages.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    StatsForecast

    StatsForecast

    Fast forecasting with statistical and econometric models

    StatsForecast is a Python library for time-series forecasting that delivers a suite of classical statistical and econometric forecasting models optimized for high performance and scalability. It is designed not just for academic experiments but for production-level time-series forecasting, meaning it handles forecasting for many series at once, efficiently, reliably, and with minimal overhead. The library implements a broad set of models, including AutoARIMA, ETS, CES, Theta, plus a battery of benchmarking and baseline methods, giving users flexibility in selecting forecasting approaches depending on data characteristics (trend, seasonality, intermittent demand, etc.). ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    PaperBanana

    PaperBanana

    Extension of Google Research’s PaperBanana

    PaperBanana is an open-source agentic framework designed to automatically generate publication-quality academic diagrams and statistical plots directly from text descriptions. The project focuses on helping researchers, educators, and data scientists transform conceptual descriptions of figures into structured visual outputs suitable for research papers, presentations, and technical reports. Instead of manually designing charts or diagrams using traditional visualization tools, users can describe the desired figure in natural language and allow the system to generate the visual representation automatically. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    G2

    G2

    Interactive data-driven visualization grammar for statistical charts

    G2 is a highly interactive data-driven visualization grammar for statistical charts. with a high level of usability and scalability. It provides a set of grammar, and takes users beyond a limited set of charts to an almost unlimited world of graphical forms. With G2, you can describe the visual appearance and interactive behavior of visualization just by one statement, and generate web-based views using Canvas or SVG.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 22
    Smile

    Smile

    Statistical machine intelligence and learning engine

    Smile is a fast and comprehensive machine learning engine. With advanced data structures and algorithms, Smile delivers the state-of-art performance. Compared to this third-party benchmark, Smile outperforms R, Python, Spark, H2O, xgboost significantly. Smile is a couple of times faster than the closest competitor. The memory usage is also very efficient. If we can train advanced machine learning models on a PC, why buy a cluster? Write applications quickly in Java, Scala, or any JVM...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 23
    NVIDIA Earth2Studio

    NVIDIA Earth2Studio

    Open-source deep-learning framework

    ...The toolkit makes it easy to run deterministic and ensemble forecasts, swap models interchangeably, and process large geophysical datasets with Xarray structures, enabling experimentation with state-of-the-art deep learning models for climate and atmospheric prediction. Users can extend Earth2Studio with optional model packs, advanced data interfaces, statistical operators, and backend integrations that support flexible workflows from simple tests to large-scale operational inference.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 24
    plotly.js

    plotly.js

    JavaScript charting library behind Plotly and Dash

    Plotly JavaScript Open Source Graphing Library. Built on top of d3.js and stack.gl, Plotly.js is a high-level, declarative charting library. plotly.js ships with over 40 chart types, including 3D charts, statistical graphs, and SVG maps. plotly.js is free and open source and you can view the source, report issues or contribute on GitHub. For plotly.js to build with Webpack you will need to install ify-loader@v1.1.0+ and add it to your webpack.config.json. This adds Browserify transform compatibility to Webpack which is necessary for some plotly.js dependencies. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 25
    collapse

    collapse

    Advanced and Fast Data Transformation in R

    collapse is a high-performance R package designed for fast and efficient data transformation, aggregation, reshaping, and statistical computation. Built to offer a more performant alternative to dplyr and data.table, it is particularly well-suited for large datasets and econometric applications. It operates on base R data structures like data frames and vectors and uses highly optimized C++ code under the hood to deliver significant speed improvements. collapse also includes tools for grouped operations, weighted statistics, and time series manipulation, making it a compact yet powerful utility for data scientists and researchers working in R.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB