52 projects for "statistical" with 2 filters applied:

  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 1
    Synthetic Data Generator

    Synthetic Data Generator

    SDG is a specialized framework

    ...The system supports multiple generation methods including statistical models, generative adversarial networks, and large language model–based synthesis. It also includes a data processing module capable of handling different data types, preprocessing columns, managing missing values, and converting formats automatically before model training.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 2
    NVIDIA Earth2Studio

    NVIDIA Earth2Studio

    Open-source deep-learning framework

    ...The toolkit makes it easy to run deterministic and ensemble forecasts, swap models interchangeably, and process large geophysical datasets with Xarray structures, enabling experimentation with state-of-the-art deep learning models for climate and atmospheric prediction. Users can extend Earth2Studio with optional model packs, advanced data interfaces, statistical operators, and backend integrations that support flexible workflows from simple tests to large-scale operational inference.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    PaperBanana

    PaperBanana

    Extension of Google Research’s PaperBanana

    PaperBanana is an open-source agentic framework designed to automatically generate publication-quality academic diagrams and statistical plots directly from text descriptions. The project focuses on helping researchers, educators, and data scientists transform conceptual descriptions of figures into structured visual outputs suitable for research papers, presentations, and technical reports. Instead of manually designing charts or diagrams using traditional visualization tools, users can describe the desired figure in natural language and allow the system to generate the visual representation automatically. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    MEDIUM_NoteBook

    MEDIUM_NoteBook

    Repository containing notebooks of my posts on Medium

    ...Each notebook typically focuses on explaining a specific concept through step-by-step examples that combine explanatory text, code, and visual outputs. The repository covers a wide variety of data science topics such as predictive modeling, data preprocessing, statistical analysis, and feature engineering. Because the notebooks are designed as educational materials, they often emphasize readability and reproducibility so that readers can easily run and modify the examples. The project is useful for learners who want to explore machine learning concepts interactively using Python and common data science libraries.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 5
    Natural Language Toolkit
    The Natural Language Toolkit (NLTK) is a widely used open-source Python library designed for working with human language data and building natural language processing (NLP) applications. It provides a comprehensive suite of modules, datasets, and tutorials that support both symbolic and statistical approaches to language processing. The toolkit includes implementations of many foundational NLP algorithms and utilities, enabling developers to perform tasks such as tokenization, stemming, parsing, classification, and semantic reasoning. NLTK was originally developed to support research and teaching in computational linguistics and artificial intelligence, and it has become one of the most influential educational platforms for learning NLP in Python. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    NBA Sports Betting Machine Learning

    NBA Sports Betting Machine Learning

    NBA sports betting using machine learning

    ...Machine learning models are then trained to estimate the probability that a team will win a game as well as whether the total score will fall above or below the sportsbook’s predicted total. In addition to predicting outcomes, the project evaluates expected value to determine whether a potential bet offers a statistical advantage compared with sportsbook odds.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 7
    Quantitative Trading System

    Quantitative Trading System

    A comprehensive quantitative trading system with AI-powered analysis

    ...The project is designed to provide an end-to-end infrastructure for building and operating algorithmic trading strategies in financial markets. It includes tools for collecting and processing market data from multiple sources, performing statistical and machine learning analysis, and generating trading signals based on quantitative models. The system supports real-time data streaming, allowing strategies to respond to market conditions as they evolve. QuantMuse also incorporates advanced risk management features, including portfolio monitoring, risk limits, and dynamic position sizing to control exposure.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    AI Marketing Skills

    AI Marketing Skills

    Open-source AI marketing skills for Claude Code

    AI Marketing Skills is a comprehensive open-source framework designed to transform AI agents into fully operational marketing and sales systems by equipping them with structured, reusable “skills” that automate real business workflows. Instead of simple prompts, the project provides complete operational modules that include scripts, scoring systems, and decision-making logic, allowing AI tools like Claude Code to execute complex marketing tasks end-to-end. The system is organized into...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    mlforecast

    mlforecast

    Scalable machine learning for time series forecasting

    mlforecast is a time-series forecasting framework built around machine-learning models, designed to make forecasting both efficient and scalable. It lets you apply any regressor that follows the typical scikit-learn API, for example, gradient-boosted trees or linear models, to time-series data by automating much of the messy feature engineering and data preparation. Instead of writing custom code to build lagged features, rolling statistics, and date-based predictors, mlforecast generates...
    Downloads: 7 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    Python Programming Hub

    Python Programming Hub

    Learn Python and Machine Learning from scratch

    ...Many sections are implemented as Jupyter notebooks, allowing learners to run code interactively while reading explanations of the concepts involved. The repository emphasizes hands-on learning by demonstrating real programming tasks such as data manipulation, statistical analysis, visualization, and automation. It also includes examples of commonly used libraries such as NumPy, Pandas, and other tools used in data science workflows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    AutoViz

    AutoViz

    Automatically Visualize any dataset, any size

    ...The system also includes built-in tools for evaluating data quality and identifying potential issues such as missing values or unusual distributions. By automating the visualization process, AutoViz allows users to rapidly explore datasets before applying machine learning models or statistical analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    stkpp

    stkpp

    C++ Statistical ToolKit

    STK++ (http://www.stkpp.org) is a versatile, fast, reliable and elegant collection of C++ classes for statistics, clustering, linear algebra, arrays (with an Eigen-like API), regression, dimension reduction, etc. Some functionalities provided by the library are available in the R environment as R functions (http://cran.at.r-project.org/web/packages/rtkore/index.html). At a convenience, we propose the source packages on sourceforge. The library offers a dense set of (mostly) template...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    sketch

    sketch

    AI code-writing assistant that understands data content

    Sketch is an open-source AI-powered data analysis assistant designed specifically for pandas users, enabling natural language interaction with tabular datasets to generate code, insights, and transformations. It works by summarizing the structure and statistical properties of a dataset and providing that context to a language model, allowing it to generate highly relevant and accurate responses tailored to the data. The tool integrates directly into pandas dataframes through an extension, making it easy to use within existing Python workflows without requiring additional IDE plugins. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    pattern_classification

    pattern_classification

    A collection of tutorials and examples for solving machine learning

    The pattern_classification repository is an educational project that provides tutorials, examples, and reference materials related to machine learning and statistical pattern recognition. The project aims to help learners understand the process of building predictive models by presenting structured explanations and practical examples. It includes notebooks and guides that demonstrate data preprocessing, feature extraction, model training, and evaluation techniques used in machine learning workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    QuantResearch

    QuantResearch

    Quantitative analysis, strategies and backtests

    ...The project integrates machine learning methods with traditional quantitative finance models, illustrating how statistical techniques can be applied to asset management and trading.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Complete Machine Learning Package

    Complete Machine Learning Package

    A comprehensive machine learning repository containing 30+ notebooks

    Complete Machine Learning Package repository is a comprehensive educational collection of machine learning notebooks designed to teach core data science and AI concepts through practical coding examples. The project includes more than thirty notebooks that cover a wide range of topics including data analysis, statistical modeling, neural networks, and deep learning. Each notebook introduces theoretical ideas and then demonstrates how to implement them using Python libraries commonly used in data science, such as NumPy, pandas, scikit-learn, and TensorFlow. The repository also includes examples related to natural language processing, computer vision, and data visualization, giving learners exposure to several subfields of machine learning. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    fe4ml-zh

    fe4ml-zh

    Feature Engineering for Machine Learning

    ...The project explains techniques for creating, selecting, and transforming features in ways that improve model accuracy and robustness. It also discusses the role of domain knowledge, data preprocessing, and statistical reasoning in building effective machine learning models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Machine Learning Git Codebook

    Machine Learning Git Codebook

    For extensive instructor led learning

    ...The repository organizes these topics into sequential notebooks that explain theoretical concepts while allowing users to experiment directly with code. Many lessons emphasize hands-on exercises where learners analyze datasets, implement algorithms, and evaluate results through visualizations and statistical metrics.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    ISLR-python

    ISLR-python

    An Introduction to Statistical Learning

    ISLR-python is an educational repository that provides Python implementations and notebooks corresponding to examples and exercises from the book An Introduction to Statistical Learning. The project recreates tables, figures, and laboratory exercises originally presented in the book so that readers can explore the concepts using Python rather than the original R environment. The repository includes Jupyter notebooks demonstrating statistical learning methods such as linear regression, classification algorithms, resampling methods, and model evaluation techniques. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Pattern Recognition and Machine Learning

    Pattern Recognition and Machine Learning

    Repository of notes, code and notebooks in Python

    ...The project recreates many of the mathematical concepts and diagrams from the book using executable Jupyter notebooks, allowing readers to experiment directly with the algorithms described in the text. Each section of the repository corresponds to chapters in the book and includes code examples that demonstrate statistical modeling, machine learning methods, and Bayesian inference techniques. These notebooks provide visualizations and computational demonstrations that help clarify complex topics such as probabilistic models, neural networks, kernel methods, and graphical models. The repository also includes implementations of sampling methods, clustering algorithms, and dimensionality reduction techniques used throughout machine learning research.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Machine Learning in Asset Management

    Machine Learning in Asset Management

    Machine Learning in Asset Management

    ...The repository also includes references to academic research, tutorials, and datasets that help users understand how machine learning can enhance traditional investment strategies. Many of the experiments focus on applying supervised learning, reinforcement learning, and statistical modeling techniques to financial data. By combining theory, research papers, and practical implementations, the repository functions as both a learning platform and a research resource for quantitative finance.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Eiten

    Eiten

    Statistical and Algorithmic Investing Strategies for Everyone

    Eiten is an open-source Python project focused on providing statistical and algorithmic trading strategies powered by data analysis and machine learning techniques. It is designed to make quantitative investing more accessible by offering ready-to-use strategies that analyze market behavior, detect patterns, and generate actionable insights. The project includes tools for evaluating stock performance, identifying trends, and applying algorithmic models to financial data, enabling users to experiment with different investment approaches. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Machine Learning Mindmap

    Machine Learning Mindmap

    A mindmap summarising Machine Learning concepts

    ...The project organizes a wide range of machine learning topics into an interconnected diagram that helps learners understand how concepts relate to one another across the broader field of artificial intelligence. The mind map covers fundamental areas such as data preprocessing, statistical analysis, supervised learning, unsupervised learning, reinforcement learning, and deep learning architectures. By arranging these concepts visually, the repository allows students and practitioners to quickly explore the relationships between algorithms, techniques, and modeling approaches used in modern machine learning workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    spark-ml-source-analysis

    spark-ml-source-analysis

    Spark ml algorithm principle analysis and specific source code

    spark-ml-source-analysis is a technical repository that analyzes the internal implementation of machine learning algorithms within Apache Spark’s MLlib library. The project aims to help developers and data scientists understand how distributed machine learning algorithms are implemented and optimized inside the Spark ecosystem. Instead of providing a runnable software system, the repository focuses on explaining algorithm principles and examining the underlying source code used in Spark’s...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    DS-Take-Home

    DS-Take-Home

    Solution to the book A Collection of Data Science Take-Home Challenge

    DS-Take-Home is a repository that provides practical solutions to a series of real-world data science challenges inspired by the book A Collection of Data Science Take-Home Challenges. The project is designed as a learning resource where aspiring data scientists can study how typical industry-style take-home assignments are solved using data analysis and machine learning techniques. Each challenge is implemented in a separate Jupyter notebook that walks through the process of analyzing...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB