Showing 106 open source projects for "statistical"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    MTBook

    MTBook

    Machine Translation: Foundations and Models

    This is a tutorial, the purpose is to introduce the basic knowledge and modeling methods of machine translation systematically, and on this basis, discuss some cutting-edge technologies of machine translation (formerly known as "Machine Translation: Statistical Modeling and Deep Learning") method"). Its content is compiled into a book, which can be used for the study of senior undergraduates and graduate students in computer and artificial intelligence related majors, and can also be used as reference material for researchers related to natural language processing, especially machine translation. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    libpostal

    libpostal

    A C library for parsing/normalizing street addresses around the world

    A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data. libpostal is a C library for parsing/normalizing street addresses around the world using statistical NLP and open data. The goal of this project is to understand location-based strings in every language, everywhere. Addresses and the locations they represent are essential for any application dealing with maps (place search, transportation, on-demand/delivery services, check-ins, reviews). ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Pipeline for training Language Models

    Pipeline for training Language Models

    Pipeline for training Language Models using PyTorch.

    Pipeline for training Language Models using PyTorch. Inspired by Yandex Data School NLP Course (week 03: Language Modeling) Prepared text file with space-separated words on each line.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Eiten

    Eiten

    Statistical and Algorithmic Investing Strategies for Everyone

    Eiten is an open-source Python project focused on providing statistical and algorithmic trading strategies powered by data analysis and machine learning techniques. It is designed to make quantitative investing more accessible by offering ready-to-use strategies that analyze market behavior, detect patterns, and generate actionable insights. The project includes tools for evaluating stock performance, identifying trends, and applying algorithmic models to financial data, enabling users to experiment with different investment approaches. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    Machine Learning Mindmap

    Machine Learning Mindmap

    A mindmap summarising Machine Learning concepts

    ...The project organizes a wide range of machine learning topics into an interconnected diagram that helps learners understand how concepts relate to one another across the broader field of artificial intelligence. The mind map covers fundamental areas such as data preprocessing, statistical analysis, supervised learning, unsupervised learning, reinforcement learning, and deep learning architectures. By arranging these concepts visually, the repository allows students and practitioners to quickly explore the relationships between algorithms, techniques, and modeling approaches used in modern machine learning workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    NeuralCoref

    NeuralCoref

    Fast Coreference Resolution in spaCy with Neural Networks

    ...For a brief introduction to coreference resolution and NeuralCoref, please refer to our blog post. NeuralCoref is written in Python/Cython and comes with a pre-trained statistical model for English only. NeuralCoref is accompanied by a visualization client NeuralCoref-Viz, a web interface powered by a REST server that can be tried online.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    automl-gs

    automl-gs

    Provide an input CSV and a target field to predict, generate a model

    ...No black box: you can see exactly how the data is processed, and how the model is constructed, and you can make tweaks as necessary. automl-gs is an AutoML tool which, unlike Microsoft's NNI, Uber's Ludwig, and TPOT, offers a zero code/model definition interface to getting an optimized model and data transformation pipeline in multiple popular ML/DL frameworks, with minimal Python dependencies (pandas + scikit-learn + your framework of choice). automl-gs is designed for citizen data scientists and engineers without a deep statistical background under the philosophy that you don't need to know any modern data preprocessing and machine learning engineering techniques to create a powerful prediction workflow.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    spark-ml-source-analysis

    spark-ml-source-analysis

    Spark ml algorithm principle analysis and specific source code

    spark-ml-source-analysis is a technical repository that analyzes the internal implementation of machine learning algorithms within Apache Spark’s MLlib library. The project aims to help developers and data scientists understand how distributed machine learning algorithms are implemented and optimized inside the Spark ecosystem. Instead of providing a runnable software system, the repository focuses on explaining algorithm principles and examining the underlying source code used in Spark’s...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    DS-Take-Home

    DS-Take-Home

    Solution to the book A Collection of Data Science Take-Home Challenge

    DS-Take-Home is a repository that provides practical solutions to a series of real-world data science challenges inspired by the book A Collection of Data Science Take-Home Challenges. The project is designed as a learning resource where aspiring data scientists can study how typical industry-style take-home assignments are solved using data analysis and machine learning techniques. Each challenge is implemented in a separate Jupyter notebook that walks through the process of analyzing...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    newLISP for BSDs, LINUX, MacOS X, SunOS and Win32: small, fast 350+ functions, a -C-, MySQL, PostgreSQL, SQLite, ODBC, TCP/IP, UDP, XML, Java interface, string processing, regular expressions , math, financial, statistical functions, Win32 DLL
    Leader badge
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    Lihang

    Lihang

    Statistical learning methods (2nd edition) [Li Hang]

    Lihang is an open-source repository that provides educational notes, mathematical derivations, and code implementations based on the book Statistical Learning Methods by Li Hang. The repository aims to help readers understand the theoretical foundations of machine learning algorithms through practical implementations and detailed explanations. It includes notebooks and scripts that demonstrate how key algorithms such as perceptrons, decision trees, logistic regression, support vector machines, and hidden Markov models work in practice. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    Presage

    the intelligent predictive text entry platform

    ...Presage computes probabilities for words which are most likely to be entered next by merging predictions generated by the different predictive algorithms. Presage's modular and extensible architecture allows its language model to be extended and customized to utilize statistical, syntactic, and semantic predictive algorithms. Presage's predictive capabilities are implemented by predictive plugins. Predictive plugins use services provided by the platform to implement multiple prediction techniques.
    Leader badge
    Downloads: 231 This Week
    Last Update:
    See Project
  • 13
    VOSM
    2D Statistical Models. Building: shape model, texture model and concatenated appearance model; Fitting: 1D profile ASM, 2D profile ASM, direct local texture constrained(LTC) ASM, basic AAM, ICIA AAM, IAIA AAM, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    DeepLearn

    DeepLearn

    Implementation of research papers on Deep Learning+ NLP+ CV in Python

    Welcome to DeepLearn. This repository contains an implementation of the following research papers on NLP, CV, ML, and deep learning. The required dependencies are mentioned in requirement.txt. I will also use dl-text modules for preparing the datasets. If you haven't use it, please do have a quick look at it. CV, transfer learning, representation learning.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Accord.NET Framework

    Accord.NET Framework

    Scientific computing, machine learning and computer vision for .NET

    The Accord.NET Framework provides machine learning, mathematics, statistics, computer vision, computer audition, and several scientific computing related methods and techniques to .NET. The project is compatible with the .NET Framework. NET Standard, .NET Core, and Mono.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16

    BioRec:Bird Census field data annotation

    Recognizing biological data from a notebook.

    ...Namely, bird census based on personal inspection or small (~10 km^2) regions with recording birds' position and behaviour on paper. This project makes it easy to annotate such field data and to make this data available for statistical analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    cbrTekStraktor

    an application to automatically extract text from comic books.

    ...The application also enables to manually define text areas in CBR files. The application comprises a simple graphical editor for further processing the extracted text. The text extraction is achieved by a combination of statistical and graphical processing operations. It is based on the following 3 major algorithms - Binarization of color images (Niblak and other methods) - Connected components - K-Means clustering Apache Tesseract is used to perform Optical Character Recognition on the extracted text. A subsequent version of the application will integrate with translation software in order to provide automated translation of comic book texts and re-inserion of translated texts
    Downloads: 11 This Week
    Last Update:
    See Project
  • 18
    ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for Chinese and English. Currently support word segmentation, POS tagging, dependency and phrase-structure parsing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Phrasal

    Phrasal

    Statistical phrase-based machine translation system

    Stanford Phrasal is a state-of-the-art statistical phrase-based machine translation system, written in Java. At its core, it provides much the same functionality as the core of Moses. Distinctive features include: providing an easy to use API for implementing new decoding model features, the ability to translating using phrases that include gaps (Galley et al. 2010), and conditional extraction of phrase-tables and lexical reordering models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Adaptive Gaussian Filtering

    Adaptive Gaussian Filtering

    Machine learning with Gaussian kernels.

    Libagf is a machine learning library that includes adaptive kernel density estimators using Gaussian kernels and k-nearest neighbours. Operations include statistical classification, interpolation/non-linear regression and pdf estimation. For statistical classification there is a borders training feature for creating fast and general pre-trained models that nonetheless return the conditional probabilities. Libagf also includes clustering algorithms as well as comparison and validation routines. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    The Thot toolkit repository has moved to http://daormar.github.io/thot/ Thot is a toolkit for statistical machine translation. The new Thot toolkit includes fully automatic and interactive machine translation, incremental training of statistical models, parallel estimation, ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    Natural Language Analysis with Ngrams

    NLP tool for statistical analysis of words, sentences, documents

    Goal of this project is to have a NLP tool that would give statistical analysis results based on Google Ngram data. Furthermore, it is now just a NetBeans project without a final JAR. Furthermore, there will be a github version for anyone who wishes to contribute. In the future versions, user will be able to convert a single word to numerical data, to be able to compare two words and get the comparison data, and to be able to do the same for the sentences, paragraphs and documents. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    Chordalysis

    Log-linear analysis (data modelling) for high-dimensional data

    ===== Project moved to https://github.com/fpetitjean/Chordalysis ===== Log-linear analysis is the statistical method used to capture multi-way relationships between variables. However, due to its exponential nature, previous approaches did not allow scale-up to more than a dozen variables. We present here Chordalysis, a log-linear analysis method for big data. Chordalysis exploits recent discoveries in graph theory by representing complex models as compositions of triangular structures, also known as chordal graphs. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    DeSR is a multilingual statistical dependency parser. It produces dependency parse trees for natural language sentences using a parsing model learned from annotated corpora.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    A python module that provides algorithms for advanced search - basically all you need to build a search engine.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB