Showing 580 open source projects for "ml"

View related business solutions
  • 8 Monitoring Tools in One APM. Install in 5 Minutes. Icon
    8 Monitoring Tools in One APM. Install in 5 Minutes.

    Errors, performance, logs, uptime, hosts, anomalies, dashboards, and check-ins. One interface.

    AppSignal works out of the box for Ruby, Elixir, Node.js, Python, and more. 30-day free trial, no credit card required.
    Start Free
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    CoreNet

    CoreNet

    CoreNet: A library for training deep neural networks

    ...Its distributed runtime manages synchronization, load balancing, and mixed-precision computation to maximize throughput while minimizing communication bottlenecks. CoreNet integrates tightly with Apple’s proprietary ML stack and hardware, serving as the foundation for research in computer vision, language models, and multimodal systems within Apple AI. The framework includes monitoring tools, fault tolerance mechanisms, and efficient checkpointing for massive training runs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Practical Machine Learning with Python

    Practical Machine Learning with Python

    Master the essential skills needed to recognize and solve problems

    Practical Machine Learning with Python is a comprehensive repository built to accompany a project-centered guide for applying machine learning techniques to real-world problems using Python’s mature data science ecosystem. It centralizes example code, datasets, model pipelines, and explanatory notebooks that teach users how to approach problems from data ingestion and cleaning all the way through feature engineering, model selection, evaluation, tuning, and production-ready deployment...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Zerox OCR

    Zerox OCR

    PDF to Markdown with vision models

    A dead simple way of OCR-ing a document for AI ingestion. Documents are meant to be a visual representation after all. With weird layouts, tables, charts, etc. The vision models just make sense. ZeroX is an open-source machine learning framework designed for fast experimentation and production deployment, optimized for speed and ease of use.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    TensorFlow Datasets

    TensorFlow Datasets

    TFDS is a collection of datasets ready to use with TensorFlow,

    TensorFlow Datasets is a collection of datasets ready to use, with TensorFlow or other Python ML frameworks, such as Jax. All datasets are exposed as tf.data. Datasets , enabling easy-to-use and high-performance input pipelines. To get started see the guide and our list of datasets.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 5
    Groq Python

    Groq Python

    The official Python Library for the Groq API

    ...This makes it easy to integrate Groq-powered AI capabilities into backend services, data pipelines, research notebooks, or applications written in Python. For those building AI-based tooling, automation scripts, or ML-backed backends, groq-python abstracts away HTTP request plumbing and exposes a clean API, accelerating development and reducing boilerplate.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    MLJAR Studio

    MLJAR Studio

    Python package for AutoML on Tabular Data with Feature Engineering

    ...It abstracts the common way to preprocess the data, construct the machine learning models, and perform hyper-parameter tuning to find the best model. It is no black box, as you can see exactly how the ML pipeline is constructed (with a detailed Markdown report for each ML model).
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    GoldenCheetah

    GoldenCheetah

    Performance Software for Cyclists, Runners, Triathletes and Coaches

    Analyze using summary metrics like BikeStress, TRIMP, or RPE. Extract insight via models like Critical Power and W'bal. Track and predict performance using models like Banister and PMC. Optimize aerodynamics using Virtual Elevation. Train indoors with ANT and BTLE trainers. Upload and Download with many cloud services including Strava, Withings, and Today's Plan. Import and export data to and from a wide range of bike computers and file formats. Track body measures, and equipment use and set...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Robyn

    Robyn

    Experimental, AI/ML-powered and open sourced Marketing Mix Modeling

    Robyn is an open-source, AI/ML-powered Marketing Mix Modeling (MMM) toolkit developed by Meta Marketing Science under the “facebookexperimental” GitHub umbrella. Its goal is to democratize rigorous MMM: what traditionally required expert statisticians and expensive consulting becomes accessible to any company with data. Robyn takes in historical data (spends on different marketing channels, conversions, or revenue, and optional context or organic-media variables) and uses a combination of techniques, regularized regression (Ridge), time-series decomposition (trend, seasonality, holiday effects), and hyperparameter optimization (via evolutionary algorithms), to estimate the incremental impact of each marketing channel. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    Qbot

    Qbot

    AI-powered Quantitative Investment Research Platform

    Qbot is an open source quantitative research and trading platform that provides a full pipeline from data ingestion and strategy development to backtesting, simulation, and (optionally) live trading. It bundles a lightweight GUI client (built with wxPython) and a modular backend so researchers can iterate on strategies, run batch backtests, and validate ideas in a near-real simulated environment that models latency and slippage. The project places special emphasis on AI-driven strategies —...
    Downloads: 46 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 10
    NanoNeuron

    NanoNeuron

    NanoNeuron is 7 simple JavaScript functions

    Nano-Neuron is a didactic project that reduces the idea of a neuron to a handful of tiny JavaScript functions so learners can see “learning” in action without heavy frameworks. It demonstrates how a scalar input can be linearly transformed with a weight and bias, then adjusted via gradient updates to fit a simple mapping such as Celsius-to-Fahrenheit conversion. The code emphasizes readability over performance, inviting you to step through calculations and watch parameters converge. Because...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    TensorStore

    TensorStore

    Library for reading and writing large multi-dimensional arrays

    ...Rich indexing, slicing, and broadcasting operations make it feel like a familiar array API, while asynchronous I/O pipelines stream chunks efficiently in parallel. Transactional semantics allow atomic updates and consistent snapshots, which is essential for large, shared datasets used by ML and scientific workflows. The library is engineered for scalability—background caching, chunk sharding, and retryable operations keep throughput high even over unreliable networks. With language bindings, it fits into Python-heavy analysis pipelines while retaining a fast C++ core.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Opacus

    Opacus

    Training PyTorch models with differential privacy

    ...Supports most types of PyTorch models and can be used with minimal modification to the original neural network. Open source, modular API for differential privacy research. Everyone is welcome to contribute. ML practitioners will find this to be a gentle introduction to training a model with differential privacy as it requires minimal code changes. Differential Privacy researchers will find this easy to experiment and tinker with, allowing them to focus on what matters.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    ONNX Runtime

    ONNX Runtime

    ONNX Runtime: cross-platform, high performance ML inferencing

    ONNX Runtime is a cross-platform inference and training machine-learning accelerator. ONNX Runtime inference can enable faster customer experiences and lower costs, supporting models from deep learning frameworks such as PyTorch and TensorFlow/Keras as well as classical machine learning libraries such as scikit-learn, LightGBM, XGBoost, etc. ONNX Runtime is compatible with different hardware, drivers, and operating systems, and provides optimal performance by leveraging hardware accelerators...
    Downloads: 59 This Week
    Last Update:
    See Project
  • 14
    MLC LLM

    MLC LLM

    Universal LLM Deployment Engine with ML Compilation

    MLC LLM is a machine learning compiler and deployment framework designed to enable efficient execution of large language models across a wide range of hardware platforms. The project focuses on compiling models into optimized runtimes that can run natively on devices such as GPUs, mobile processors, browsers, and edge hardware. By leveraging machine learning compilation techniques, mlc-llm produces high-performance inference engines that maintain consistent APIs across platforms. The system...
    Downloads: 38 This Week
    Last Update:
    See Project
  • 15
    imbalanced-learn

    imbalanced-learn

    A Python Package to Tackle the Curse of Imbalanced Datasets in ML

    Imbalanced-learn (imported as imblearn) is an open source, MIT-licensed library relying on scikit-learn (imported as sklearn) and provides tools when dealing with classification with imbalanced classes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Netron

    Netron

    Visualizer for neural network, deep learning, machine learning models

    Netron is a viewer for neural network, deep learning and machine learning models. Netron supports ONNX, Keras, TensorFlow Lite, Caffe, Darknet, Core ML, MNN, MXNet, ncnn, PaddlePaddle, Caffe2, Barracuda, Tengine, TNN, RKNN, MindSpore Lite, and UFF. Netron has experimental support for TensorFlow, PyTorch, TorchScript, OpenVINO, Torch, Arm NN, BigDL, Chainer, CNTK, Deeplearning4j, MediaPipe, ML.NET, scikit-learn, TensorFlow.js. There is an extense variety of sample model files to download or open using the browser version. ...
    Downloads: 48 This Week
    Last Update:
    See Project
  • 17
    Lance

    Lance

    Modern columnar data format for ML and LLMs implemented in Rust

    Lance is a columnar data format that is easy and fast to version, query and train on. It’s designed to be used with images, videos, 3D point clouds, audio and of course tabular data. It supports any POSIX file systems, and cloud storage like AWS S3 and Google Cloud Storage.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    cuML

    cuML

    RAPIDS Machine Learning Library

    cuML is a suite of libraries that implement machine learning algorithms and mathematical primitives functions that share compatible APIs with other RAPIDS projects. cuML enables data scientists, researchers, and software engineers to run traditional tabular ML tasks on GPUs without going into the details of CUDA programming. In most cases, cuML's Python API matches the API from scikit-learn. For large datasets, these GPU-based implementations can complete 10-50x faster than their CPU equivalents. For details on performance, see the cuML Benchmarks Notebook.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    LangKit

    LangKit

    An open-source toolkit for monitoring Language Learning Models (LLMs)

    ...Productionizing language models, including LLMs, comes with a range of risks due to the infinite amount of input combinations, which can elicit an infinite amount of outputs. The unstructured nature of text poses a challenge in the ML observability space - a challenge worth solving, since the lack of visibility on the model's behavior can have serious consequences.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 20
    handson-ml3

    handson-ml3

    Fundamentals of Machine Learning and Deep Learning

    ...The notebooks are designed so you can run them locally or on Colab/online, making it accessible for learners regardless of infrastructure. The author includes solutions for exercises and sets up an environment specification so you can reproduce results. Because the discipline of ML evolves rapidly, this repo serves both as a learning path and a reference library you can revisit as models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Petastorm

    Petastorm

    Petastorm library enables single machine or distributed training

    Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code. Petastorm is an open-source data access library developed at Uber ATG. This library enables single machine or distributed training and evaluation of deep learning models directly from datasets in Apache Parquet format. Petastorm supports popular Python-based machine learning (ML) frameworks such as Tensorflow, PyTorch, and PySpark. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    MLDatasets.jl

    MLDatasets.jl

    Utility package for accessing common Machine Learning datasets

    This package represents a community effort to provide a common interface for accessing common Machine Learning (ML) datasets. In contrast to other data-related Julia packages, the focus of MLDatasets.jl is specifically on downloading, unpacking, and accessing benchmark datasets. Functionality for the purpose of data processing or visualization is only provided to a degree that is special to some datasets.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    TurboQuant+

    TurboQuant+

    Implementation of TurboQuant (ICLR 2026)

    ...TurboQuant Plus focuses on experimentation and performance tuning, allowing developers to test different configurations and evaluate trade-offs. Its architecture supports extensibility, enabling further development of quantization methods and integration with existing ML pipelines.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 24
    BindsNET

    BindsNET

    Simulation of spiking neural networks (SNNs) using PyTorch

    ...BindsNET is a spiking neural network simulation library geared towards the development of biologically inspired algorithms for machine learning. This package is used as part of ongoing research on applying SNNs to machine learning (ML) and reinforcement learning (RL) problems in the Biologically Inspired Neural & Dynamical Systems (BINDS) lab.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 25
    Flexprice

    Flexprice

    Usage-based pricing and billing for developers

    Flexprice is an open-source dynamic pricing engine designed to help online businesses and marketplaces automate and optimize their pricing strategies. It allows developers and data scientists to experiment with pricing algorithms using real-time market data, inventory levels, and historical sales to maximize revenue, conversion, or competitiveness. Built with flexibility in mind, Flexprice can be integrated into existing e-commerce infrastructure via APIs and supports simulation and A/B...
    Downloads: 9 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB