Showing 227 open source projects for "ml"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    SageMaker Experiments Python SDK

    SageMaker Experiments Python SDK

    Experiment tracking and metric logging for Amazon SageMaker notebooks

    Experiment tracking in SageMaker Training Jobs, Processing Jobs, and Notebooks. SageMaker Experiments is an AWS service for tracking machine learning Experiments. The SageMaker Experiments Python SDK is a high-level interface to this service that helps you track Experiment information using Python. Experiment tracking powers the machine learning integrated development environment Amazon SageMaker Studio. Experiment: A collection of related Trials. Add Trials to an Experiment that you wish to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    OptiMate

    OptiMate

    Libraries for optimizing AI models, inference speed, and GPU usage

    Optimate is an open source collection of libraries designed to optimize the performance and cost efficiency of artificial intelligence models across different stages of the machine learning lifecycle. It groups several internal optimization tools developed by Nebuly AI into a single repository that focuses on improving inference speed, reducing infrastructure usage, and streamlining model training workflows. Its modules help developers automatically apply optimization techniques that better...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3
    FFCV

    FFCV

    Fast Forward Computer Vision (and other ML workloads!)

    ffcv is a drop-in data loading system that dramatically increases data throughput in model training. From gridding to benchmarking to fast research iteration, there are many reasons to want faster model training. Below we present premade codebases for training on ImageNet and CIFAR, including both (a) extensible codebases and (b) numerous premade training configurations.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    UnionML

    UnionML

    Build and deploy machine learning microservices

    ...Fit the rich ecosystem of tools and frameworks into a common protocol for machine learning. Using industry-standard machine learning methods, implement endpoints for fetching data, training models, serving predictions (and much more) to write a complete ML stack in one place. Data science, ML engineering, and MLOps practitioners can all gather around UnionML apps as a way of defining a single source of truth about your ML system’s behavior. This helps you maintain consistent code across your ML stack, from training to prediction logic.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 5
    CodeContests

    CodeContests

    Large dataset of coding contests designed for AI and ML model training

    CodeContests, developed by Google DeepMind, is a large-scale competitive programming dataset designed for training and evaluating machine learning models on code generation and problem solving. This dataset played a central role in the development of AlphaCode, DeepMind’s model for solving programming problems at a human-competitive level, as published in Science. CodeContests aggregates problems and human-written solutions from multiple programming competition platforms, including AtCoder,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    data-science-on-gcp

    data-science-on-gcp

    Source code accompanying book: Data Science on the GCP

    ...The repository is organized into multiple directories that reflect real-world pipelines, such as ingesting data, running SQL-based analytics, streaming data processing, using Spark and Dataproc, applying BigQuery ML, and deploying models with Vertex AI. It emphasizes practical, production-oriented workflows rather than isolated examples, showing how different Google Cloud services interact to form cohesive pipelines.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Apple Neural Engine (ANE) Transformers

    Apple Neural Engine (ANE) Transformers

    Reference implementation of the Transformer architecture optimized

    ...It demonstrates how to structure attention and related layers to achieve substantial speedups and lower peak memory compared to baseline implementations when deployed to ANE. The repository targets practitioners who want to keep familiar PyTorch modeling while preparing models for Core ML/ANE execution paths. Documentation highlights reported improvements in throughput and memory residency, while releases track incremental fixes and packaging updates. The project sits alongside related Apple ML repos that focus on deploying attention-based models efficiently to ANE-equipped hardware. In short, it’s a practical blueprint for adapting Transformers to Apple’s dedicated ML accelerator without rewriting entire model stacks.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    CommandlineConfig

    CommandlineConfig

    A library for users to write configurations in Python

    ...One of its core strengths is the ability to override configuration values directly from the command line, making it convenient to run many experimental variants without editing files repeatedly. The library supports arbitrarily deep nested structures, type handling, enumerated value constraints, and even tuple types, which are common in ML experiment setups. It also includes features for automatic version checking and convenient help output, so users can quickly see available parameters and their descriptions via a -h flag.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Yellowbrick

    Yellowbrick

    Visual analysis and diagnostic tools to facilitate ML selection

    Yellowbrick extends the Scikit-Learn API to make model selection and hyperparameter tuning easier. Under the hood, it’s using Matplotlib. Yellowbrick is a suite of visual diagnostic tools called "Visualizers" that extend the scikit-learn API to allow human steering of the model selection process. In a nutshell, Yellowbrick combines scikit-learn with matplotlib in the best tradition of the scikit-learn documentation, but to produce visualizations for your machine learning workflow.
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10
    Auto-PyTorch

    Auto-PyTorch

    Automatic architecture search and hyperparameter optimization

    While early AutoML frameworks focused on optimizing traditional ML pipelines and their hyperparameters, another trend in AutoML is to focus on neural architecture search. To bring the best of these two worlds together, we developed Auto-PyTorch, which jointly and robustly optimizes the network architecture and the training hyperparameters to enable fully automated deep learning (AutoDL). Auto-PyTorch is mainly developed to support tabular data (classification, regression) and time series data (forecasting). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    mlscraper

    mlscraper

    ML-based HTML scraper that learns extraction rules from examples

    mlscraper is a Python library designed to automatically extract structured data from HTML pages without requiring developers to manually write CSS selectors or XPath rules. Instead of defining extraction logic by hand, users provide a few examples of the data they want to retrieve from a webpage. It analyzes those examples within the HTML document and determines patterns or rules that can be used to extract the same type of information from similar pages. Once trained, the generated scraper...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Blankly

    Blankly

    Easily build, backtest and deploy your algo in just a few lines

    ...Models can be instantly backtested, paper traded, sandbox tested and run live by simply changing a single line. We built blankly for every type of quant including training & running ML models in the same environment, cross-exchange/cross-symbol arbitrage, and even long/short positions on stocks (all with built-in WebSockets). Blankly is the first framework to enable developers to backtest, paper trade, and go live across exchanges without modifying a single line of trading logic on stocks, crypto, and forex. Every model needs to figure out how to buy and sell. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    m2cgen

    m2cgen

    Transform ML models into a native code

    m2cgen (Model 2 Code Generator) - is a lightweight library that provides an easy way to transpile trained statistical models into a native code (Python, C, Java, Go, JavaScript, Visual Basic, C#, PowerShell, R, PHP, Dart, Haskell, Ruby, F#, Rust, Elixir). Transform ML models into a native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart, Haskell, Ruby, F#, Rust) with zero dependencies. Some models force input data to be particular type during prediction phase in their native Python libraries. Currently, m2cgen works only with float64 (double) data type. You can try to cast your input data to another type manually and check results again. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Guild AI

    Guild AI

    Experiment tracking, ML developer tools

    Guild AI is an open-source experiment tracking toolkit designed to bring systematic control to machine learning workflows, enabling users to build better models faster. It automatically captures every detail of training runs as unique experiments, facilitating comprehensive tracking and analysis. Users can compare and analyze runs to deepen their understanding and incrementally improve models. Guild AI simplifies hyperparameter tuning by applying state-of-the-art algorithms through...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Machine Learning PyTorch Scikit-Learn

    Machine Learning PyTorch Scikit-Learn

    Code Repository for Machine Learning with PyTorch and Scikit-Learn

    Initially, this project started as the 4th edition of Python Machine Learning. However, after putting so much passion and hard work into the changes and new topics, we thought it deserved a new title. So, what’s new? There are many contents and additions, including the switch from TensorFlow to PyTorch, new chapters on graph neural networks and transformers, a new section on gradient boosting, and many more that I will detail in a separate blog post. For those who are interested in knowing...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 16
    Texar-PyTorch

    Texar-PyTorch

    Integrating the Best of TF into PyTorch, for Machine Learning

    Texar-PyTorch is a toolkit aiming to support a broad set of machine learning, especially natural language processing and text generation tasks. Texar provides a library of easy-to-use ML modules and functionalities for composing whatever models and algorithms. The tool is designed for both researchers and practitioners for fast prototyping and experimentation. Texar-PyTorch was originally developed and is actively contributed by Petuum and CMU in collaboration with other institutes. A mirror of this repository is maintained by Petuum Open Source. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Machine Learning Financial Laboratory

    Machine Learning Financial Laboratory

    MlFinLab helps portfolio managers and traders

    MlFinLab is a comprehensive Python library designed to support the development of machine learning strategies in quantitative finance and algorithmic trading. The project provides a large collection of tools that implement techniques from academic research on financial machine learning. It covers the full lifecycle of developing data-driven trading strategies, including data preprocessing, feature engineering, labeling techniques, model training, and performance evaluation. Many of the...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    AI Platform Training and Prediction
    ...Although the repository has been archived, it still provides extensive reference implementations and practical examples for learning cloud-based ML workflows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    mlcourse.ai

    mlcourse.ai

    Open Machine Learning Course

    mlcourse.ai is an open Machine Learning course by OpenDataScience (ods.ai), led by Yury Kashnitsky (yorko). Having both a Ph.D. degree in applied math and a Kaggle Competitions Master tier, Yury aimed at designing an ML course with a perfect balance between theory and practice. Thus, the course meets you with math formulae in lectures, and a lot of practice in a form of assignments and Kaggle Inclass competitions. Currently, the course is in a self-paced mode. Here we guide you through the self-paced mlcourse.ai.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    SBW (Systems Biology Workbench)

    SBW (Systems Biology Workbench)

    Framework for Systems Biology

    ...It comes with a large number of modules, encompassing the whole modeling cycle: creating computational models, simulating and analyzing them, visualizing the information, in order to improve the models. All using community standards, such as SED-ML, SBML and MIRIAM.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    AWS Step Functions Data Science SDK

    AWS Step Functions Data Science SDK

    For building machine learning (ML) workflows and pipelines on AWS

    The AWS Step Functions Data Science SDK is an open-source library that allows data scientists to easily create workflows that process and publish machine learning models using Amazon SageMaker and AWS Step Functions. You can create machine learning workflows in Python that orchestrate AWS infrastructure at scale, without having to provision and integrate the AWS services separately. The best way to quickly review how the AWS Step Functions Data Science SDK works is to review the related...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    igel

    igel

    Machine learning tool that allows you to train and test models

    ...Besides default values, igel can use auto-ml features to figure out a model that can work great with your data.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 23
    Music Source Separation

    Music Source Separation

    Separate audio recordings into individual sources

    Music Source Separation is a PyTorch-based open-source implementation for the task of separating a music (or audio) recording into its constituent sources — for example isolating vocals, instruments, bass, accompaniment, or background from a mixed track. It aims to give users the ability to take any existing song and decompose it into separate stems (vocals, accompaniment, etc.), or to train custom separation models on their own datasets (e.g. for speech enhancement, instrument isolation, or...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    ML workspace

    ML workspace

    All-in-one web-based IDE specialized for machine learning

    All-in-one web-based development environment for machine learning. The ML workspace is an all-in-one web-based IDE specialized for machine learning and data science. It is simple to deploy and gets you started within minutes to productively built ML solutions on your own machines. This workspace is the ultimate tool for developers preloaded with a variety of popular data science libraries (e.g., Tensorflow, PyTorch, Keras, Sklearn) and dev tools (e.g., Jupyter, VS Code, Tensorboard) perfectly configured, optimized, and integrated. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    Photonix Photo Manager

    Photonix Photo Manager

    A modern, web-based photo management server

    ...Run it on your home server and it will let you find the right photo from your collection on any device. Smart filtering is made possible by object recognition, face recognition, location awareness, color analysis and other ML algorithms. This project is currently in development and not feature complete for a version 1.0 yet. If you don't mind putting up with broken parts or want to help out, run the Docker image and give it a go. I'd love for other contributors to get involved. You can move some photos into the folder data/photos and they should get detected and imported immediately. ...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB