Open Source Linux Machine Learning Software - Page 9

Sort By:

Machine Learning Software for Linux

View 57 business solutions

Machine Learning Linux Clear Filters

Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
1

LSTMs for Human Activity Recognition

Human Activity Recognition example using TensorFlow on smartphone

LSTM-Human-Activity-Recognition is a machine learning project that demonstrates how recurrent neural networks can be used to recognize human activities from sensor data. The repository implements a deep learning model based on Long Short-Term Memory (LSTM) networks to classify physical activities using time-series data collected from wearable sensors. The project uses the well-known Human Activity Recognition dataset derived from smartphone accelerometer and gyroscope signals. Through the use of sequential neural network architectures, the system learns patterns in motion data that correspond to activities such as walking, sitting, standing, or climbing stairs. The repository includes data preprocessing scripts, neural network architecture definitions, and training pipelines that allow researchers to reproduce and modify the experiments. It serves as an educational example of how deep learning models can process temporal sensor signals for pattern recognition tasks.

Downloads: 1 This Week

Last Update: 2026-03-11
See Project
2

Learning Interpretability Tool

Interactively analyze ML models to understand their behavior

The Learning Interpretability Tool (LIT, formerly known as the Language Interpretability Tool) is a visual, interactive ML model-understanding tool that supports text, image, and tabular data. It can be run as a standalone server, or inside of notebook environments such as Colab, Jupyter, and Google Cloud Vertex AI notebooks.

Downloads: 1 This Week

Last Update: 2024-12-20
See Project
3

LibrePhotos

A self-hosted open source photo management service

LibrePhotos is an open-source self-hosted photo management platform designed to organize, browse, and analyze personal media libraries while preserving user privacy. The system allows individuals to store and manage their photos and videos locally rather than relying on commercial cloud services. It provides features similar to services like Google Photos but runs on a private server controlled by the user. The application includes AI-powered tools that automatically analyze images to detect faces, objects, and locations, allowing photos to be grouped and searched more efficiently. LibrePhotos supports a wide variety of media formats and provides a web interface that can be accessed from different devices and operating systems. The platform is built using a Django backend and a React frontend, forming a full-stack web application architecture.

Downloads: 1 This Week

Last Update: 2026-03-15
See Project
4

LightGBM

Gradient boosting framework based on decision tree algorithms

LightGBM or Light Gradient Boosting Machine is a high-performance, open source gradient boosting framework based on decision tree algorithms. Compared to other boosting frameworks, LightGBM offers several advantages in terms of speed, efficiency and accuracy. Parallel experiments have shown that LightGBM can attain linear speed-up through multiple machines for training in specific settings, all while consuming less memory. LightGBM supports parallel and GPU learning, and can handle large-scale data. It’s become widely-used for ranking, classification and many other machine learning tasks.

Downloads: 1 This Week

Last Update: 2025-02-15
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

Lightly

A python library for self-supervised learning on images

A python library for self-supervised learning on images. We, at Lightly, are passionate engineers who want to make deep learning more efficient. That's why - together with our community - we want to popularize the use of self-supervised methods to understand and curate raw image data. Our solution can be applied before any data annotation step and the learned representations can be used to visualize and analyze datasets. This allows selecting the best core set of samples for model training through advanced filtering. We provide PyTorch, PyTorch Lightning and PyTorch Lightning distributed examples for each of the models to kickstart your project. Lightly requires Python 3.6+ but we recommend using Python 3.7+. We recommend installing Lightly in a Linux or OSX environment. With lightly, you can use the latest self-supervised learning methods in a modular way using the full power of PyTorch. Experiment with different backbones, models, and loss functions.

Downloads: 1 This Week

Last Update: 2026-03-24
See Project
6

MLJ.jl

A Julia machine learning framework

MLJ (Machine Learning in Julia) is a toolbox written in Julia providing a common interface and meta-algorithms for selecting, tuning, evaluating, composing, and comparing about 200 machine learning models written in Julia and other languages. The functionality of MLJ is distributed over several repositories illustrated in the dependency chart below. These repositories live at the JuliaAI umbrella organization.

Downloads: 1 This Week

Last Update: 2026-03-24
See Project
7

Machine Learning From Scratch

Bare bones NumPy implementations of machine learning models

ML-From-Scratch is an open-source machine learning project that demonstrates how to implement common machine learning algorithms using only basic Python and NumPy rather than relying on high-level frameworks. The goal of the project is to help learners understand how machine learning algorithms work internally by building them step by step from fundamental mathematical operations. The repository includes implementations of algorithms ranging from simple models such as linear regression and logistic regression to more complex techniques such as decision trees, support vector machines, clustering methods, and neural networks. Because the code avoids external machine learning libraries, it exposes the full logic behind model training, optimization, and prediction processes. The project also provides examples and explanations that illustrate how the algorithms behave and how different components interact during training.

Downloads: 1 This Week

Last Update: 2026-03-10
See Project
8

Machine Learning Glossary

Machine learning glossary

Machine Learning Glossary is an open educational project that provides clear explanations of machine learning terminology and concepts through visual diagrams and concise definitions. The goal of the repository is to make machine learning topics easier to understand by presenting definitions alongside examples, visual illustrations, and references for further learning. It covers a wide range of topics including neural networks, regression models, optimization techniques, loss functions, and evaluation metrics. The content is organized into sections that progressively introduce key ideas from basic machine learning concepts to more advanced mathematical topics. Many pages include diagrams or code examples to illustrate how algorithms work in practice. Because the project emphasizes accessibility, it is particularly useful for beginners who want a conceptual overview of machine learning terminology before diving into more technical research papers.

Downloads: 1 This Week

Last Update: 2026-03-11
See Project
9

Machine Learning Study

This repository is for helping those interested in machine learning

Machine Learning Study is an educational repository containing tutorials and study materials related to machine learning and data science using Python. The project compiles notebooks, explanatory documents, and practical code examples that illustrate common machine learning workflows. Topics covered include supervised learning algorithms, feature engineering, model training, and performance evaluation techniques. The repository is structured as a learning resource that guides readers through building machine learning models step by step. It often demonstrates how to implement algorithms using widely used libraries such as NumPy, pandas, scikit-learn, and TensorFlow. Many examples include dataset preparation, visualization of results, and experimentation with different modeling approaches.

Downloads: 1 This Week

Last Update: 2026-03-11
See Project
Fully Managed MySQL, PostgreSQL, and SQL Server
Automatic backups, patching, replication, and failover. Focus on your app, not your database.

Cloud SQL handles your database ops end to end, so you can focus on your app.

Try Free
10

MachineLearningStocks

Using python and scikit-learn to make stock predictions

MachineLearningStocks is a Python-based template project that demonstrates how machine learning can be applied to predicting stock market performance. The project provides a structured workflow that collects financial data, processes features, trains predictive models, and evaluates trading strategies. Using libraries such as pandas and scikit-learn, the repository shows how historical financial indicators can be transformed into machine learning features. The model attempts to predict whether specific stocks will outperform a benchmark index such as the S&P 500. The repository includes scripts for parsing financial statistics, building training datasets, and performing backtesting to evaluate model performance over historical periods. Because it is structured as a template project, developers are encouraged to extend or modify the pipeline to test different algorithms, features, or investment strategies.

Downloads: 1 This Week

Last Update: 2026-03-12
See Project
11

Made With ML

Learn how to develop, deploy and iterate on production-grade ML

Made-With-ML is an open-source educational repository and course designed to teach developers how to build production-grade machine learning systems using modern MLOps practices. The project focuses on bridging the gap between experimental machine learning notebooks and real-world software systems that can be deployed, monitored, and maintained at scale. It provides structured lessons and practical code examples that demonstrate how to design machine learning workflows, manage datasets, train models, evaluate performance, and deploy inference services. The repository organizes these concepts into modular Python scripts that follow software engineering best practices such as testing, configuration management, logging, and version control. Through a combination of tutorials, notebooks, and production-ready scripts, the project demonstrates how machine learning applications should be developed as maintainable systems rather than isolated experiments.

Downloads: 1 This Week

Last Update: 2026-03-10
See Project
12

Materials Discovery: GNoME

AI discovers 520000 stable inorganic crystal structures for research

Materials Discovery (GNoME) is a large-scale research initiative by Google DeepMind focused on applying graph neural networks to accelerate the discovery of stable inorganic crystal materials. The project centers on Graph Networks for Materials Exploration (GNoME), a message-passing neural network architecture trained on density functional theory (DFT) data to predict material stability and energy formation. Using GNoME, DeepMind identified 381,000 new stable materials, later expanding the dataset to include over 520,000 materials within 1 meV/atom of the convex hull as of August 2024. The repository provides datasets, model definitions, and interactive Colabs for exploring these materials, computing decomposition energies, and visualizing chemical families. Additionally, it includes JAX-based implementations of GNoME and Nequip—the latter being used to train interatomic potentials for dynamic simulations.

Downloads: 1 This Week

Last Update: 2 days ago
See Project
13

NSFW Detection Machine Learning Model

Keras model of NSFW detector

Keras model of NSFW detector, NSFW Detection Machine Learning Model.

Downloads: 1 This Week

Last Update: 2024-08-15
See Project
14

NVIDIA PhysicsNeMo

Open-source deep-learning framework for building and training

NVIDIA PhysicsNeMo is an open-source deep learning framework designed for building artificial intelligence models that incorporate physical laws and scientific knowledge into machine learning workflows. The framework focuses on the emerging field of physics-informed machine learning, where neural networks are used alongside physical equations to model complex scientific systems. PhysicsNeMo provides modular Python components that allow developers to create scalable training and inference pipelines for models that combine data-driven learning with physics-based constraints. It is built on top of the PyTorch ecosystem and integrates with GPU-accelerated computing environments to handle computationally demanding simulations and datasets. The framework supports a wide range of scientific applications, including computational fluid dynamics, climate modeling, weather prediction, and engineering simulations.

Downloads: 1 This Week

Last Update: 2026-03-12
See Project
15

Netflix Maestro

Netflix’s Workflow Orchestrator

Maestro is a large-scale workflow orchestration platform originally developed by Netflix to coordinate complex data processing and machine learning workflows across distributed systems. The system acts as a general-purpose workflow orchestrator that manages the execution, scheduling, monitoring, and recovery of large pipelines used for analytics and AI operations. It was designed to support the demanding internal infrastructure of Netflix, where thousands of workflows must process massive volumes of data reliably and efficiently every day. The platform enables engineers and data scientists to define workflows using structured configuration files and execute tasks across diverse compute environments, including scripts, containers, and notebook environments. Maestro provides built-in mechanisms for retry logic, task scheduling, dependency management, and error handling, which are essential when orchestrating production-scale pipelines.

Downloads: 1 This Week

Last Update: 2026-03-24
See Project
16

NeuralForecast

Scalable and user friendly neural forecasting algorithms.

NeuralForecast offers a large collection of neural forecasting models focusing on their performance, usability, and robustness. The models range from classic networks like RNNs to the latest transformers: MLP, LSTM, GRU, RNN, TCN, TimesNet, BiTCN, DeepAR, NBEATS, NBEATSx, NHITS, TiDE, DeepNPTS, TSMixer, TSMixerx, MLPMultivariate, DLinear, NLinear, TFT, Informer, AutoFormer, FedFormer, PatchTST, iTransformer, StemGNN, and TimeLLM. There is a shared belief in Neural forecasting methods' capacity to improve forecasting pipeline's accuracy and efficiency. Unfortunately, available implementations and published research are yet to realize neural networks' potential. They are hard to use and continuously fail to improve over statistical methods while being computationally prohibitive. For this reason, we created NeuralForecast, a library favoring proven accurate and efficient models focusing on their usability.

Downloads: 1 This Week

Last Update: 6 days ago
See Project
17

NeuroMatch Academy (NMA)

NMA Computational Neuroscience course

NMA Computational Neuroscience course. We have curated a curriculum that spans most areas of computational neuroscience (a hard task in an increasingly big field!). We will expose you to both theoretical modeling and more data-driven analyses. The Neuro Video Series is a series of 12 videos that covers basic neuroscience concepts and neuroscience methods. These videos are completely optional and do not need to be watched in a fixed order so you can pick and choose which videos will help you brush up on your knowledge. The pre-reqs refresher days are asynchronous, so you can go through the material on your own time. You will learn how to code in Python from scratch using a simple neural model, the leaky integrate-and-fire model, as a motivation. Then, you will cover linear algebra, calculus and probability & statistics. The topics covered on these days were carefully chosen based on what you need for the comp neuro course.

Downloads: 1 This Week

Last Update: 2026-03-20
See Project
18

OGB

Benchmark datasets, data loaders, and evaluators for graph machine

The Open Graph Benchmark (OGB) is a collection of realistic, large-scale, and diverse benchmark datasets for machine learning on graphs. OGB datasets are automatically downloaded, processed, and split using the OGB Data Loader. The model performance can be evaluated using the OGB Evaluator in a unified manner. OGB is a community-driven initiative in active development. We expect the benchmark datasets to evolve. OGB provides a diverse set of challenging and realistic benchmark datasets that are of varying sizes and cover a variety graph machine learning tasks, including prediction of node, link, and graph properties. OGB fully automates dataset processing. The OGB data loaders automatically download and process graphs, provide graph objects that are fully compatible with Pytorch Geometric and DGL. OGB provides standardized dataset splits and evaluators that allow for easy and reliable comparison of different models in a unified manner.

Downloads: 1 This Week

Last Update: 2023-04-07
See Project
19

OneFlow

OneFlow is a deep learning framework designed to be user-friendly

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient. An extension for OneFlow to target third-party compiler, such as XLA, TensorRT and OpenVINO etc.CUDA runtime is statically linked into OneFlow. OneFlow will work on a minimum supported driver, and any driver beyond. For more information. Distributed performance (efficiency) is the core technical difficulty of the deep learning framework. OneFlow focuses on performance improvement and heterogeneous distributed expansion. It adheres to the core concept and architecture of static compilation and streaming parallelism and solves the memory wall challenge at the cluster level. world-leading level. Provides a variety of services from primary AI talent training to enterprise-level machine learning lifecycle integrated management (MLOps), including AI training and AI development, and supports three deployment modes of public cloud, private cloud and hybrid cloud.

Downloads: 1 This Week

Last Update: 2024-03-11
See Project
20

OpenMLDB

OpenMLDB is an open-source machine learning database

OpenMLDB is an open-source machine learning database that provides a feature platform computing consistent features for training and inference. OpenMLDB is an open-source machine learning database that is committed to solving the data and feature challenges. OpenMLDB has been deployed in hundreds of real-world enterprise applications. It prioritizes the capability of feature engineering using SQL for open-source, which offers a feature platform enabling consistent features for training and inference. Real-time features are essential for many machine learning applications, such as real-time personalized recommendations and risk analytics. However, a feature engineering script developed by data scientists (Python scripts in most cases) cannot be directly deployed into production for online inference because it usually cannot meet the engineering requirements, such as low latency, high throughput and high availability.

Downloads: 1 This Week

Last Update: 2025-02-21
See Project
21

OpenMLSys-ZH

Machine Learning Systems: Design and Implementation

This repository is the Chinese translation (or localization) of the OpenMLSys project documentation. Its aim is to make the technical content, tutorials, architecture descriptions, and user guides of the OpenMLSys system more accessible to Chinese-speaking users. The repo mirrors the structure of the original OpenMLSys docs: sections on system design, API references, deployment instructions, module overviews, and example workflows. It helps bridge language barriers in open machine learning systems by providing side-by-side translation or localized explanations. The repository includes scripts or tooling to keep translation synchronized with upstream changes, versioning, and possibly translation metadata (contributors, timestamp). Users can browse or clone the translated documentation to follow along with the original content, deploy examples, or understand system internals in their preferred language.

Downloads: 1 This Week

Last Update: 2026-03-15
See Project
22

OpenVINO Notebooks

Jupyter notebook tutorials for OpenVINO

openvino_notebooks is a collection of interactive Jupyter notebooks designed to demonstrate how to build, optimize, and deploy artificial intelligence applications using the OpenVINO toolkit. The repository provides practical tutorials that guide developers through various AI workflows including computer vision, natural language processing, and generative AI tasks. Each notebook demonstrates how to run pre-trained models, optimize inference performance, and deploy models across hardware such as CPUs, GPUs, and specialized accelerators. The tutorials also illustrate how OpenVINO integrates with models from frameworks like PyTorch, TensorFlow, and ONNX to accelerate inference workloads. Many notebooks include end-to-end examples that show how to prepare input data, load optimized models, run inference, and visualize results. The project is particularly useful for developers who want to learn how to optimize machine learning inference pipelines for production environments.

Downloads: 1 This Week

Last Update: 16 hours ago
See Project
23

PML

The easiest way to use deep metric learning in your application

This library contains 9 modules, each of which can be used independently within your existing codebase, or combined together for a complete train/test workflow. To compute the loss in your training loop, pass in the embeddings computed by your model, and the corresponding labels. The embeddings should have size (N, embedding_size), and the labels should have size (N), where N is the batch size. The TripletMarginLoss computes all possible triplets within the batch, based on the labels you pass into it. Anchor-positive pairs are formed by embeddings that share the same label, and anchor-negative pairs are formed by embeddings that have different labels. Loss functions can be customized using distances, reducers, and regularizers. In the diagram below, a miner finds the indices of hard pairs within a batch. These are used to index into the distance matrix, computed by the distance object. For this diagram, the loss function is pair-based, so it computes a loss per pair.

Downloads: 1 This Week

Last Update: 2025-08-17
See Project
24

PaddlePaddle

PArallel Distributed Deep LEarning: Machine Learning Framework

PaddlePaddle is an open source deep learning industrial platform with advanced technologies and a rich set of features that make innovation and application of deep learning easier. It is the only independent R&D deep learning platform in China, and has been widely adopted in various sectors including manufacturing, agriculture and enterprise service. PaddlePaddle covers core deep learning frameworks, basic model libraries, end-to-end development kits and more, with support for both dynamic and static graphs.

Downloads: 1 This Week

Last Update: 2026-01-31
See Project
25

Papers with Code

List of different papers for coding

pwc is an open-source repository that compiles machine learning and artificial intelligence research papers together with their corresponding implementation code. The project functions as a curated dataset linking academic publications with practical software implementations, allowing researchers and engineers to quickly locate code that reproduces published results. The repository organizes information such as paper titles, conferences, and links to code implementations so that users can explore recent developments in machine learning more efficiently. It was originally created to support the discovery and reproducibility of AI research by connecting scholarly work with working software projects. Although the repository itself is no longer actively maintained, it still provides a historical dataset that reflects many influential research publications and their associated implementations.

Downloads: 1 This Week

Last Update: 2026-03-10
See Project