Open Source Linux Machine Learning Software - Page 6

Machine Learning Software for Linux

1

GROBID

Machine learning software for extracting information

GROBID is a machine learning library for extracting, parsing, and restructuring raw documents such as PDFs into structured XML/TEI-encoded documents, with a particular focus on technical and scientific publications. Development started in 2008 as a hobby, and the tool was released as open source in 2011. Work on GROBID has been steady as a side project since the beginning and is expected to continue as such. Features include header extraction and parsing from articles in PDF format, covering the usual bibliographical information (e.g. title, abstract, authors, affiliations, keywords). They also include reference extraction and parsing from articles in PDF format, with an F1-score of around 0.87 against an independent PubMed Central set of 1,943 PDFs containing 90,125 references, and around 0.89 on a similar bioRxiv set of 2,000 PDFs (using the Deep Learning citation model). All the usual publication metadata are covered (including DOI, PMID, etc.).
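The F1-scores quoted for reference extraction combine precision and recall into a single number. As a minimal sketch, assuming hypothetical counts of correctly extracted, spurious, and missed references (the counts below are illustrative only and are not GROBID's evaluation data, and `f1_score` is a helper written for this example), the metric can be computed like this:

```python
def f1_score(true_positives: int, false_positives: int, false_negatives: int) -> float:
    """Return the F1-score, the harmonic mean of precision and recall."""
    if true_positives == 0:
        return 0.0
    precision = true_positives / (true_positives + false_positives)
    recall = true_positives / (true_positives + false_negatives)
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts, for illustration only.
score = f1_score(true_positives=80, false_positives=10, false_negatives=30)
print(round(score, 2))
```

Because F1 is a harmonic mean, it stays low unless both precision and recall are high, which is why it is commonly reported for extraction tasks.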

Downloads: 2 This Week

Last Update: 2025-05-11
2

GluonTS

Probabilistic time series modeling in Python

GluonTS is a Python package for probabilistic time series modeling, focusing on deep learning-based models. GluonTS requires Python 3.6 or newer, and the easiest way to install it is via pip. The introductory example trains a DeepAR model and makes predictions using the simple "airpassengers" dataset. The dataset consists of a single time series containing monthly international passenger counts between the years 1949 and 1960, a total of 144 values (12 years * 12 months). The dataset is split into train and test parts by removing the last three years (36 months) from the train data, so the model is trained on just the first nine years of data. Python has the notion of extras: dependencies that can be optionally installed to unlock certain features of a package. GluonTS makes extensive use of optional dependencies to keep the number of required dependencies minimal, and exposes many extras so that users can still opt in to certain features.
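The train/test split described above can be sketched in plain Python. This sketch does not use the GluonTS API, and the values are placeholders rather than the real passenger counts; the names `months`, `values`, and `prediction_length` are chosen for this illustration only.

```python
# Month labels for 1949-01 through 1960-12 (12 years * 12 months = 144).
months = [f"{year}-{month:02d}" for year in range(1949, 1961) for month in range(1, 13)]

# Placeholder values standing in for the passenger counts (not the real data).
values = list(range(len(months)))

prediction_length = 36  # hold out the last three years

# Training data is everything except the final 36 months.
train_months = months[:-prediction_length]
train_values = values[:-prediction_length]

# The held-out part is the final 36 months.
test_months = months[-prediction_length:]
test_values = values[-prediction_length:]

print(len(months), len(train_months), len(test_months))
print(train_months[0], train_months[-1], test_months[0])
```

Splitting by position rather than at random keeps the held-out months strictly after the training months, which matches how a forecast would be used in practice.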

Downloads: 2 This Week

Last Update: 2025-06-27
3

HDBSCAN

A high performance implementation of HDBSCAN clustering

HDBSCAN - Hierarchical Density-Based Spatial Clustering of Applications with Noise. Performs DBSCAN over varying epsilon values and integrates the result to find a clustering that gives the best stability over epsilon. This allows HDBSCAN to find clusters of varying densities (unlike DBSCAN), and be more robust to parameter selection. In practice this means that HDBSCAN returns a good clustering straight away with little or no parameter tuning -- and the primary parameter, minimum cluster size, is intuitive and easy to select. HDBSCAN is ideal for exploratory data analysis; it's a fast and robust algorithm that you can trust to return meaningful clusters (if there are any).

Downloads: 2 This Week

Last Update: 6 days ago
4

IREE

A retargetable MLIR-based machine learning compiler runtime toolkit

IREE (Intermediate Representation Execution Environment, pronounced as "eerie") is an MLIR-based end-to-end compiler and runtime that lowers Machine Learning (ML) models to a unified IR that scales up to meet the needs of the data center and down to satisfy the constraints and special considerations of mobile and edge deployments.

Downloads: 2 This Week

Last Update: 2026-03-19
5

Interpretable machine learning

Book about interpretable machine learning

This book is about interpretable machine learning. Machine learning is being built into many products and processes of our daily lives, yet decisions made by machines don't automatically come with an explanation. An explanation increases trust in the decision and in the machine learning model. As the programmer of an algorithm, you want to know whether you can trust the learned model. Did it learn generalizable features? Or are there some odd artifacts in the training data which the algorithm picked up? This book gives an overview of techniques that can be used to make black boxes as transparent as possible and to explain decisions. The first chapter introduces algorithms that produce simple, interpretable models, together with instructions on how to interpret the output. The later chapters focus on analyzing complex models and their decisions. In an ideal future, machines will be able to explain their decisions and make the transition into an algorithmic age more human.

Downloads: 2 This Week

Last Update: 2025-03-13
6

Learn PyTorch for Deep Learning

Materials for the Learn PyTorch for Deep Learning course

Learn PyTorch for Deep Learning is an open-source educational repository that provides the full learning materials for the “Learn PyTorch for Deep Learning: Zero to Mastery” course created by Daniel Bourke. The project is designed to teach beginners how to build deep learning models using PyTorch through a hands-on, code-first learning approach. Instead of focusing heavily on theory alone, the repository encourages learners to experiment with code and develop practical machine learning skills through guided examples and exercises. The materials include Jupyter notebooks, explanations of core deep learning concepts, and step-by-step demonstrations of building and training neural networks. Throughout the lessons, users learn how to work with tensors, create neural network architectures, manage training workflows, and evaluate model performance.

Downloads: 2 This Week

Last Update: 2026-03-10
7

ML Visuals

ML Visuals contains figures and templates which you can reuse

ML Visuals is an open-source project that provides a collection of reusable diagrams, templates, and visual resources designed to improve communication in machine learning research and education. The repository contains professional-quality figures that illustrate machine learning concepts such as neural networks, optimization methods, model architectures, and common deep learning techniques. These visuals are intended to help researchers, educators, and students create clearer presentations, blog posts, and scientific papers. The project is maintained as a collaborative community effort where contributors can add new diagrams or visual components. Many of the visuals are designed using editable formats such as Google Slides, making it easy for users to customize them for their own work.

Downloads: 2 This Week

Last Update: 2026-03-10
8

ML workspace

All-in-one web-based IDE specialized for machine learning

The ML workspace is an all-in-one web-based IDE specialized for machine learning and data science. It is simple to deploy and gets you started within minutes to productively build ML solutions on your own machines. The workspace comes preloaded with a variety of popular data science libraries (e.g., TensorFlow, PyTorch, Keras, Sklearn) and dev tools (e.g., Jupyter, VS Code, TensorBoard), configured, optimized, and integrated. It is usable as a remote kernel (Jupyter) or remote machine (VS Code) via SSH, and is easy to deploy on Mac, Linux, and Windows via Docker. It includes the Jupyter, JupyterLab, and Visual Studio Code web-based IDEs. By default, the workspace container has no resource constraints and can use as much of a given resource as the host’s kernel scheduler allows.

Downloads: 2 This Week

Last Update: 2022-07-12
9

MLflow

Open source platform for the machine learning lifecycle

MLflow is a platform to streamline machine learning development, including tracking experiments, packaging code into reproducible runs, and sharing and deploying models. MLflow offers a set of lightweight APIs that can be used with any existing machine learning application or library (TensorFlow, PyTorch, XGBoost, etc), wherever you currently run ML code (e.g. in notebooks, standalone applications or the cloud).

Downloads: 2 This Week

Last Update: 2026-03-05
10

MNE-Python

Magnetoencephalography (MEG) and Electroencephalography (EEG) in Python

MNE-Python is an open-source Python package for exploring, visualizing, and analyzing human neurophysiological data such as MEG, EEG, sEEG, ECoG, and more. It includes modules for data input/output, preprocessing, visualization, source estimation, time-frequency analysis, connectivity analysis, machine learning, statistics, and more.

Downloads: 2 This Week

Last Update: 2025-11-21
11

MONAI

AI Toolkit for Healthcare Imaging

The MONAI framework is the open-source foundation being created by Project MONAI. MONAI is a freely available, community-supported, PyTorch-based framework for deep learning in healthcare imaging, released under the Apache 2.0 license. It provides domain-optimized foundational capabilities for developing healthcare imaging training workflows in a native PyTorch paradigm. Project MONAI also includes MONAI Label, an intelligent open source image labeling and learning tool that helps researchers and clinicians collaborate, create annotated datasets, and build AI models in a standardized MONAI paradigm. The project aims to capture best practices of AI development for healthcare researchers, with an immediate focus on medical imaging; to provide user-comprehensible error messages and an easy-to-program API; and to provide reproducibility of research experiments for comparisons against state-of-the-art implementations.

Downloads: 2 This Week

Last Update: 2026-01-27
12

MagicMirror²

Modular smart mirror platform with a list of installable modules

MagicMirror² is open source, free, and maintained by a big group of enthusiasts. Got a nice idea? Send us a pull request and become a part of the big list of contributors. The core of MagicMirror² contains a strong API which allows third-party developers to build additional modules, so there are modules you can use and modules you can develop. Read our extensive documentation to find out everything you want to know about the MagicMirror² project. On the forum you will find a big list of MagicMirror² enthusiasts. Share your ideas, ask your questions, and get support. It is the perfect place for you to start. MagicMirror² has an extensively documented API that allows you to build your own module backed by a powerful backend. Check out the API documentation for more information and start developing today.

Downloads: 2 This Week

Last Update: 23 hours ago
13

MegEngine

Easy-to-use deep learning framework with 3 key features

MegEngine is a fast, scalable, and easy-to-use deep learning framework with 3 key features. First, you can represent quantization, dynamic shape, image pre-processing, and even derivation in one model. After training, just put everything into your model and run inference on any platform with ease. Speed and precision problems won't bother you anymore, because training and inference use the same core. Second, in training, GPU memory usage can go down to one-third by adding only one line of code, which enables the DTR algorithm. Third, you can get the lowest memory usage when running inference by leveraging MegEngine's pushdown memory planner. NOTE: MegEngine now supports Python installation on Linux 64-bit, Windows 64-bit, macOS 10.14+ (CPU only), and Android 7+ (CPU only) platforms with Python 3.5 to 3.8. On Windows 10 you can either install the Linux distribution through Windows Subsystem for Linux (WSL) or install the Windows distribution directly. Many other platforms are supported for inference.

Downloads: 2 This Week

Last Update: 2024-04-30
14

MindsDB

Making Enterprise Data Intelligent and Responsive for AI

MindsDB is an AI data solution that enables humans, AI, agents, and applications to query data in natural language and SQL, and get highly accurate answers across disparate data sources and types. MindsDB connects to diverse data sources and applications, and unifies petabyte-scale structured and unstructured data. Powered by an industry-first cognitive engine that can operate anywhere (on-prem, VPC, serverless), it empowers both humans and AI with highly informed decision-making capabilities. A federated query engine that tidies up your data-sprawl chaos while meticulously answering every single question you throw at it. MindsDB has an MCP server built in that enables your MCP applications to connect, unify and respond to questions over large-scale federated data—spanning databases, data warehouses, and SaaS applications.

Downloads: 2 This Week

Last Update: 2026-03-03
15

MiniSom

MiniSom is a minimalistic implementation of the Self Organizing Maps

MiniSom is a minimalistic, NumPy-based implementation of Self-Organizing Maps (SOM). A SOM is a type of artificial neural network able to convert complex, nonlinear statistical relationships between high-dimensional data items into simple geometric relationships on a low-dimensional display. MiniSom is designed to allow researchers to easily build on top of it and to give students the ability to quickly grasp its details. The project initially aimed for a minimalistic implementation of the SOM algorithm, focusing on simplicity in features, dependencies, and code style. Although it has expanded in terms of features, it remains minimalistic by relying only on the NumPy library and emphasizing vectorization in coding style.
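To make the idea of a Self-Organizing Map concrete, here is a simplified sketch in plain Python. It is not MiniSom's implementation or API: the function names, the Gaussian neighborhood, and the linear learning-rate decay are assumptions chosen for illustration. Each training step finds the grid node closest to a sample (the best matching unit) and pulls that node and its grid neighbors toward the sample.

```python
import math
import random


def best_matching_unit(weights, sample):
    """Return (row, col) of the node whose weight vector is closest to sample."""
    best, best_dist = None, float("inf")
    for i, row in enumerate(weights):
        for j, w in enumerate(row):
            dist = sum((wk - sk) ** 2 for wk, sk in zip(w, sample))
            if dist < best_dist:
                best, best_dist = (i, j), dist
    return best


def train_som(data, rows, cols, iterations, learning_rate=0.5, sigma=1.0, seed=0):
    """Train a rows x cols map on data and return the grid of weight vectors."""
    rng = random.Random(seed)
    dim = len(data[0])
    weights = [[[rng.random() for _ in range(dim)] for _ in range(cols)]
               for _ in range(rows)]
    for t in range(iterations):
        sample = data[rng.randrange(len(data))]
        bi, bj = best_matching_unit(weights, sample)
        lr = learning_rate * (1.0 - t / iterations)  # linear decay (assumed)
        for i in range(rows):
            for j in range(cols):
                # Gaussian neighborhood measured on the grid, not in data space.
                grid_dist_sq = (i - bi) ** 2 + (j - bj) ** 2
                influence = math.exp(-grid_dist_sq / (2 * sigma ** 2))
                w = weights[i][j]
                for k in range(dim):
                    w[k] += lr * influence * (sample[k] - w[k])
    return weights


# Toy two-dimensional data, for illustration only.
data = [[0.0, 0.0], [0.1, 0.1], [0.9, 0.9], [1.0, 1.0]]
weights = train_som(data, rows=3, cols=3, iterations=500)
print(best_matching_unit(weights, [0.0, 0.0]), best_matching_unit(weights, [1.0, 1.0]))
```

The nested loops keep the update rule easy to follow; a NumPy-based library would typically express the same update with vectorized array operations.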

Downloads: 2 This Week

Last Update: 2026-01-14
16

Neural Network Intelligence

AutoML toolkit to automate the machine learning lifecycle

Neural Network Intelligence (NNI) is a lightweight but powerful open source AutoML toolkit that helps users automate the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyperparameter tuning. The tool manages automated machine learning (AutoML) experiments, and dispatches and runs the trial jobs generated by tuning algorithms to search for the best neural architecture and/or hyperparameters in different training environments such as Local Machine, Remote Servers, OpenPAI, Kubeflow, FrameworkController on K8S (AKS etc.), DLWorkspace (aka DLTS), AML (Azure Machine Learning), and other cloud options. NNI provides a command-line tool as well as a user-friendly WebUI to manage training experiments.

Downloads: 2 This Week

Last Update: 2023-09-13
17

PumpkinBook

Machine Learning formula derivation and analysis

All the contents of the Pumpkin Book assume Zhou Zhihua's "Machine Learning" (the Watermelon Book) as prerequisite knowledge, so the best way to use the Pumpkin Book is to follow the Watermelon Book as the main text and refer to the Pumpkin Book when you encounter a formula that you cannot derive or understand. We strive to explain and derive each formula from the perspective of undergraduate mathematics. Mathematics that goes beyond the undergraduate syllabus is therefore usually given in the form of appendices and references, and interested students can continue to study in depth using the material provided. For beginners who are new to machine learning, going deep into the formulas in Chapter 1 and Chapter 2 of the Watermelon Book is strongly discouraged. You can simply skim them; it will not be too late to come back and work through them once you have learned a little more.

Downloads: 2 This Week

Last Update: 2023-11-17
18

PyTorch Geometric

Geometric deep learning extension library for PyTorch

PyTorch Geometric consists of various methods for deep learning on graphs and other irregular structures, also known as geometric deep learning, from a variety of published papers. In addition, it includes an easy-to-use mini-batch loader for many small and single giant graphs, a large number of common benchmark datasets (based on simple interfaces to create your own), and helpful transforms, both for learning on arbitrary graphs and on 3D meshes or point clouds. A lot of the functionality of PyTorch Geometric has been outsourced to other packages, which need to be installed additionally. These packages come with their own CPU and GPU kernel implementations based on C++/CUDA extensions. Installing as the root user on your system Python is not recommended. Please set up an Anaconda/Miniconda environment or create a Docker image. Pip wheels are provided for all major OS/PyTorch/CUDA combinations.

Downloads: 2 This Week

Last Update: 2025-10-14
19

PyTorch Transfer-Learning-Library

Transfer Learning Library for Domain Adaptation, Task Adaptation, etc.

TLlib is an open-source and well-documented library for Transfer Learning. It is based on pure PyTorch with high performance and friendly API. Our code is pythonic, and the design is consistent with torchvision. You can easily develop new algorithms or readily apply existing algorithms. We appreciate all contributions. If you are planning to contribute back bug-fixes, please do so without any further discussion. If you plan to contribute new features, utility functions or extensions, please first open an issue and discuss the feature with us.

Downloads: 2 This Week

Last Update: 2022-08-10
20

Quantitative Trading System

A comprehensive quantitative trading system with AI-powered analysis

Quantitative Trading System is a comprehensive quantitative trading platform that integrates artificial intelligence, financial data analysis, and automated strategy execution within a unified software system. The project is designed to provide an end-to-end infrastructure for building and operating algorithmic trading strategies in financial markets. It includes tools for collecting and processing market data from multiple sources, performing statistical and machine learning analysis, and generating trading signals based on quantitative models. The system supports real-time data streaming, allowing strategies to respond to market conditions as they evolve. QuantMuse also incorporates advanced risk management features, including portfolio monitoring, risk limits, and dynamic position sizing to control exposure.

Downloads: 2 This Week

Last Update: 2026-03-12
21

RF-DETR

RF-DETR is a real-time object detection and segmentation model

RF-DETR is an open-source computer vision framework that implements a real-time object detection and instance segmentation model based on transformer architectures. Developed by Roboflow, the project builds upon modern vision transformer backbones such as DINOv2 to achieve strong accuracy while maintaining efficient inference speeds suitable for real-time applications. The model is designed to detect objects and segment them within images or video streams using a unified detection pipeline. RF-DETR emphasizes strong performance across both accuracy and latency benchmarks, allowing developers to deploy high-quality detection models in applications that require immediate processing such as robotics, autonomous systems, and industrial inspection. The repository includes Python packages, training scripts, and model configurations that enable researchers and engineers to train and deploy detection models on custom datasets.

Downloads: 2 This Week

Last Update: 6 days ago
22

SageMaker Python SDK

Training and deploying machine learning models on Amazon SageMaker

SageMaker Python SDK is an open source library for training and deploying machine learning models on Amazon SageMaker. With the SDK, you can train and deploy models using popular deep learning frameworks Apache MXNet and TensorFlow. You can also train and deploy models with Amazon algorithms, which are scalable implementations of core machine learning algorithms that are optimized for SageMaker and GPU training. If you have your own algorithms built into SageMaker-compatible Docker containers, you can train and host models using these as well.

Downloads: 2 This Week

Last Update: 2 days ago
23

SkyPilot

SkyPilot: Run AI and batch jobs on any infra

SkyPilot is a framework for running AI and batch workloads on any infra (Kubernetes or 12+ clouds). It offers unified execution, high cost savings, and high GPU availability via a simple interface.

Downloads: 2 This Week

Last Update: 2026-03-24
24

TensorFlow Documentation

TensorFlow documentation

An end-to-end platform for machine learning. TensorFlow makes it easy to create ML models that can run in any environment. Learn how to use the intuitive APIs through interactive code samples.

Downloads: 2 This Week

Last Update: 2024-08-02
25

The Unsplash Dataset

Unsplash images made available for research and machine learning

The Unsplash Dataset is made up of contributions from more than 350,000 global photographers and data sourced from hundreds of millions of searches across a nearly unlimited number of uses and contexts. Due to the breadth of intent and semantics contained within the Unsplash Dataset, it enables new opportunities for research and learning.

Downloads: 2 This Week

Last Update: 2025-04-16