Open Source Linux Machine Learning Software - Page 10

Sort By:

Machine Learning Software for Linux

View 57 business solutions

Machine Learning Linux Clear Filters

$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
1

Pwnagotchi

Deep Reinforcement learning instrumenting bettercap for WiFi pwning

Pwnagotchi is an A2C-based “AI” powered by bettercap and running on a Raspberry Pi Zero W that learns from its surrounding WiFi environment in order to maximize the crackable WPA key material it captures (either through passive sniffing or by performing deauthentication and association attacks). This material is collected on disk as PCAP files containing any form of handshake supported by hashcat, including full and half WPA handshakes as well as PMKIDs. Instead of merely playing Super Mario or Atari games like most reinforcement learning based “AI” (yawn), Pwnagotchi tunes its own parameters over time to get better at pwning WiFi things in the real world environments you expose it to. To give hackers an excuse to learn about reinforcement learning and WiFi networking, and have a reason to get out for more walks.

Downloads: 1 This Week

Last Update: 2021-11-29
See Project
2

PyBroker

Algorithmic Trading in Python with Machine Learning

Are you looking to enhance your trading strategies with the power of Python and machine learning? Then you need to check out PyBroker! This Python framework is designed for developing algorithmic trading strategies, with a focus on strategies that use machine learning. With PyBroker, you can easily create and fine-tune trading rules, build powerful models, and gain valuable insights into your strategy’s performance.

Downloads: 1 This Week

Last Update: 2026-03-05
See Project
3

PyCaret

An open-source, low-code machine learning library in Python

PyCaret is an open-source, low-code machine learning library in Python that automates machine learning workflows. It is an end-to-end machine learning and model management tool that speeds up the experiment cycle exponentially and makes you more productive. In comparison with the other open-source machine learning libraries, PyCaret is an alternate low-code library that can be used to replace hundreds of lines of code with few lines only. This makes experiments exponentially fast and efficient. PyCaret is essentially a Python wrapper around several machine learning libraries and frameworks such as scikit-learn, XGBoost, LightGBM, CatBoost, Optuna, Hyperopt, Ray, and few more. The design and simplicity of PyCaret are inspired by the emerging role of citizen data scientists, a term first used by Gartner. Citizen Data Scientists are power users who can perform both simple and moderately sophisticated analytical tasks that would previously have required more technical expertise.

Downloads: 1 This Week

Last Update: 2024-08-01
See Project
4

PyG

Graph Neural Network Library for PyTorch

PyG (PyTorch Geometric) is a library built upon PyTorch to easily write and train Graph Neural Networks (GNNs) for a wide range of applications related to structured data. It consists of various methods for deep learning on graphs and other irregular structures, also known as geometric deep learning, from a variety of published papers. In addition, it consists of easy-to-use mini-batch loaders for operating on many small and single giant graphs, multi GPU-support, DataPipe support, distributed graph learning via Quiver, a large number of common benchmark datasets (based on simple interfaces to create your own), the GraphGym experiment manager, and helpful transforms, both for learning on arbitrary graphs as well as on 3D meshes or point clouds. All it takes is 10-20 lines of code to get started with training a GNN model (see the next section for a quick tour).

Downloads: 1 This Week

Last Update: 2025-10-14
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

Python Code Tutorials

The Python Code Tutorials

Python Code Tutorials is a large educational repository that aggregates programming tutorials from the “The Python Code” website into a structured collection of Python projects and learning materials. The repository covers a wide range of programming topics including cybersecurity, networking, web scraping, machine learning, GUI development, and automation scripts. Each tutorial typically includes complete Python code examples and explanations that demonstrate how to build real tools and applications step by step. Many tutorials focus on practical implementations such as building network scanners, web scraping tools, object detection systems, and automation utilities using Python libraries. The repository is organized into thematic directories that group tutorials by topic, allowing learners to navigate easily between areas such as ethical hacking, multimedia processing, or machine learning.

Downloads: 1 This Week

Last Update: 2026-03-11
See Project
6

Python Machine Learning 3rd Ed.

The "Python Machine Learning (3rd edition)" book code repository

Python Machine Learning 3rd Ed. repository contains the complete source code that accompanies the book Python Machine Learning by Sebastian Raschka and collaborators. The project provides implementations of machine learning algorithms and data science workflows described in the book, enabling readers to experiment with real code while studying theoretical concepts. The repository includes Python notebooks and scripts demonstrating techniques such as data preprocessing, classification, regression, clustering, neural networks, and model evaluation. These examples are designed to illustrate how machine learning algorithms operate internally and how they can be applied to real datasets. Many examples rely on widely used libraries such as NumPy, scikit-learn, and deep learning frameworks to demonstrate modern machine learning workflows.

Downloads: 1 This Week

Last Update: 2026-03-10
See Project
7

Python Programming Hub

Learn Python and Machine Learning from scratch

Python Programming Hub repository by Tanu-N-Prabhu is an educational resource designed to help programmers learn Python programming and data science concepts through practical examples and notebooks. The project contains a wide range of tutorials and exercises that cover Python fundamentals, programming concepts, and applied techniques for data analysis and machine learning. Many sections are implemented as Jupyter notebooks, allowing learners to run code interactively while reading explanations of the concepts involved. The repository emphasizes hands-on learning by demonstrating real programming tasks such as data manipulation, statistical analysis, visualization, and automation. It also includes examples of commonly used libraries such as NumPy, Pandas, and other tools used in data science workflows.

Downloads: 1 This Week

Last Update: 2026-03-12
See Project
8

QuantResearch

Quantitative analysis, strategies and backtests

QuantResearch is a large educational repository dedicated to quantitative finance, algorithmic trading, and financial machine learning research. The project contains numerous notebooks and research materials demonstrating quantitative analysis techniques used in financial markets. These include implementations of factor models, statistical arbitrage strategies, portfolio optimization methods, and reinforcement learning approaches to trading. The repository also explores financial modeling topics such as vector autoregression, Gaussian mixture models, and option pricing techniques. Many notebooks demonstrate backtesting pipelines that allow users to evaluate trading strategies using historical market data. The project integrates machine learning methods with traditional quantitative finance models, illustrating how statistical techniques can be applied to asset management and trading.

Downloads: 1 This Week

Last Update: 2026-03-11
See Project
9

Rasa

Open source machine learning framework to automate text conversations

Rasa is an open source machine learning framework to automate text-and voice-based conversations. With Rasa, you can build contextual assistants on Facebook Messenger, Slack, Google Hangouts, Webex Teams, Microsoft Bot Framework, Rocket.Chat, Mattermost, Telegram, and Twilio or on your own custom conversational channels. Rasa helps you build contextual assistants capable of having layered conversations with lots of back-and-forths. In order for a human to have a meaningful exchange with a contextual assistant, the assistant needs to be able to use context to build on things that were previously discussed. Rasa enables you to build assistants that can do this in a scalable way. Rasa uses Poetry for packaging and dependency management. If you want to build it from the source, you have to install Poetry first. By default, Poetry will try to use the currently activated Python version to create the virtual environment for the current project automatically.

Downloads: 1 This Week

Last Update: 2025-01-14
See Project
Fully Managed MySQL, PostgreSQL, and SQL Server
Automatic backups, patching, replication, and failover. Focus on your app, not your database.

Cloud SQL handles your database ops end to end, so you can focus on your app.

Try Free
10

Raster Vision

Open source framework for deep learning satellite and aerial imagery

Raster Vision is an open source framework for Python developers building computer vision models on satellite, aerial, and other large imagery sets (including oblique drone imagery). There is built-in support for chip classification, object detection, and semantic segmentation using PyTorch. Raster Vision allows engineers to quickly and repeatably configure pipelines that go through core components of a machine learning workflow: analyzing training data, creating training chips, training models, creating predictions, evaluating models, and bundling the model files and configuration for easy deployment. The input to a Raster Vision pipeline is a set of images and training data, optionally with Areas of Interest (AOIs) that describe where the images are labeled. The output of a Raster Vision pipeline is a model bundle that allows you to easily utilize models in various deployment scenarios.

Downloads: 1 This Week

Last Update: 2024-08-30
See Project
11

SGX-Full-OrderBook-Tick-Data-Trading

Providing the solutions for high-frequency trading (HFT) strategies

SGX-Full-OrderBook-Tick-Data-Trading-Strategy is an open-source research project focused on modeling high-frequency financial market behavior using machine learning techniques. The repository analyzes tick-level order book data from the Singapore Exchange and attempts to capture the dynamics of limit order book movements. By extracting features such as order depth ratios and price movement indicators, the system trains machine learning models to predict short-term market changes. Several algorithms are used during model selection, including Random Forest, Extra Trees, AdaBoost, Gradient Boosting, and Support Vector Machines. The project evaluates models by predicting price direction within very short time windows and then applying a simple trading strategy based on those predictions. It also measures profitability through profit-and-loss analysis derived from the predicted signals.

Downloads: 1 This Week

Last Update: 2026-03-12
See Project
12

SHAP

A game theoretic approach to explain the output of ml models

SHAP (SHapley Additive exPlanations) is a game theoretic approach to explain the output of any machine learning model. It connects optimal credit allocation with local explanations using the classic Shapley values from game theory and their related extensions. While SHAP can explain the output of any machine learning model, we have developed a high-speed exact algorithm for tree ensemble methods. Fast C++ implementations are supported for XGBoost, LightGBM, CatBoost, scikit-learn and pyspark tree models. To understand how a single feature effects the output of the model we can plot the SHAP value of that feature vs. the value of the feature for all the examples in a dataset. Since SHAP values represent a feature's responsibility for a change in the model output, the plot below represents the change in predicted house price as RM (the average number of rooms per house in an area) changes.

Downloads: 1 This Week

Last Update: 2026-03-03
See Project
13

Semantra

Multi-tool for semantic search

Semantra is an open-source semantic search tool designed to help users explore large collections of documents by meaning rather than simple keyword matching. The software analyzes text and PDF documents stored locally and creates embeddings that allow queries to retrieve results based on conceptual similarity. It is primarily intended for individuals who need to extract insights from large document collections, including researchers, journalists, students, and historians. The system runs from the command line and automatically launches a local web interface where users can perform interactive searches and examine document passages related to a query. By relying on semantic embeddings and contextual analysis, the tool can identify passages that are relevant even when the query uses different wording than the source documents.

Downloads: 1 This Week

Last Update: 2026-03-11
See Project
14

Shogun

Unified and efficient Machine Learning since 1999

Shogun is a unified and efficient Machine Learning since 1999. Shogun is implemented in C++ and offers automatically generated, unified interfaces to Python, Octave, Java / Scala, Ruby, C#, R, Lua. We are currently working on adding more languages including JavaScript, D, and Matlab.

Downloads: 1 This Week

Last Update: 2024-08-07
See Project
15

Stanford Machine Learning Course

machine learning course programming exercise

The Stanford Machine Learning Course Exercises repository contains programming assignments from the well-known Stanford Machine Learning online course. It includes implementations of a variety of fundamental algorithms using Python and MATLAB/Octave. The repository covers a broad set of topics such as linear regression, logistic regression, neural networks, clustering, support vector machines, and recommender systems. Each folder corresponds to a specific algorithm or concept, making it easy for learners to navigate and practice. The exercises serve as practical, hands-on reinforcement of theoretical concepts taught in the course. This collection is valuable for students and practitioners who want to strengthen their skills in machine learning through coding exercises.

Downloads: 1 This Week

Last Update: 4 days ago
See Project
16

TNT

A lightweight library for PyTorch training tools and utilities

TNT is a lightweight training framework developed by Meta that simplifies the process of building and managing machine learning training loops using PyTorch. The project focuses on providing a flexible yet structured environment for implementing training pipelines without the complexity of large deep learning frameworks. It introduces modular abstractions that allow developers to organize training logic into reusable components such as trainers, evaluators, and callbacks. This design helps separate concerns such as model training, evaluation, logging, and checkpointing, making machine learning experiments easier to manage. The framework is particularly useful for large-scale experiments where maintaining clear training workflows becomes increasingly important. Because it is built on top of PyTorch, the framework integrates naturally with existing deep learning models and datasets.

Downloads: 1 This Week

Last Update: 6 days ago
See Project
17

TPOT

A Python Automated Machine Learning tool that optimizes ML

Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming. TPOT stands for Tree-based Pipeline Optimization Tool. Consider TPOT your Data Science Assistant. TPOT is a Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

Downloads: 1 This Week

Last Update: 2025-07-03
See Project
18

TabPFN

Foundation Model for Tabular Data

TabPFN is an open-source machine learning system that introduces a foundation model designed specifically for tabular data analysis. The model is based on transformer architectures and implements a prior-data fitted network that can perform supervised learning tasks such as classification and regression with minimal configuration. Unlike many traditional machine learning workflows that require extensive hyperparameter tuning and training cycles, TabPFN is pre-trained to perform inference directly on tabular datasets. This allows it to generate predictions extremely quickly, often within seconds, while maintaining competitive accuracy on small and medium-sized datasets. The system supports a variety of tabular machine learning tasks and is designed to handle structured datasets commonly found in spreadsheets, databases, and business analytics systems.

Downloads: 1 This Week

Last Update: 2026-03-26
See Project
19

Tensorforce

A TensorFlow library for applied reinforcement learning

Tensorforce is an open-source deep reinforcement learning framework built on TensorFlow, emphasizing modularized design and straightforward usability for applied research and practice.

Downloads: 1 This Week

Last Update: 9 hours ago
See Project
20

The Algorithms - C #

Collection of various algorithms in mathematics, machine learning

TheAlgorithms/C is an open-source repository that provides implementations of classic algorithms and data structures written in the C programming language. The project is part of the larger “The Algorithms” initiative, which aims to create educational resources by implementing algorithms in multiple programming languages. Within the C repository, contributors implement algorithms from many areas of computer science including sorting, searching, graph processing, mathematics, machine learning, and numerical methods. Each algorithm implementation is written to be readable and well documented so that students and developers can understand how the algorithm works internally. The repository is designed primarily as an educational resource that helps programmers study algorithm design and computational thinking. Contributors from around the world continually add new implementations and improvements, making the repository a collaborative learning project.

Downloads: 1 This Week

Last Update: 2026-03-10
See Project
21

The Julia Programming Language

High-level, high-performance dynamic language for technical computing

Julia is a fast, open source high-performance dynamic language for technical computing. It can be used for data visualization and plotting, deep learning, machine learning, scientific computing, parallel computing and so much more. Having a high level syntax, Julia is easy to use for programmers of every level and background. Julia has more than 2,800 community-registered packages including various mathematical libraries, data manipulation tools, and packages for general purpose computing. Libraries from Python, R, C/Fortran, C++, and Java can also be used.

Downloads: 1 This Week

Last Update: 2026-03-10
See Project
22

The SpeechBrain Toolkit

A PyTorch-based Speech Toolkit

SpeechBrain is an open-source and all-in-one conversational AI toolkit. It is designed to be simple, extremely flexible, and user-friendly. Competitive or state-of-the-art performance is obtained in various domains. SpeechBrain supports state-of-the-art methods for end-to-end speech recognition, including models based on CTC, CTC+attention, transducers, transformers, and neural language models relying on recurrent neural networks and transformers. Speaker recognition is already deployed in a wide variety of realistic applications. SpeechBrain provides different models for speaker recognition, including X-vector, ECAPA-TDNN, PLDA, and contrastive learning. Spectral masking, spectral mapping, and time-domain enhancement are different methods already available within SpeechBrain. Separation methods such as Conv-TasNet, DualPath RNN, and SepFormer are implemented as well. SpeechBrain provides efficient and GPU-friendly speech augmentation pipelines and acoustic features extraction.

Downloads: 1 This Week

Last Update: 3 days ago
See Project
23

TorchAudio

Data manipulation and transformation for audio signal processing

The aim of torchaudio is to apply PyTorch to the audio domain. By supporting PyTorch, torchaudio follows the same philosophy of providing strong GPU acceleration, having a focus on trainable features through the autograd system, and having consistent style (tensor names and dimension names). Therefore, it is primarily a machine learning library and not a general signal processing library. The benefits of PyTorch can be seen in torchaudio through having all the computations be through PyTorch operations which makes it easy to use and feel like a natural extension.

Downloads: 1 This Week

Last Update: 2026-02-17
See Project
24

TorchIO

Medical imaging toolkit for deep learning

TorchIO is an open-source Python library for efficient loading, preprocessing, augmentation and patch-based sampling of 3D medical images in deep learning, following the design of PyTorch. It includes multiple intensity and spatial transforms for data augmentation and preprocessing. These transforms include typical computer vision operations such as random affine transformations and also domain-specific ones such as simulation of intensity artifacts due to MRI magnetic field inhomogeneity (bias) or k-space motion artifacts. TorchIO is a Python package containing a set of tools to efficiently read, preprocess, sample, augment, and write 3D medical images in deep learning applications written in PyTorch, including intensity and spatial transforms for data augmentation and preprocessing. Transforms include typical computer vision operations such as random affine transformations and also domain-specific ones such as simulation of intensity artifacts due to MRI magnetic field inhomogeneity.

Downloads: 1 This Week

Last Update: 12 hours ago
See Project
25

Trax

Deep learning with clear code and speed

Trax is an end-to-end library for deep learning that focuses on clear code and speed. It is actively used and maintained in the Google Brain team. Run a pre-trained Transformer, create a translator in a few lines of code. Features and resources, API docs, where to talk to us, how to open an issue and more. Walkthrough, how Trax works, how to make new models and train on your own data. Trax includes basic models (like ResNet, LSTM, Transformer) and RL algorithms (like REINFORCE, A2C, PPO). It is also actively used for research and includes new models like the Reformer and new RL algorithms like AWR. Trax has bindings to a large number of deep learning datasets, including Tensor2Tensor and TensorFlow datasets. You can use Trax either as a library from your own python scripts and notebooks or as a binary from the shell, which can be more convenient for training large models. It runs without any changes on CPUs, GPUs and TPUs.

Downloads: 1 This Week

Last Update: 2021-10-26
See Project