Open Source Linux Machine Learning Software - Page 5

Sort By:

Machine Learning Software for Linux

View 58 business solutions

Machine Learning Linux Clear Filters

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free
1

Pandas Profiling

Create HTML profiling reports from pandas DataFrame objects

pandas-profiling generates profile reports from a pandas DataFrame. The pandas df.describe() function is handy yet a little basic for exploratory data analysis. pandas-profiling extends pandas DataFrame with df.profile_report(), which automatically generates a standardized univariate and multivariate report for data understanding. High correlation warnings, based on different correlation metrics (Spearman, Pearson, Kendall, Cramér’s V, Phik). Most common categories (uppercase, lowercase, separator), scripts (Latin, Cyrillic) and blocks (ASCII, Cyrilic). File sizes, creation dates, dimensions, indication of truncated images and existance of EXIF metadata. Mostly global details about the dataset (number of records, number of variables, overall missigness and duplicates, memory footprint). Comprehensive and automatic list of potential data quality issues (high correlation, skewness, uniformity, zeros, missing values, constant values, between others).

Downloads: 4 This Week

Last Update: 5 days ago
See Project
2

Pwnagotchi

Deep Reinforcement learning instrumenting bettercap for WiFi pwning

Pwnagotchi is an A2C-based “AI” powered by bettercap and running on a Raspberry Pi Zero W that learns from its surrounding WiFi environment in order to maximize the crackable WPA key material it captures (either through passive sniffing or by performing deauthentication and association attacks). This material is collected on disk as PCAP files containing any form of handshake supported by hashcat, including full and half WPA handshakes as well as PMKIDs. Instead of merely playing Super Mario or Atari games like most reinforcement learning based “AI” (yawn), Pwnagotchi tunes its own parameters over time to get better at pwning WiFi things in the real world environments you expose it to. To give hackers an excuse to learn about reinforcement learning and WiFi networking, and have a reason to get out for more walks.

Downloads: 4 This Week

Last Update: 2021-11-29
See Project
3

PyTorch Geometric

Geometric deep learning extension library for PyTorch

It consists of various methods for deep learning on graphs and other irregular structures, also known as geometric deep learning, from a variety of published papers. In addition, it consists of an easy-to-use mini-batch loader for many small and single giant graphs, a large number of common benchmark datasets (based on simple interfaces to create your own), and helpful transforms, both for learning on arbitrary graphs as well as on 3D meshes or point clouds. We have outsourced a lot of functionality of PyTorch Geometric to other packages, which needs to be additionally installed. These packages come with their own CPU and GPU kernel implementations based on C++/CUDA extensions. We do not recommend installation as root user on your system python. Please setup an Anaconda/Miniconda environment or create a Docker image. We provide pip wheels for all major OS/PyTorch/CUDA combinations.

Downloads: 4 This Week

Last Update: 2025-10-14
See Project
4

Scanpy

Single-cell analysis in Python

Scanpy is a scalable toolkit for analyzing single-cell gene expression data built jointly with anndata. It includes preprocessing, visualization, clustering, trajectory inference and differential expression testing. The Python-based implementation efficiently deals with datasets of more than one million cells.

Downloads: 4 This Week

Last Update: 2026-04-10
See Project
Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
5

Smile

Statistical machine intelligence and learning engine

Smile is a fast and comprehensive machine learning engine. With advanced data structures and algorithms, Smile delivers the state-of-art performance. Compared to this third-party benchmark, Smile outperforms R, Python, Spark, H2O, xgboost significantly. Smile is a couple of times faster than the closest competitor. The memory usage is also very efficient. If we can train advanced machine learning models on a PC, why buy a cluster? Write applications quickly in Java, Scala, or any JVM languages. Data scientists and developers can speak the same language now! Smile provides hundreds advanced algorithms with clean interface. Scala API also offers high-level operators that make it easy to build machine learning apps. And you can use it interactively from the shell, embedded in Scala. The most complete machine learning engine. Smile covers every aspect of machine learning.

Downloads: 4 This Week

Last Update: 1 day ago
See Project
6

mlx

MLX: An array framework for Apple silicon

MlX offers a local web interface to browse, download, and run ML models via Hugging Face or local sources. It supports searching by tags or tasks, visualization of model metadata, quick inference demos, automatic setup of runtime environments, and works with PyTorch, TensorFlow, and ONNX. Ideal for researchers exploring and testing models via browser.

Downloads: 4 This Week

Last Update: 6 days ago
See Project
7

spaCy models

Models for the spaCy Natural Language Processing (NLP) library

spaCy is designed to help you do real work, to build real products, or gather real insights. The library respects your time, and tries to avoid wasting it. It's easy to install, and its API is simple and productive. spaCy excels at large-scale information extraction tasks. It's written from the ground up in carefully memory-managed Cython. If your application needs to process entire web dumps, spaCy is the library you want to be using. Since its release in 2015, spaCy has become an industry standard with a huge ecosystem. Choose from a variety of plugins, integrate with your machine learning stack and build custom components and workflows.

Downloads: 4 This Week

Last Update: 2026-03-18
See Project
8

AI Engineer Headquarters

A collection of scientific methods, processes, algorithms

AI-Engineer-Headquarters is a comprehensive educational repository designed to help developers become advanced AI engineers through a structured learning path and practical system-building exercises. The project serves as a curated collection of resources, methodologies, and tools covering topics across the entire artificial intelligence development lifecycle. Rather than focusing only on theoretical knowledge, the repository emphasizes applied learning and encourages engineers to build real systems that incorporate machine learning, large language models, data pipelines, and AI infrastructure. The curriculum includes a progression of topics such as foundational AI engineering skills, machine learning systems design, large language model usage, retrieval-augmented generation systems, model fine-tuning, and autonomous AI agents. It also promotes disciplined learning routines and project-based practice so learners can develop practical experience and build deployable solutions.

Downloads: 3 This Week

Last Update: 2026-03-11
See Project
9

Andrew NG Notes Collection

This is Andrew NG Coursera Handwritten Notes

Andrew-NG-Notes is a repository that provides comprehensive study notes for Andrew Ng’s widely known machine learning course. The project summarizes the key topics covered in the course, including supervised learning, neural networks, optimization algorithms, and model evaluation techniques. The notes aim to simplify complex mathematical explanations by organizing concepts into clear sections with diagrams, formulas, and concise descriptions. Each chapter mirrors the structure of the course curriculum, allowing students to review the material in a systematic way while following along with the lectures. The repository emphasizes conceptual clarity and practical understanding, helping learners connect mathematical foundations with real machine learning applications.

Downloads: 3 This Week

Last Update: 2026-03-11
See Project
Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
10

CFNet

Training a Correlation Filter end-to-end allows lightweight networks

CFNet is the official implementation of End-to-end representation learning for Correlation Filter based tracking (CVPR 2017) by Luca Bertinetto, Jack Valmadre, João F. Henriques, Andrea Vedaldi, and Philip H. S. Torr. The framework combines correlation filters with deep convolutional neural networks to create an efficient and accurate visual object tracker. Unlike traditional correlation filter trackers that rely on hand-crafted features, CFNet learns feature representations directly from data in an end-to-end fashion. This allows the tracker to be both computationally efficient and robust to appearance changes such as scale, rotation, and illumination variations. The repository provides pre-trained models, training code, and testing scripts for evaluating the tracker on standard benchmarks. By bridging the gap between correlation filters and deep learning, CFNet provides a foundation for further research in real-time object tracking.

Downloads: 3 This Week

Last Update: 3 days ago
See Project
11

DeepDanbooru

AI based multi-label girl image classification system

DeepDanbooru is a deep learning system designed to automatically tag anime-style images using neural networks trained on datasets derived from the Danbooru imageboard. The project focuses on multi-label image classification, where a model predicts multiple descriptive tags that represent visual elements in an image. These tags may include characters, styles, clothing, emotions, or other attributes associated with anime artwork. The system uses convolutional neural networks trained on large datasets of tagged images to learn relationships between visual features and textual labels. Because the Danbooru dataset contains millions of images with extensive annotations, it provides a valuable training resource for machine learning models specializing in illustration analysis. Such datasets have been widely used for tasks including automatic image tagging, anime face detection, and generative modeling research.

Downloads: 3 This Week

Last Update: 2026-03-15
See Project
12

Diff Zoo

Differentiation for Hackers

Diff-zoo is a learning-focused handbook designed to demystify algorithmic differentiation (AD), the core technique powering modern machine learning frameworks. The project introduces AD from a foundational calculus perspective and gradually builds towards toy implementations that resemble systems like PyTorch and TensorFlow. It clarifies the differences and connections between forward mode, reverse mode, symbolic, numeric, tracing, and source transformation approaches to differentiation. Unlike production-grade AD systems that are often obscured by complex implementation details, these examples are deliberately simple and coherent to highlight the fundamental ideas. The repository is organized as a set of Julia notebooks, allowing learners to explore concepts interactively and compare different methods side by side. By stripping away unnecessary complexity, diff-zoo serves as both an educational resource and a practical guide for anyone.

Downloads: 3 This Week

Last Update: 1 day ago
See Project
13

FID score for PyTorch

Compute FID scores with PyTorch

This is a port of the official implementation of Fréchet Inception Distance to PyTorch. FID is a measure of similarity between two datasets of images. It was shown to correlate well with human judgement of visual quality and is most often used to evaluate the quality of samples of Generative Adversarial Networks. FID is calculated by computing the Fréchet distance between two Gaussians fitted to feature representations of the Inception network. The weights and the model are exactly the same as in the official Tensorflow implementation, and were tested to give very similar results (e.g. .08 absolute error and 0.0009 relative error on LSUN, using ProGAN generated images). However, due to differences in the image interpolation implementation and library backends, FID results still differ slightly from the original implementation. In difference to the official implementation, you can choose to use a different feature layer of the Inception network instead of the default pool3 layer.

Downloads: 3 This Week

Last Update: 2022-08-11
See Project
14

GIMP ML

AI for GNU Image Manipulation Program

This repository introduces GIMP3-ML, a set of Python plugins for the widely popular GNU Image Manipulation Program (GIMP). It enables the use of recent advances in computer vision to the conventional image editing pipeline. Applications from deep learning such as monocular depth estimation, semantic segmentation, mask generative adversarial networks, image super-resolution, de-noising and coloring have been incorporated with GIMP through Python-based plugins. Additionally, operations on images such as edge detection and color clustering have also been added. GIMP-ML relies on standard Python packages such as numpy, scikit-image, pillow, pytorch, open-cv, scipy. In addition, GIMP-ML also aims to bring the benefits of using deep learning networks used for computer vision tasks to routine image processing workflows.

Downloads: 3 This Week

Last Update: 2022-08-19
See Project
15

Lepton AI

A Pythonic framework to simplify AI service building

A Pythonic framework to simplify AI service building. Cutting-edge AI inference and training, unmatched cloud-native experience, and top-tier GPU infrastructure. Ensure 99.9% uptime with comprehensive health checks and automatic repairs.

Downloads: 3 This Week

Last Update: 2026-04-16
See Project
16

Machine Learning Beginner

Machine Learning Beginner Public Account Works

Machine Learning Beginner targets newcomers who are just getting started with machine learning and need a gentle, guided path. It introduces the core vocabulary and the mental map of supervised and unsupervised learning before moving into simple algorithms. The materials prioritize conceptual clarity, then progressively add code to solidify understanding. Step-by-step examples help learners see how data preparation, model training, evaluation, and iteration fit together. Because the scope is intentionally beginner-friendly, it’s an approachable springboard to more advanced resources. Educators also reference it as a compact toolkit for workshops and short intro courses.

Downloads: 3 This Week

Last Update: 2025-11-03
See Project
17

MagicMirror²

Modular smart mirror platform with a list of installable modules

MagicMirror² is Open Source, free and maintained by a big group of enthusiasts. Got a nice idea? Send us a pull request and become a part of the big list of contributors. The core of MagicMirror² contains a strong API which allows 3rd party developers to build additional modules. Modules you can use. Modules you can develop. Read our extensive documentation to find out everything you want to know about the MagicMirror² project. The full API description allows you to build your own modules. On the forum you will find a big list of MagicMirror² enthusiasts. Share your ideas, ask your questions and get support. The perfect place for you to start. MagicMirror² has an extensively documentated API. It allows you to built your own module backed by a powerful backend. Check out the API documentation for more information and start developing today.

Downloads: 3 This Week

Last Update: 2026-04-01
See Project
18

OpenVINO Notebooks

Jupyter notebook tutorials for OpenVINO

openvino_notebooks is a collection of interactive Jupyter notebooks designed to demonstrate how to build, optimize, and deploy artificial intelligence applications using the OpenVINO toolkit. The repository provides practical tutorials that guide developers through various AI workflows including computer vision, natural language processing, and generative AI tasks. Each notebook demonstrates how to run pre-trained models, optimize inference performance, and deploy models across hardware such as CPUs, GPUs, and specialized accelerators. The tutorials also illustrate how OpenVINO integrates with models from frameworks like PyTorch, TensorFlow, and ONNX to accelerate inference workloads. Many notebooks include end-to-end examples that show how to prepare input data, load optimized models, run inference, and visualize results. The project is particularly useful for developers who want to learn how to optimize machine learning inference pipelines for production environments.

Downloads: 3 This Week

Last Update: 5 days ago
See Project
19

Responsible AI Toolbox

Responsible AI Toolbox is a suite of tools providing model

Responsible AI Toolbox is a software framework designed to help developers evaluate and improve the reliability, fairness, and transparency of machine learning systems. The project provides tools that assist in analyzing model behavior, detecting bias, improving robustness, and explaining predictions produced by AI systems. It is designed to integrate with common machine learning frameworks, especially PyTorch, allowing developers to apply responsible AI techniques within existing workflows. The toolbox includes methods for adversarial testing, interpretability analysis, and model diagnostics that help developers understand how models behave under different conditions. These capabilities are particularly important for high-impact domains where AI systems must meet strict reliability and fairness requirements. By offering reusable components and standardized workflows, the framework helps organizations implement responsible AI practices throughout the machine learning lifecycle.

Downloads: 3 This Week

Last Update: 2026-03-12
See Project
20

Stanford Machine Learning Course

machine learning course programming exercise

The Stanford Machine Learning Course Exercises repository contains programming assignments from the well-known Stanford Machine Learning online course. It includes implementations of a variety of fundamental algorithms using Python and MATLAB/Octave. The repository covers a broad set of topics such as linear regression, logistic regression, neural networks, clustering, support vector machines, and recommender systems. Each folder corresponds to a specific algorithm or concept, making it easy for learners to navigate and practice. The exercises serve as practical, hands-on reinforcement of theoretical concepts taught in the course. This collection is valuable for students and practitioners who want to strengthen their skills in machine learning through coding exercises.

Downloads: 3 This Week

Last Update: 1 day ago
See Project
21

Teachable Machine

Explore how machine learning works, live in the browser

Teachable Machine is the open-source implementation of an experimental machine learning tool created by Google Creative Lab that allows users to train simple machine learning models directly in a web browser. The project demonstrates how neural networks can be trained interactively using images captured from a webcam or other inputs without requiring programming knowledge. Users can provide example images for different categories, and the system trains a model that learns to classify those inputs in real time. The project is built using web technologies and the TensorFlow.js ecosystem, enabling machine learning models to run locally within the browser environment. Because the training occurs locally, the system can respond quickly to new examples and provide immediate feedback to users.

Downloads: 3 This Week

Last Update: 2026-03-11
See Project
22

Zero to Mastery Machine Learning

All course materials for the Zero to Mastery Machine Learning

Zero to Mastery Machine Learning is an open-source repository that contains the complete course materials for the Zero to Mastery Machine Learning and Data Science bootcamp. The project provides a structured curriculum designed to teach machine learning and data science using Python through hands-on projects and interactive notebooks. The repository includes datasets, Jupyter notebooks, documentation, and example code that walk learners through the entire machine learning workflow from problem definition to model deployment. The course introduces essential tools such as NumPy, pandas, Matplotlib, and scikit-learn before moving on to deep learning with frameworks like TensorFlow and Keras. It also includes milestone projects that demonstrate how to build end-to-end machine learning systems using real datasets, including classification and regression tasks.

Downloads: 3 This Week

Last Update: 2026-03-11
See Project
23

fklearn

Functional Machine Learning

fklearn uses functional programming principles to make it easier to solve real problems with Machine Learning.

Downloads: 3 This Week

Last Update: 5 days ago
See Project
24

gplearn

Genetic Programming in Python, with a scikit-learn inspired API

gplearn implements Genetic Programming in Python, with a scikit-learn-inspired and compatible API. While Genetic Programming (GP) can be used to perform a very wide variety of tasks, gplearn is purposefully constrained to solving symbolic regression problems. This is motivated by the scikit-learn ethos, of having powerful estimators that are straightforward to implement. Symbolic regression is a machine learning technique that aims to identify an underlying mathematical expression that best describes a relationship. It begins by building a population of naive random formulas to represent a relationship between known independent variables and their dependent variable targets in order to predict new data. Each successive generation of programs is then evolved from the one that came before it by selecting the fittest individuals from the population to undergo genetic operations.

Downloads: 3 This Week

Last Update: 2026-01-07
See Project
25

spaGO

Self-contained Machine Learning and Natural Language Processing lib

A Machine Learning library written in pure Go designed to support relevant neural architectures in Natural Language Processing. Spago is self-contained, in that it uses its own lightweight computational graph both for training and inference, easy to understand from start to finish. The core module of Spago relies only on testify for unit testing. In other words, it has "zero dependencies", and we are committed to keeping it that way as much as possible. Spago uses a multi-module workspace to ensure that additional dependencies are downloaded only when specific features (e.g. persistent embeddings) are used. A good place to start is by looking at the implementation of built-in neural models, such as the LSTM. Except for a few linear algebra operations written in assembly for optimal performance (a bit of copying from Gonum), it's straightforward Go code, so you don't have to worry.

Downloads: 3 This Week

Last Update: 2023-10-30
See Project