Open Source Linux Machine Learning Software - Page 47

Machine Learning Software for Linux

1

drvq

dimensionality-recursive vector quantization

drvq is a C++ library implementing dimensionality-recursive vector quantization, a fast vector quantization method for high-dimensional Euclidean spaces under arbitrary data distributions. It is an approximation of k-means that is practically constant in data size and applies to arbitrarily high dimensions, but it scales only to a few thousand centroids. As a by-product of training, a tree structure performs either exact or approximate quantization on the trained centroids; the approximate mode is not very precise but extremely fast. A detailed README file describes how to use the software, covering the license, requirements, installation, file formats, sample data, tools, and options. With the sample data provided and the default options, the code can be tested immediately as a demo. DRVQ is released under a 2-clause BSD license. Please refer to the DRVQ software home page, the research project, or the original publication for more information. The latest code is available on GitHub.
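To make the terms above concrete, here is a minimal Python sketch of plain vector quantization: each vector is mapped to its nearest centroid, and one k-means (Lloyd) iteration refines the centroids. This is the brute-force baseline that drvq approximates, not the dimensionality-recursive method itself, and the names `quantize` and `kmeans_step` are hypothetical, not part of the drvq library.

```python
from math import dist  # Euclidean distance, available in Python 3.8+


def quantize(vector, centroids):
    """Return the index of the centroid closest to `vector` (exact, brute-force search)."""
    return min(range(len(centroids)), key=lambda i: dist(vector, centroids[i]))


def kmeans_step(data, centroids):
    """One Lloyd iteration: assign every point, then move each centroid to its cluster mean."""
    clusters = [[] for _ in centroids]
    for point in data:
        clusters[quantize(point, centroids)].append(point)
    updated = []
    for old, members in zip(centroids, clusters):
        if members:
            # Coordinate-wise mean of the points assigned to this centroid.
            updated.append(tuple(sum(axis) / len(members) for axis in zip(*members)))
        else:
            updated.append(old)  # keep an empty cluster's centroid unchanged
    return updated


# Toy data (illustrative only): two well-separated groups in 2-D.
data = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0)]
centroids = kmeans_step(data, [(1.0, 1.0), (9.0, 9.0)])
codes = [quantize(p, centroids) for p in data]
print(centroids, codes)
```

The brute-force search costs one distance computation per centroid for every vector, which is the cost that approximate, tree-based quantization is meant to avoid.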

Downloads: 0 This Week

Last Update: 2014-01-09
See Project
2

e-Metis - ML

Module for predicting learning difficulties.

Downloads: 0 This Week

Last Update: 2016-05-24
See Project
3

easy12306

Automatic recognition of 12306 verification codes

Automatically recognizes 12306 verification codes using a machine learning algorithm, and can identify never-before-seen pictures.

Downloads: 0 This Week

Last Update: 2022-08-05
See Project
4

elasticsearch-learning-to-rank

Plugin to integrate Learning to Rank

The Elasticsearch Learning to Rank plugin uses machine learning to improve search relevance ranking. It's powering search at places like Wikimedia Foundation and Snagajob.

Downloads: 0 This Week

Last Update: 2026-02-19
See Project
5

elf-project

The name stands for ensemble learning framework. It is a collection of machine learning algorithms for classification and regression with the possibility of connecting them together via ensemble learning. It is written in C++.
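As a rough illustration of what "connecting algorithms together via ensemble learning" can mean, the Python sketch below combines several base models by majority vote (classification) and by averaging (regression). It is a generic sketch under stated assumptions: the base models are hypothetical toy lambdas, and nothing here reflects elf-project's actual C++ interfaces or the specific ensemble methods it ships.

```python
from collections import Counter
from statistics import mean


def vote(classifiers, x):
    """Classification ensemble: return the label predicted by the most base classifiers."""
    return Counter(clf(x) for clf in classifiers).most_common(1)[0][0]


def average(regressors, x):
    """Regression ensemble: return the mean of the base regressors' predictions."""
    return mean(reg(x) for reg in regressors)


# Hypothetical base models, standing in for separately trained learners.
classifiers = [
    lambda x: "pos" if x > 0 else "neg",
    lambda x: "pos" if x > 1 else "neg",
    lambda x: "pos" if x > -1 else "neg",
]
regressors = [lambda x: 2 * x, lambda x: 2 * x + 1, lambda x: 2 * x - 1]

label = vote(classifiers, 0.5)      # two of three classifiers say "pos"
estimate = average(regressors, 3)   # mean of 6, 7 and 5
print(label, estimate)
```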

Downloads: 0 This Week

Last Update: 2013-04-03
See Project
6

entity-metadata

Lists of people, churches, and other entities

Here are lists of entities, such as people, businesses, and churches. These are large files related to this repository https://github.com/az0/entity-metadata

1 Review

Downloads: 0 This Week

Last Update: 2024-01-25
See Project
7

exchange-core

Ultra-fast matching engine written in Java based on LMAX Disruptor

Exchange-core is an open-source market exchange core based on LMAX Disruptor, Eclipse Collections (formerly Goldman Sachs GS Collections), Real Logic Agrona, OpenHFT Chronicle-Wire, LZ4 Java, and Adaptive Radix Trees. It is designed for high scalability and pauseless 24/7 operation under high-load conditions while providing low-latency responses. A single order book configuration can process 5M operations per second on 10-year-old hardware (Intel® Xeon® X5690) with moderate latency degradation. It is optimized for HFT: the priority is the mean latency of the limit-order-move operation (currently ~0.5µs). A cancel operation takes ~0.7µs, and placing a new order takes ~1.0µs. It supports disk journaling and journal replay, state snapshots (serialization) and restore operations, and LZ4 compression. The order matching and risk control algorithms are lock-free and contention-free, and matching engine and risk control operations are atomic and deterministic.

Downloads: 0 This Week

Last Update: 2022-04-15
See Project
8

fantail-mlkit

The fantail machine learning toolkit (Moved)

Moved to https://github.com/quansun/fantail-ml

Downloads: 0 This Week

Last Update: 2017-06-16
See Project
9

fastNLP

fastNLP: A Modularized and Extensible NLP Framework

fastNLP is a lightweight framework for natural language processing (NLP) whose goal is to make it quick to implement NLP tasks and build complex models. A unified tabular data container simplifies data preprocessing. Built-in Loader and Pipe classes for multiple datasets eliminate the need for preprocessing code. It includes various convenient NLP tools, such as embedding loading (including ELMo and BERT) and an intermediate data cache. It provides a variety of neural network components and reproduced models (covering tasks such as Chinese word segmentation, named entity recognition, syntactic analysis, text classification, text matching, coreference resolution, and summarization). Trainer provides a variety of built-in Callback functions to facilitate experiment recording, exception capture, and similar tasks. Some datasets and pre-trained models are downloaded automatically.

Downloads: 0 This Week

Last Update: 2022-08-05
See Project
10

fastai

Deep learning library

fastai is a deep learning library which provides practitioners with high-level components that can quickly and easily provide state-of-the-art results in standard deep learning domains, and provides researchers with low-level components that can be mixed and matched to build new approaches. It aims to do both things without substantial compromises in ease of use, flexibility, or performance. This is possible thanks to a carefully layered architecture, which expresses common underlying patterns of many deep learning and data processing techniques in terms of decoupled abstractions. These abstractions can be expressed concisely and clearly by leveraging the dynamism of the underlying Python language and the flexibility of the PyTorch library. fastai is organized around two main design goals: to be approachable and rapidly productive, while also being deeply hackable and configurable. It is built on top of a hierarchy of lower-level APIs which provide composable building blocks.

Downloads: 0 This Week

Last Update: 2026-02-14
See Project
11

fastdup

An unsupervised and free tool for image and video dataset analysis

fastdup is a free tool designed to rapidly extract insights from image and video datasets. It helps you improve the quality of your dataset's images and labels and reduce your data operations costs at scale.

Downloads: 0 This Week

Last Update: 2024-08-16
See Project
12

fe4ml-zh

Feature Engineering for Machine Learning

fe4ml-zh is an open-source project that provides a Chinese translation and structured documentation of the book Feature Engineering for Machine Learning. The repository aims to make advanced feature engineering concepts accessible to a broader audience by translating the content and organizing it into readable documentation and code examples. Feature engineering is a critical component of machine learning pipelines because it determines how raw data is transformed into features that algorithms can use effectively. The project explains techniques for creating, selecting, and transforming features in ways that improve model accuracy and robustness. It also discusses the role of domain knowledge, data preprocessing, and statistical reasoning in building effective machine learning models.

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
13

feed4weka

feed4weka is an open library that enriches Weka (http://www.cs.waikato.ac.nz/ml/weka/), an open source project for data analysis. It integrates new classification and clustering algorithms, and adds co-clustering and outlier detection frameworks.

Downloads: 0 This Week

Last Update: 2013-07-01
See Project
14

find-similar

User-friendly library to find similar objects

The mission of the FindSimilar project is to provide a powerful and versatile open source library that empowers developers to efficiently find similar objects and perform comparisons across a variety of data types. Whether dealing with texts, images, audio, or more, our project aims to simplify the process of identifying similarities and enhancing decision-making.

GitHub repo: https://github.com/findsimilar/find-similar
Demo project and tutorial: http://demo.findsimilar.org/
Documentation: https://docs.findsimilar.org/

1 Review

Downloads: 0 This Week

Last Update: 2023-11-12
See Project
15

fklearn

Functional Machine Learning

fklearn uses functional programming principles to make it easier to solve real problems with Machine Learning.

Downloads: 0 This Week

Last Update: 2025-02-26
See Project
16

flair

A very simple framework for state-of-the-art NLP

Developed by Humboldt University of Berlin and friends.

A powerful NLP library: Flair allows you to apply our state-of-the-art natural language processing (NLP) models to your text, such as named entity recognition (NER), sentiment analysis, part-of-speech tagging (PoS), special support for biomedical texts, sense disambiguation and classification, with support for a rapidly growing number of languages.

A text embedding library: Flair has simple interfaces that allow you to use and combine different word and document embeddings, including our proposed Flair embeddings and various transformers.

A PyTorch NLP framework: Our framework builds directly on PyTorch, making it easy to train your own models and experiment with new approaches using Flair embeddings and classes.

Downloads: 0 This Week

Last Update: 2025-02-05
See Project
17

fscaret_shiny

UI for fscaret

A user interface (UI) application that exposes the automated feature selection provided by the 'fscaret' package for R.

Downloads: 0 This Week

Last Update: 2018-04-04
See Project
18

fsm4j

A general purpose Finite State Machine written in Java. It is easy to use, powerful, and fast.

Downloads: 0 This Week

Last Update: 2016-08-01
See Project
19

fugue

A unified interface for distributed computing

Fugue is a unified interface for distributed computing that lets users execute Python, Pandas, and SQL code on Spark, Dask, and Ray with minimal rewrites.

Downloads: 0 This Week

Last Update: 2026-02-20
See Project
20

ggml

Tensor library for machine learning

ggml is an open-source tensor library designed for efficient machine learning computation with a focus on running models locally and with minimal dependencies. Written primarily in C and C++, the library provides low-level tensor operations and automatic differentiation that allow developers to implement machine learning algorithms and neural networks efficiently. The project emphasizes portability and performance, enabling machine learning inference across a wide range of hardware environments including CPUs and specialized accelerators. It is widely used as a foundational component in projects that run large language models locally, including tools that perform inference for transformer-based models. The library also implements optimization algorithms and computation graph functionality so developers can build training and inference workflows directly on top of its tensor operations.
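The idea of automatic differentiation over a computation graph can be sketched in a few lines of Python. The example below is a scalar-valued toy, not ggml's C API: the `Value` class and its methods are hypothetical names chosen for illustration, and a real tensor library operates on multi-dimensional arrays with many more operators.

```python
class Value:
    """A scalar node in a computation graph that remembers how it was produced."""

    def __init__(self, data, parents=(), grad_fns=()):
        self.data = data
        self.grad = 0.0
        self.parents = parents    # nodes this value was computed from
        self.grad_fns = grad_fns  # one function per parent: upstream grad -> parent grad

    def __add__(self, other):
        return Value(self.data + other.data, (self, other),
                     (lambda g: g, lambda g: g))

    def __mul__(self, other):
        return Value(self.data * other.data, (self, other),
                     (lambda g: g * other.data, lambda g: g * self.data))

    def backward(self):
        """Reverse-mode differentiation: walk the graph in reverse topological order."""
        order, seen = [], set()

        def visit(node):
            if id(node) not in seen:
                seen.add(id(node))
                for parent in node.parents:
                    visit(parent)
                order.append(node)

        visit(self)
        self.grad = 1.0  # d(output)/d(output)
        for node in reversed(order):
            for parent, grad_fn in zip(node.parents, node.grad_fns):
                parent.grad += grad_fn(node.grad)  # chain rule, accumulated per parent


# Forward pass builds the graph for y = x * w + b; backward fills in the gradients.
x, w, b = Value(3.0), Value(2.0), Value(1.0)
y = x * w + b
y.backward()
print(y.data, x.grad, w.grad, b.grad)
```

Building the graph first and differentiating it afterwards is what lets the same structure serve both inference (forward pass only) and training (forward plus backward).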

Downloads: 0 This Week

Last Update: 1 day ago
See Project
21

gplearn

Genetic Programming in Python, with a scikit-learn inspired API

gplearn implements Genetic Programming in Python, with a scikit-learn-inspired and compatible API. While Genetic Programming (GP) can be used to perform a very wide variety of tasks, gplearn is purposefully constrained to solving symbolic regression problems. This is motivated by the scikit-learn ethos, of having powerful estimators that are straightforward to implement. Symbolic regression is a machine learning technique that aims to identify an underlying mathematical expression that best describes a relationship. It begins by building a population of naive random formulas to represent a relationship between known independent variables and their dependent variable targets in order to predict new data. Each successive generation of programs is then evolved from the one that came before it by selecting the fittest individuals from the population to undergo genetic operations.
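The evolutionary loop described above can be sketched in pure Python. This is a deliberately small illustration, not gplearn's implementation or API: formulas are nested tuples, fitness is mean absolute error, selection is a three-way tournament, and subtree mutation is the only genetic operation shown (crossover is omitted). All function names, the operator set, and the parameter values are assumptions made for the example.

```python
import math
import random

OPS = {"add": lambda a, b: a + b, "sub": lambda a, b: a - b, "mul": lambda a, b: a * b}


def random_expr(rng, depth=2):
    """Build a naive random formula: (op, left, right), the variable "x", or a constant."""
    if depth == 0 or rng.random() < 0.3:
        return "x" if rng.random() < 0.5 else float(rng.randint(-2, 2))
    return (rng.choice(list(OPS)), random_expr(rng, depth - 1), random_expr(rng, depth - 1))


def evaluate(expr, x):
    """Evaluate a formula tree at the point x."""
    if expr == "x":
        return x
    if isinstance(expr, float):
        return expr
    op, left, right = expr
    return OPS[op](evaluate(left, x), evaluate(right, x))


def fitness(expr, samples):
    """Mean absolute error against the targets (lower is fitter)."""
    error = sum(abs(evaluate(expr, x) - y) for x, y in samples) / len(samples)
    return error if math.isfinite(error) else float("inf")


def mutate(expr, rng):
    """Subtree mutation: replace one randomly chosen subtree with a fresh random one."""
    if not isinstance(expr, tuple) or rng.random() < 0.3:
        return random_expr(rng)
    op, left, right = expr
    if rng.random() < 0.5:
        return (op, mutate(left, rng), right)
    return (op, left, mutate(right, rng))


def evolve(samples, generations=20, size=50, seed=0):
    """Evolve a population; return the best formula and the best error per generation."""
    rng = random.Random(seed)
    population = [random_expr(rng) for _ in range(size)]
    history = []
    for _ in range(generations):
        population.sort(key=lambda e: fitness(e, samples))
        history.append(fitness(population[0], samples))
        children = [population[0]]  # elitism: the fittest individual always survives
        while len(children) < size:
            # Tournament selection: the fittest of three random individuals reproduces.
            parent = min(rng.sample(population, 3), key=lambda e: fitness(e, samples))
            children.append(mutate(parent, rng))
        population = children
    best = min(population, key=lambda e: fitness(e, samples))
    return best, history


# Hypothetical target relationship y = x*x + x, sampled at a few points.
samples = [(x, x * x + x) for x in (-2.0, -1.0, 0.0, 1.0, 2.0)]
best, history = evolve(samples)
print(best, fitness(best, samples))
```

Because the fittest formula is copied unchanged into each new generation, the best error can never get worse from one generation to the next, although nothing guarantees that this toy search recovers the exact target expression.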

Downloads: 0 This Week

Last Update: 2026-01-07
See Project
22

gradslam

gradslam is an open source differentiable dense SLAM library

gradslam is an open-source framework providing differentiable building blocks for simultaneous localization and mapping (SLAM) systems. We enable the usage of dense SLAM subsystems from the comfort of PyTorch. The question of “representation” is central in the context of dense simultaneous localization and mapping (SLAM). Newer learning-based approaches have the potential to leverage data or task performance to directly inform the choice of representation. However, learning representations for SLAM has been an open question, because traditional SLAM systems are not end-to-end differentiable. In this work, we present gradSLAM, a differentiable computational graph take on SLAM. Leveraging the automatic differentiation capabilities of computational graphs, gradSLAM enables the design of SLAM systems that allow for gradient-based learning across each of their components, or the system as a whole.

Downloads: 0 This Week

Last Update: 2022-08-22
See Project
23

gunu

The aim of this project is to define an environment and the micro-organisms (gunus) that live and develop in it.

Downloads: 0 This Week

Last Update: 2015-12-17
See Project
24

handson-ml

Teaching you the fundamentals of Machine Learning in Python

handson-ml hosts the notebooks for the first edition of the same hands-on ML book, reflecting the tooling and idioms of its time while teaching durable concepts. It walks through supervised and unsupervised learning with scikit-learn, then introduces deep learning using the earlier TensorFlow 1 graph-execution style. The examples underscore fundamentals like bias-variance trade-offs, regularization, and proper validation, grounding learners before they move to deep nets. Even though the deep learning stack evolved, the classical ML sections remain highly relevant for production data problems. The code is crafted to be clear rather than clever, prioritizing readability for newcomers. As a historical snapshot and a still-useful primer, it pairs well with the second edition for understanding how the ecosystem matured.

Downloads: 0 This Week

Last Update: 2026-03-19
See Project
25

hls4ml

Machine learning on FPGAs using HLS

hls4ml is an open-source framework that enables machine learning models to be implemented directly on hardware such as FPGAs and ASICs using high-level synthesis techniques. The system converts trained neural network models from common machine learning frameworks into hardware description code suitable for ultra-low-latency inference. This approach allows machine learning algorithms to run directly on specialized hardware, making them suitable for applications that require extremely fast response times and minimal power consumption. The framework was originally developed for high-energy physics experiments where real-time decision systems must process large volumes of data with strict latency constraints. Over time, it has expanded to support a variety of scientific and industrial applications including signal processing, embedded systems, and biomedical monitoring.

Downloads: 0 This Week

Last Update: 2026-03-20
See Project