Open Source Linux Machine Learning Software - Page 11

Machine Learning Software for Linux

  • 1
    Turing.jl

    Bayesian inference with probabilistic programming

    Downloads: 1 This Week
  • 2
    VoxelMorph

    Unsupervised Learning for Image Registration

    VoxelMorph is an open-source deep learning framework designed for medical image registration, a process that aligns multiple medical scans into a common spatial coordinate system. Traditional image registration techniques typically rely on optimization procedures that must be executed separately for each pair of images, which can be computationally expensive and slow. VoxelMorph approaches the problem using neural networks that learn to predict deformation fields that transform one image so that it aligns with another. Once the model has been trained, it can rapidly compute the transformation required to register new image pairs, significantly reducing computational time compared to classical registration algorithms. The framework supports both supervised and unsupervised learning approaches and is commonly used in medical imaging applications such as MRI alignment, anatomical analysis, and longitudinal studies.
    Downloads: 1 This Week
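The core operation described above, applying a predicted deformation field to a moving image, can be sketched in a few lines. This is a minimal illustration, not VoxelMorph's API: `warp_nearest` and the 2D list-of-rows layout are hypothetical, and nearest-neighbour sampling is used here as a simplification of the interpolation a real registration network would apply.

```python
def warp_nearest(image, flow):
    """Warp a 2D image (list of rows) with a dense displacement field.

    flow[y][x] = (dy, dx) means output pixel (y, x) samples the input at
    (y + dy, x + dx), rounded to the nearest pixel and clamped to the border.
    """
    height, width = len(image), len(image[0])
    warped = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            dy, dx = flow[y][x]
            src_y = min(max(int(round(y + dy)), 0), height - 1)
            src_x = min(max(int(round(x + dx)), 0), width - 1)
            warped[y][x] = image[src_y][src_x]
    return warped


moving = [
    [0, 0, 0, 0],
    [0, 9, 0, 0],
    [0, 0, 0, 0],
]
# A constant field that samples one pixel to the left,
# which shifts the image content one pixel to the right.
flow = [[(0.0, -1.0)] * 4 for _ in range(3)]
moved = warp_nearest(moving, flow)
```

In a learned registration pipeline the `flow` array would come from the trained network rather than being written by hand.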
  • 3
    Weights and Biases

    Tool for visualizing and tracking your machine learning experiments

    Use W&B to build better models faster. Track and visualize all the pieces of your machine learning pipeline, from datasets to production models. Quickly identify model regressions. Use W&B to visualize results in real time, all in a central dashboard. Focus on the interesting ML. Spend less time manually tracking results in spreadsheets and text files. Capture dataset versions with W&B Artifacts to identify how changing data affects your resulting models. Reproduce any model, with saved code, hyperparameters, launch commands, input data, and resulting model weights. Set wandb.config once at the beginning of your script to save your hyperparameters, input settings (like dataset name or model type), and any other independent variables for your experiments. This is useful for analyzing your experiments and reproducing your work in the future. Setting configs also allows you to visualize the relationships between features of your model architecture or data pipeline and model performance.
    Downloads: 1 This Week
  • 4
    Zero to Mastery Machine Learning

    All course materials for the Zero to Mastery Machine Learning

    Zero to Mastery Machine Learning is an open-source repository that contains the complete course materials for the Zero to Mastery Machine Learning and Data Science bootcamp. The project provides a structured curriculum designed to teach machine learning and data science using Python through hands-on projects and interactive notebooks. The repository includes datasets, Jupyter notebooks, documentation, and example code that walk learners through the entire machine learning workflow from problem definition to model deployment. The course introduces essential tools such as NumPy, pandas, Matplotlib, and scikit-learn before moving on to deep learning with frameworks like TensorFlow and Keras. It also includes milestone projects that demonstrate how to build end-to-end machine learning systems using real datasets, including classification and regression tasks.
    Downloads: 1 This Week
  • 5
    audioFlux

    A library for audio and music analysis, feature extraction

    audioFlux is a deep learning tool library for audio and music analysis and feature extraction. It supports dozens of time-frequency analysis transformation methods and hundreds of corresponding time-domain and frequency-domain feature combinations. The extracted features can be fed to deep learning networks for training and used to study audio tasks such as classification, separation, music information retrieval (MIR), and ASR. More broadly, it can be used for deep learning, pattern recognition, signal processing, bioinformatics, statistics, finance, etc.
    Downloads: 1 This Week
  • 6
    bulbea

    Deep Learning based Python Library for Stock Market Prediction

    bulbea is an open-source Python library designed for financial analysis and stock market prediction using machine learning and deep learning techniques. The library provides tools for retrieving financial time series data, preprocessing market data, and training predictive models that estimate future price movements. bulbea integrates common machine learning frameworks such as TensorFlow and Keras to build neural network models capable of learning patterns in historical financial data. It includes utilities for splitting datasets, normalizing time series, and training models such as recurrent neural networks that can capture temporal dependencies in market behavior. The library also incorporates sentiment analysis capabilities that analyze social media data, particularly from Twitter, to estimate public sentiment toward financial assets.
    Downloads: 1 This Week
  • 7
    cortex

    Production infrastructure for machine learning at scale

    Cortex is an open-source platform designed for building, deploying, and managing machine learning applications in production environments. The framework provides infrastructure tools that allow developers to transform trained machine learning models into scalable web services. Cortex handles many operational challenges associated with deploying AI systems, such as managing dependencies, orchestrating data pipelines, and scaling services under load. Developers can define machine learning pipelines as code using declarative configuration files, which simplifies the process of managing complex ML workflows. The platform supports integration with cloud environments and container orchestration systems so that applications can scale dynamically based on demand. It is designed to help teams focus on building machine learning logic rather than managing infrastructure details.
    Downloads: 1 This Week
  • 8
    cracking-the-data-science-interview

    A Collection of Cheatsheets, Books, Questions, and Portfolio

    Cracking the Data Science Interview is an open educational repository that collects study materials, resources, and reference links for preparing for data science interviews. The project organizes content across many fundamental areas of data science, including statistics, probability, SQL, machine learning, and deep learning. It includes cheat sheets that summarize important technical concepts commonly discussed during technical interviews. The repository also provides links to recommended books, tutorials, practice platforms, and blog posts that help learners strengthen their theoretical and practical skills. In addition to conceptual study materials, the project includes interview question banks and case study prompts that simulate real hiring scenarios. The resource is particularly useful for candidates preparing for technical interviews in data science, machine learning, or analytics roles.
    Downloads: 1 This Week
  • 9
    docext

    An on-premises, OCR-free unstructured data extraction

    docext is a document intelligence toolkit that uses vision-language models to extract structured information from documents such as PDFs, forms, and scanned images. The system is designed to operate entirely on-premises, allowing organizations to process sensitive documents without relying on external cloud services. Unlike traditional document processing pipelines that rely heavily on optical character recognition, docext leverages multimodal AI models capable of understanding both visual and textual information directly from document images. This allows the system to detect and extract structured elements such as tables, signatures, key fields, and layout information while maintaining semantic understanding of the document content. The toolkit can also convert complex documents into structured markdown representations that preserve formatting and contextual relationships.
    Downloads: 1 This Week
  • 10
    dtreeviz

    Python library for decision tree visualization & model interpretation

    A Python library for decision tree visualization and model interpretation. Decision trees are the fundamental building block of gradient boosting machines and Random Forests™, probably the two most popular machine learning models for structured data. Visualizing decision trees is a tremendous aid when learning how these models work and when interpreting models. The visualizations are inspired by an educational animation by R2D3, "A visual introduction to machine learning". Please see "How to visualize decision trees" for a deeper discussion of our decision tree visualization library and the visual design decisions we made.
    Downloads: 1 This Week
  • 11
    eos

    A lightweight 3D Morphable Face Model library in modern C++

    eos is a lightweight 3D Morphable Face Model fitting library that provides basic functionality to use face models, as well as camera and shape fitting functionality. It is written in modern C++11/14. It provides MorphableModel and PcaModel classes to represent 3DMMs, with basic operations like draw_sample(). It supports the Surrey Face Model (SFM), 4D Face Model (4DFM), Basel Face Model (BFM) 2009 and 2017, and the Liverpool-York Head Model (LYHM) out of the box.
    Downloads: 1 This Week
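To make the idea of drawing a sample from a PCA shape model concrete, here is a small Python sketch of the usual formulation: the mean shape plus a weighted sum of basis vectors, each scaled by the square root of its eigenvalue. It is not the eos API (eos is a C++ library); the function signature, the flat coordinate layout, and the toy two-vertex model are all assumptions made for illustration.

```python
import random


def draw_sample(mean, basis, eigenvalues, rng, sigma=1.0):
    """Return (sample, coeffs) where
    sample = mean + sum_i c_i * sqrt(eigenvalue_i) * basis_i, with c_i ~ N(0, sigma^2).

    mean is a flat list of coordinates; basis holds one flat vector per component.
    """
    coeffs = [rng.gauss(0.0, sigma) for _ in eigenvalues]
    sample = list(mean)
    for c, lam, component in zip(coeffs, eigenvalues, basis):
        scale = c * lam ** 0.5
        for k, value in enumerate(component):
            sample[k] += scale * value
    return sample, coeffs


# Toy model: two vertices (x, y, z each) and two principal components.
rng = random.Random(0)
mean = [0.0, 0.0, 0.0, 1.0, 0.0, 0.0]
basis = [
    [1.0, 0.0, 0.0, 1.0, 0.0, 0.0],   # moves both vertices along x
    [0.0, 1.0, 0.0, 0.0, -1.0, 0.0],  # moves the vertices apart along y
]
eigenvalues = [4.0, 1.0]
sample, coeffs = draw_sample(mean, basis, eigenvalues, rng)
```

Passing a seeded `random.Random` keeps the sketch reproducible; a real model would have thousands of vertices and many more components.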
  • 12
    fastNLP

    fastNLP: A Modularized and Extensible NLP Framework

    fastNLP is a lightweight framework for natural language processing (NLP) whose goal is to make it quick to implement NLP tasks and build complex models. A unified tabular data container simplifies data preprocessing. Built-in Loader and Pipe components cover multiple datasets, eliminating the need for preprocessing code. It includes various convenient NLP tools, such as embedding loading (including ELMo and BERT) and an intermediate data cache. It provides a variety of neural network components and reproduced models (covering tasks such as Chinese word segmentation, named entity recognition, syntactic analysis, text classification, text matching, coreference resolution, and summarization). The Trainer provides a variety of built-in Callback functions to facilitate experiment recording, exception capture, and more. Some datasets and pre-trained models are downloaded automatically.
    Downloads: 1 This Week
  • 13
    fastquant

    Backtest and optimize your ML trading strategies with only 3 lines

    fastquant is a Python library designed to simplify quantitative financial analysis and algorithmic trading strategy development. The project focuses on making backtesting accessible by providing a high-level interface that allows users to test investment strategies with only a few lines of code. It integrates historical market data sources and trading frameworks so that users can quickly build experiments without constructing complex data pipelines. The framework enables users to test common strategies such as moving average crossovers, momentum trading, and custom indicators on historical stock data. By automating data retrieval, strategy evaluation, and result visualization, the library reduces the barrier to entry for individuals interested in quantitative finance. The project also supports optimization workflows that allow users to search for parameter combinations that improve trading strategy performance.
    Downloads: 1 This Week
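As a rough illustration of what a moving average crossover backtest and a parameter search involve, the sketch below implements both in plain Python. It deliberately does not use fastquant's own interface: `sma`, `backtest_sma_crossover`, and `optimize` are hypothetical names, trades execute at the same bar's price, and fees and slippage are ignored.

```python
def sma(values, window):
    """Simple moving average; None until a full window is available."""
    out = []
    for i in range(len(values)):
        if i + 1 < window:
            out.append(None)
        else:
            out.append(sum(values[i + 1 - window:i + 1]) / window)
    return out


def backtest_sma_crossover(prices, fast, slow, cash=1000.0):
    """Hold the asset while the fast SMA is above the slow SMA, else hold cash.

    Returns the final portfolio value marked at the last price.
    """
    fast_ma, slow_ma = sma(prices, fast), sma(prices, slow)
    shares = 0.0
    for price, f, s in zip(prices, fast_ma, slow_ma):
        if f is None or s is None:
            continue
        if f > s and shares == 0.0:
            shares, cash = cash / price, 0.0    # buy with all cash
        elif f < s and shares > 0.0:
            cash, shares = shares * price, 0.0  # sell everything
    return cash + shares * prices[-1]


def optimize(prices, fast_grid, slow_grid):
    """Grid-search SMA windows; return (final_value, fast, slow) for the best pair."""
    results = [
        (backtest_sma_crossover(prices, fast, slow), fast, slow)
        for fast in fast_grid
        for slow in slow_grid
        if fast < slow
    ]
    return max(results)


prices = [10, 10, 10, 11, 12, 13, 12, 11, 10, 9]
best_value, best_fast, best_slow = optimize(prices, fast_grid=[2, 3], slow_grid=[3, 4, 5])
```

Searching over a grid like this on a single price history overfits easily, so any "best" pair is only a starting point for out-of-sample testing.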
  • 14
    gensim

    Topic Modelling for Humans

    Gensim is a Python library for topic modeling, document indexing, and similarity retrieval with large corpora. The target audience is the natural language processing (NLP) and information retrieval (IR) community.
    Downloads: 1 This Week
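Similarity retrieval over a corpus can be illustrated with a small bag-of-words example. The sketch below uses only the Python standard library and is not Gensim's API; `bow`, `cosine`, and `most_similar` are hypothetical helpers, and raw term counts stand in for the weighted vector models a real pipeline would use.

```python
import math
from collections import Counter


def bow(text):
    """Bag-of-words vector: lowercase whitespace tokens mapped to counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def most_similar(query, corpus):
    """Return (score, document_index) pairs sorted from most to least similar."""
    q = bow(query)
    scores = [(cosine(q, bow(doc)), i) for i, doc in enumerate(corpus)]
    return sorted(scores, reverse=True)


corpus = [
    "human machine interface for computer applications",
    "graph of trees and minors",
    "survey of user opinion of computer system response time",
]
ranking = most_similar("computer interface", corpus)
```

Here the first document ranks highest because it shares both query terms, and the second scores zero because it shares none.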
  • 15
    ggml

    Tensor library for machine learning

    ggml is an open-source tensor library designed for efficient machine learning computation with a focus on running models locally and with minimal dependencies. Written primarily in C and C++, the library provides low-level tensor operations and automatic differentiation that allow developers to implement machine learning algorithms and neural networks efficiently. The project emphasizes portability and performance, enabling machine learning inference across a wide range of hardware environments including CPUs and specialized accelerators. It is widely used as a foundational component in projects that run large language models locally, including tools that perform inference for transformer-based models. The library also implements optimization algorithms and computation graph functionality so developers can build training and inference workflows directly on top of its tensor operations.
    Downloads: 1 This Week
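The combination of a computation graph and automatic differentiation mentioned above can be sketched on scalars. This toy Python class is only a conceptual illustration and has nothing to do with ggml's C API; the `Value` class and its methods are hypothetical.

```python
class Value:
    """A scalar node in a computation graph with reverse-mode autodiff."""

    def __init__(self, data, parents=(), grad_fns=()):
        self.data = data
        self.grad = 0.0
        self._parents = parents    # nodes this value was computed from
        self._grad_fns = grad_fns  # one function per parent: upstream grad -> parent grad

    def __add__(self, other):
        return Value(self.data + other.data, (self, other),
                     (lambda g: g, lambda g: g))

    def __mul__(self, other):
        return Value(self.data * other.data, (self, other),
                     (lambda g: g * other.data, lambda g: g * self.data))

    def backward(self):
        """Accumulate d(self)/d(node) into node.grad for every node in the graph."""
        order, seen = [], set()

        def visit(node):
            if id(node) in seen:
                return
            seen.add(id(node))
            for parent in node._parents:
                visit(parent)
            order.append(node)  # parents are appended before their consumers

        visit(self)
        self.grad = 1.0
        # Walk consumers before producers so each grad is complete when it is used.
        for node in reversed(order):
            for parent, fn in zip(node._parents, node._grad_fns):
                parent.grad += fn(node.grad)


x, w, b = Value(3.0), Value(2.0), Value(1.0)
y = x * w + b   # forward pass builds the graph
z = y * y
z.backward()    # backward pass fills in the gradients
```

A tensor library applies the same two-pass idea to multidimensional arrays, with each graph node holding a tensor operation instead of a scalar one.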
  • 16
    hloc

    Visual localization made easy with hloc

    This is hloc, a modular toolbox for state-of-the-art 6-DoF visual localization. It implements Hierarchical Localization, leveraging image retrieval and feature matching, and is fast, accurate, and scalable. This codebase won the indoor/outdoor localization challenges at CVPR 2020 and ECCV 2020, in combination with SuperGlue, our graph neural network for feature matching. We provide step-by-step guides to localize with Aachen, InLoc, and to generate reference poses for your own data using SfM. Just download the datasets and you're ready to go! The notebook pipeline_InLoc.ipynb shows the steps for localizing with InLoc. It's much simpler since a 3D SfM model is not needed. We show in pipeline_SfM.ipynb how to run 3D reconstruction for an unordered set of images. This generates reference poses, and a nice sparse 3D model suitable for localization with the same pipeline as Aachen.
    Downloads: 1 This Week
  • 17
    ml.js

    Machine learning tools in JavaScript

    This library is a compilation of the tools developed in the mljs organization. It is mainly maintained for use in the browser. If you are working with Node.js, you might prefer to add to your dependencies only the libraries that you need, as they are usually published to npm more often. We prefix all our npm package names with ml- (e.g., ml-matrix) so they are easy to find.
    Downloads: 1 This Week
  • 18
    pattern_classification

    A collection of tutorials and examples for solving machine learning

    The pattern_classification repository is an educational project that provides tutorials, examples, and reference materials related to machine learning and statistical pattern recognition. The project aims to help learners understand the process of building predictive models by presenting structured explanations and practical examples. It includes notebooks and guides that demonstrate data preprocessing, feature extraction, model training, and evaluation techniques used in machine learning workflows. The repository also covers algorithms such as Bayesian classification, logistic regression, neural networks, clustering methods, and ensemble models. In addition to algorithm tutorials, the project contains supplementary resources such as dataset collections, visualization examples, and links to recommended books and talks. These materials are designed to support both theoretical understanding and practical experimentation with machine learning tools.
    Downloads: 1 This Week
  • 19
    pmdarima

    Statistical library designed to fill the void in Python's time series

    A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.
    Downloads: 1 This Week
  • 20
    segment-geospatial

    A Python package for segmenting geospatial data with the SAM

    The segment-geospatial package draws its inspiration from the segment-anything-eo repository authored by Aliaksandr Hancharenka. To facilitate the use of the Segment Anything Model (SAM) for geospatial data, I have developed the segment-anything-py and segment-geospatial Python packages, which are now available on PyPI and conda-forge. My primary objective is to simplify the process of leveraging SAM for geospatial data analysis by enabling users to achieve this with minimal coding effort. I have adapted the source code of segment-geospatial from the segment-anything-eo repository, and credit for its original version goes to Aliaksandr Hancharenka.
    Downloads: 1 This Week
  • 21
    supervision

    We write your reusable computer vision tools

    We write your reusable computer vision tools. Whether you need to load your dataset from your hard drive, draw detections on an image or video, or count how many detections are in a zone, you can count on us.
    Downloads: 1 This Week
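Counting how many detections fall inside a zone reduces to a point-in-polygon test on one anchor point per bounding box. The sketch below shows that idea in plain Python; it is not the supervision API, the function names are hypothetical, and using the bottom-center of each box as its anchor is an assumption made for this example.

```python
def point_in_polygon(point, polygon):
    """Ray casting: count polygon edges crossed by a horizontal ray going right from the point."""
    x, y = point
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        if (y1 > y) != (y2 > y):  # edge spans the ray's height
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def count_in_zone(boxes, polygon):
    """Count boxes (x_min, y_min, x_max, y_max) whose bottom-center anchor lies in the polygon."""
    count = 0
    for x_min, y_min, x_max, y_max in boxes:
        anchor = ((x_min + x_max) / 2, y_max)
        if point_in_polygon(anchor, polygon):
            count += 1
    return count


zone = [(0, 0), (100, 0), (100, 100), (0, 100)]
boxes = [
    (10, 10, 30, 50),    # anchor (20, 50): inside
    (90, 80, 130, 95),   # anchor (110, 95): outside
    (40, 60, 60, 90),    # anchor (50, 90): inside
]
in_zone = count_in_zone(boxes, zone)
```

The anchor choice matters in practice: for people or vehicles seen from above, the bottom-center of the box usually approximates where the object touches the ground.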
  • 22
    surpriver

    Find big moving stocks before they move using machine learning

    surpriver is a machine learning project designed to identify unusual stock market activity that may precede large price movements. The system analyzes historical stock price and volume data to detect anomalies that could indicate potential trading opportunities. By applying machine learning techniques to market indicators, the tool attempts to identify patterns in trading behavior that deviate significantly from normal market activity. These anomalies are interpreted as signals that a stock may soon experience a major upward or downward move. The framework includes modules for retrieving market data, computing technical indicators, and applying anomaly detection algorithms to identify unusual patterns. The project is intended as a research tool for quantitative finance experiments and algorithmic trading strategy development.
    Downloads: 1 This Week
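The idea of flagging activity that deviates sharply from normal behavior can be shown with a very simple detector. The sketch below scores each day's volume against a trailing window using a z-score; this is a stand-in chosen for brevity, not the detector surpriver itself uses, and the function names, window size, and threshold are all assumptions.

```python
from statistics import mean, pstdev


def anomaly_scores(volumes, window=5):
    """Score each day by how many standard deviations its volume is from the trailing window."""
    scores = []
    for i in range(window, len(volumes)):
        history = volumes[i - window:i]
        mu, sigma = mean(history), pstdev(history)
        z = (volumes[i] - mu) / sigma if sigma > 0 else 0.0
        scores.append((i, z))
    return scores


def flag_anomalies(volumes, window=5, threshold=3.0):
    """Return the indices of days whose absolute z-score reaches the threshold."""
    return [i for i, z in anomaly_scores(volumes, window) if abs(z) >= threshold]


volumes = [100, 102, 98, 101, 99, 100, 103, 400, 101, 99]
flagged = flag_anomalies(volumes)
```

On this toy series only the volume spike at index 7 is flagged; the days after it are not, because the spike inflates the trailing standard deviation.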
  • 23
    tika-python

    Python binding to the Apache Tika™ REST services

    A Python port of the Apache Tika library that makes Tika available using the Tika REST Server. This makes Apache Tika available as a Python library, installable via Setuptools, Pip, and Easy Install. To use this library, you need to have Java 7+ installed on your system, as tika-python starts up the Tika REST server in the background. To get this working in a disconnected environment, download a tika server file (both tika-server.jar and tika-server.jar.md5, which can be found here) and set the TIKA_SERVER_JAR environment variable to TIKA_SERVER_JAR="file:////tika-server.jar", which tells python-tika to "download" this file, move it to /tmp/tika-server.jar, and run it as a background process. This is the only way to run python-tika without internet access. Without this set, the default is to check the tika version and pull the latest every time from Apache.
    Downloads: 1 This Week
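For the disconnected setup described above, the environment variable can also be set from Python before the library is imported. The sketch below is a hedged illustration: `configure_offline_tika` is a hypothetical helper, `/opt/tika/tika-server.jar` is a made-up location, and it assumes a POSIX system where the jar and its `.md5` file already sit side by side.

```python
import os
from pathlib import Path


def configure_offline_tika(jar_path):
    """Point TIKA_SERVER_JAR at a local jar via a file:// URI and return that URI.

    jar_path must be absolute, because Path.as_uri() rejects relative paths.
    """
    uri = Path(jar_path).as_uri()
    os.environ["TIKA_SERVER_JAR"] = uri
    return uri


# Hypothetical location of the downloaded server jar.
uri = configure_offline_tika("/opt/tika/tika-server.jar")
```

Setting the variable before importing the library is the safer order, on the assumption that the setting is read when the module loads. After that, `from tika import parser` followed by `parser.from_file(...)` should start the local server instead of fetching one.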
  • 24
    sgmweka

    Weka wrapper for the SGM toolkit for text classification and modeling. Provides Sparse Generative Models for scalable and accurate text classification and modeling for use in high-speed and large-scale text mining. Has lower time complexity of classification than comparable software due to inference based on sparse model representation and use of an inverted index. The provided .zip file is in the Weka package format, giving access to text classification. Other functions are usable through either Java command-line commands or class inclusion into Java projects.
    Downloads: 22 This Week
  • 25
    MEKA

    A Multi-label Extension to Weka

    Multi-label classifiers and evaluation procedures using the Weka machine learning framework.
    Downloads: 5 This Week