Showing 73 open source projects for "clustering"

View related business solutions
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    daily-paper-computer-vision

    daily-paper-computer-vision

    Document papers compiled daily in computer vision/deep learning

    This repo is a running feed of computer-vision research, tracking new papers and notable results so practitioners can keep up without scouring multiple sites. It’s organized chronologically and often thematically, making it easy to scan what’s new in detection, segmentation, recognition, generative vision, 3D, and video understanding. The cadence is intentionally frequent, reflecting how quickly CV advances and how hard it is to maintain awareness while working full time. By aggregating...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Spektral

    Spektral

    Graph Neural Networks with Keras and Tensorflow 2

    ...The main goal of this project is to provide a simple but flexible framework for creating graph neural networks (GNNs). You can use Spektral for classifying the users of a social network, predicting molecular properties, generating new graphs with GANs, clustering nodes, predicting links, and any other task where data is described by graphs. Spektral implements some of the most popular layers for graph deep learning. Spektral also includes lots of utilities for representing, manipulating, and transforming graphs in your graph deep learning projects. Spektral is compatible with Python 3.6 and above, and is tested on the latest versions of Ubuntu, MacOS, and Windows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    MLPACK is a C++ machine learning library with emphasis on scalability, speed, and ease-of-use. Its aim is to make machine learning possible for novice users by means of a simple, consistent API, while simultaneously exploiting C++ language features to provide maximum performance and flexibility for expert users. * More info + downloads: https://mlpack.org * Git repo: https://github.com/mlpack/mlpack
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Machine Learning Git Codebook

    Machine Learning Git Codebook

    For extensive instructor led learning

    ...The project is designed as a self-paced learning resource that walks learners through the full data science workflow, including data preprocessing, exploratory analysis, feature engineering, and model development. It covers a wide range of machine learning techniques such as decision trees, clustering methods, nearest neighbor algorithms, anomaly detection, and probabilistic classifiers. The repository organizes these topics into sequential notebooks that explain theoretical concepts while allowing users to experiment directly with code. Many lessons emphasize hands-on exercises where learners analyze datasets, implement algorithms, and evaluate results through visualizations and statistical metrics.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    Python ML Jupyter Notebooks

    Python ML Jupyter Notebooks

    Practice and tutorial-style notebooks

    Python ML Jupyter Notebooks is an educational repository that demonstrates how to implement machine learning algorithms and data science workflows using Python. The project provides numerous examples and tutorials covering classical machine learning techniques such as regression, classification, clustering, and dimensionality reduction. It includes code implementations that show how to build models using popular libraries like scikit-learn, NumPy, pandas, and Matplotlib. The repository is designed to help learners understand both the theory and practical implementation of machine learning algorithms through step-by-step code examples. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    mlr

    mlr

    Machine Learning in R

    ...The framework provides supervised methods like classification, regression, and survival analysis along with their corresponding evaluation and optimization methods, as well as unsupervised methods like clustering. It is written in a way that you can extend it yourself or deviate from the implemented convenience methods and construct your own complex experiments or algorithms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Pattern

    Pattern

    Web mining module for Python, with tools for scraping

    ...In addition to data mining features, the library offers natural language processing functionality including part-of-speech tagging, sentiment analysis, and n-gram extraction. The framework also includes machine learning algorithms that support classification, clustering, and vector space modeling for text analysis tasks. Another component of the library provides tools for analyzing and visualizing networks, making it useful for studying relationships between entities in large datasets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Python Machine Learning 3rd Ed.

    Python Machine Learning 3rd Ed.

    The "Python Machine Learning (3rd edition)" book code repository

    ...The project provides implementations of machine learning algorithms and data science workflows described in the book, enabling readers to experiment with real code while studying theoretical concepts. The repository includes Python notebooks and scripts demonstrating techniques such as data preprocessing, classification, regression, clustering, neural networks, and model evaluation. These examples are designed to illustrate how machine learning algorithms operate internally and how they can be applied to real datasets. Many examples rely on widely used libraries such as NumPy, scikit-learn, and deep learning frameworks to demonstrate modern machine learning workflows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Pattern Recognition and Machine Learning

    Pattern Recognition and Machine Learning

    Repository of notes, code and notebooks in Python

    ...These notebooks provide visualizations and computational demonstrations that help clarify complex topics such as probabilistic models, neural networks, kernel methods, and graphical models. The repository also includes implementations of sampling methods, clustering algorithms, and dimensionality reduction techniques used throughout machine learning research.
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 10
    igel

    igel

    Machine learning tool that allows you to train and test models

    A delightful machine learning tool that allows you to train/fit, test, and use models without writing code. The goal of the project is to provide machine learning for everyone, both technical and non-technical users. I sometimes needed a tool sometimes, which I could use to fast create a machine learning prototype. Whether to build some proof of concept, create a fast draft model to prove a point or use auto ML. I find myself often stuck writing boilerplate code and thinking too much about...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Texthero

    Texthero

    Text preprocessing, representation and visualization from zero to hero

    Texthero is a python package to work with text data efficiently. It empowers NLP developers with a tool to quickly understand any text-based dataset and it provides a solid pipeline to clean and represent text data, from zero to hero.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Machine-Learning-Notes

    Machine-Learning-Notes

    Zhou Zhihua's "Machine Learning" push notes

    ...The notes span sixteen chapters that cover a wide range of topics, including model evaluation, linear models, decision trees, neural networks, support vector machines, Bayesian classifiers, ensemble methods, clustering, dimensionality reduction, and reinforcement learning. Each section explains the theoretical principles of the algorithms and walks through derivations to help readers understand why the methods work rather than simply how to use them. The repository organizes the material into printable chapters so that students can study the notes offline or use them as reference material while learning machine learning theory.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    CDF-TS
    This Matlab code is used for demonstration of the effect of CDF-TS as a preprocessing method to transform data. Written by Ye Zhu, Deakin University, April 2021, version 1.0. This software is under GNU General Public License version 3.0 (GPLv3) This code is a demo of method described by the following publication: Zhu, Y., Ting, K.M., Carman, M. and Angelova, M., 2021, April. CDF Transform-and-Shift: An effective way to deal with datasets of inhomogeneous cluster densities. Pattern...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    GIMP ML

    GIMP ML

    AI for GNU Image Manipulation Program

    ...Applications from deep learning such as monocular depth estimation, semantic segmentation, mask generative adversarial networks, image super-resolution, de-noising and coloring have been incorporated with GIMP through Python-based plugins. Additionally, operations on images such as edge detection and color clustering have also been added. GIMP-ML relies on standard Python packages such as numpy, scikit-image, pillow, pytorch, open-cv, scipy. In addition, GIMP-ML also aims to bring the benefits of using deep learning networks used for computer vision tasks to routine image processing workflows.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Isolation Similarity

    Isolation Similarity

    aNNE similarity based on Isolation Kernel

    ...Written by Xiaoyu Qin, Monash University, March 2019, version 1.0 This software is under GNU General Public License version 3.0 (GPLv3) This code is a demo of method described by the following publication: Qin, X., Ting, K.M., Zhu, Y. and Lee, V.C., 2019, July. Nearest-neighbour-induced isolation similarity and its impact on density-based clustering. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 33, pp. 4755-4762). https://ojs.aaai.org//index.php/AAAI/article/view/4402 Bibtex format: @inproceedings{qin2019nearest, title={Nearest-neighbour-induced isolation similarity and its impact on density-based clustering}, author={Qin, Xiaoyu and Ting, Kai Ming and Zhu, Ye and Lee, Vincent CS}, booktitle={Proceedings of the AAAI Conference on Artificial Intelligence}, volume={33}, pages={4755--4762}, year={2019} }
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    Spectral Python

    A python module for hyperspectral image processing

    Spectral Python (SPy) is a python package for reading, viewing, manipulating, and classifying hyperspectral image (HSI) data. SPy includes functions for clustering, dimensionality reduction, supervised classification, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Machine Learning with TensorFlow

    Machine Learning with TensorFlow

    Accompanying source code for Machine Learning with TensorFlow

    ...The project provides numerous code samples demonstrating how to build machine learning models using the TensorFlow framework. These examples illustrate core machine learning concepts such as regression, classification, clustering, and neural networks through practical implementations. The repository includes implementations of algorithms such as logistic regression, convolutional neural networks, and autoencoders, which allow readers to experiment with different learning techniques. Many examples are structured as standalone scripts or notebooks that can be executed directly to reproduce the results described in the book. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    TensorFlow Machine Learning Cookbook

    TensorFlow Machine Learning Cookbook

    Code for Tensorflow Machine Learning Cookbook

    ...The examples illustrate how TensorFlow operations and tensors can be used to build machine learning pipelines and perform tasks such as regression, classification, and clustering. By combining theoretical explanations with executable code, the project helps developers understand how TensorFlow algorithms operate internally while also providing working examples that can be adapted for real projects.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Machine Learning From Scratch

    Machine Learning From Scratch

    Bare bones NumPy implementations of machine learning models

    ...The repository includes implementations of algorithms ranging from simple models such as linear regression and logistic regression to more complex techniques such as decision trees, support vector machines, clustering methods, and neural networks. Because the code avoids external machine learning libraries, it exposes the full logic behind model training, optimization, and prediction processes. The project also provides examples and explanations that illustrate how the algorithms behave and how different components interact during training.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    Coursera Machine Learning

    Coursera Machine Learning

    Coursera Machine Learning By Prof. Andrew Ng

    CourseraMachineLearning is a personal collection of resources, notes, and programming exercises from Andrew Ng’s popular Machine Learning course on Coursera. It consolidates lecture references, programming tutorials, test cases, and supporting materials into one repository for easier review and practice. The project highlights fundamental machine learning concepts such as hypothesis functions, cost functions, gradient descent, bias-variance tradeoffs, and regression models. It also organizes...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 21
    spark-ml-source-analysis

    spark-ml-source-analysis

    Spark ml algorithm principle analysis and specific source code

    ...Instead of providing a runnable software system, the repository focuses on explaining algorithm principles and examining the underlying source code used in Spark’s machine learning package. The repository contains detailed analyses of various algorithms including classification, regression, clustering, dimensionality reduction, and recommendation systems. Each section discusses both the mathematical principles behind the algorithms and how Spark implements them in a distributed computing environment. By studying these implementations, readers gain insight into how large-scale machine learning pipelines operate across distributed data systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Oryx

    Oryx

    Lambda architecture on Apache Spark, Apache Kafka for real-time

    Oryx 2 is a realization of the lambda architecture built on Apache Spark and Apache Kafka, but with specialization for real-time large-scale machine learning. It is a framework for building applications but also includes packaged, end-to-end applications for collaborative filtering, classification, regression and clustering. The application is written in Java, using Apache Spark, Hadoop, Tomcat, Kafka, Zookeeper and more. Configuration uses a single Typesafe Config config file, wherein applications configure an entire deployment of the system. This includes implementations of key interface classes which implement the batch, speed, and serving logic. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Clustering by Shared Subspaces

    Clustering by Shared Subspaces

    Grouping Points by Shared Subspaces for Effective Subspace Clustering

    These functions implement a subspace clustering algorithm, proposed by Ye Zhu, Kai Ming Ting, and Mark J. Carman: "Grouping Points by Shared Subspaces for Effective Subspace Clustering", Published in Pattern Recognition Journal at https://doi.org/10.1016/j.patcog.2018.05.027
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Distance Scaling

    Distance Scaling

    A Distance Scaling Method to Improve Density-Based Clustering

    These functions implement a distance scaling method, proposed by Ye Zhu, Kai Ming Ting, and Maia Angelova, "A Distance Scaling Method to Improve Density-Based Clustering", in PAKDD2018 proceedings: https://doi.org/10.1007/978-3-319-93040-4_31.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    DGRLVQ

    Dynamic Generalized Relevance Learning Vector Quantization

    Some of the usual problems for Learning vector quantization (LVQ) based methods are that one cannot optimally guess about the number of prototypes required for initialization for multimodal data structures i.e.these algorithms are very sensitive to initialization of prototypes and one has to pre define the optimal number of prototypes before running the algorithm. If a prototype, for some reasons, is ‘outside’ the cluster which it should represent and if there are points of a different...
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo