Showing 147 open source projects for "clustering"

View related business solutions
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 1
    Seurat

    Seurat

    R toolkit for single cell genomics

    Seurat is a comprehensive R toolkit for single-cell genomics analysis, introduced by the Satija Lab at NYGC. It supports quality control, normalization, clustering, integration of multimodal data (e.g., scRNA‑seq, spatial, CITE‑seq), and visualization. Seurat v5 introduces scalable workflows and spatial transcriptomics support, commonly used in academic and industry research for single-cell studies.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Orange Data Mining

    Orange Data Mining

    Orange: Interactive data analysis

    ...Perform simple data analysis with clever data visualization. Explore statistical distributions, box plots and scatter plots, or dive deeper with decision trees, hierarchical clustering, heatmaps, MDS and linear projections. Even your multidimensional data can become sensible in 2D, especially with clever attribute ranking and selections. Interactive data exploration for rapid qualitative analysis with clean visualizations. Graphic user interface allows you to focus on exploratory data analysis instead of coding, while clever defaults make fast prototyping of a data analysis workflow extremely easy. ...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 3
    Elasticsearch

    Elasticsearch

    A Distributed RESTful Search Engine

    Elasticsearch is a distributed, RESTful search and analytics engine that lets you store, search and analyze with ease at scale. It lets you perform and combine many types of searches; it scales seamlessly, and offers answers incredibly fast with search results you can rank based on a variety of factors. Elasticsearch can be used for a wide variety of use cases, from maps and metrics to site search and workplace search, and with all data types.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    awesome-single-cell

    awesome-single-cell

    Community-curated list of software packages and data resources

    ...The package incorporates novel and established methods to provide a flexible framework to perform filtering, quality control, normalization, dimension reduction, clustering, differential expression and a wide-range of plotting. An analytical framework for big-scale single cell data. Transform percentage-based units into a 2d space to evaluate changes in distribution with both magnitude and direction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop vibe-debugging. Icon
    Stop vibe-debugging.

    Plug Claude into your app's actual errors.

    AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.
    Free 30 days.
  • 5
    Armadillo

    Armadillo

    fast C++ library for linear algebra & scientific computing

    * Fast C++ library for linear algebra (matrix maths) and scientific computing * Easy to use functions and syntax, deliberately similar to Matlab / Octave * Uses template meta-programming techniques to increase efficiency * Provides user-friendly wrappers for OpenBLAS, Intel MKL, LAPACK, ATLAS, ARPACK, SuperLU and FFTW libraries * Useful for machine learning, pattern recognition, signal processing, bioinformatics, statistics, finance, etc. * Downloads:...
    Leader badge
    Downloads: 2,671 This Week
    Last Update:
    See Project
  • 6

    GromacsProSuite

    Graphical User Interface for Gromacs

    ...Built-in monitoring tracks CPU, RAM, and disk usage to ensure stable performance during parallel processing. Beyond simulation execution, it includes advanced trajectory processing and analysis tools such as RMSD, RMSF, SASA, clustering, PCA, hydrogen-bond analysis, Ramachandran plots, and FEL mapping. With integrated visualization and plotting utilities, it offers a unified platform for researchers, educators, and students to perform complete MD workflows efficiently and reproducibly. Our Goal is to bring MD simulations to every biologist's toolkit.
    Leader badge
    Downloads: 23 This Week
    Last Update:
    See Project
  • 7
    stkpp

    stkpp

    C++ Statistical ToolKit

    STK++ (http://www.stkpp.org) is a versatile, fast, reliable and elegant collection of C++ classes for statistics, clustering, linear algebra, arrays (with an Eigen-like API), regression, dimension reduction, etc. Some functionalities provided by the library are available in the R environment as R functions (http://cran.at.r-project.org/web/packages/rtkore/index.html). At a convenience, we propose the source packages on sourceforge. The library offers a dense set of (mostly) template classes in C++ and is suitable for projects ranging from small one-off projects to complete data mining application suites.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    Social Network Visualizer

    Social Network Visualizer

    Social Network Analysis and Visualization software

    Visit our new site: http://socnetv.org Social Network Visualizer (SocNetV) is a social network analysis and visualization application. You can draw a social network (graph/digraph) or load an existing one (GraphML, UCINET, Pajek, etc), compute cohesion, centrality, community and structural equivalence metrics and apply various layout algorithms based on actor centrality or prestige scores (i.e. Eigenvector, Betweenness) or on dynamic models (i.e. Kamada-Kawai spring-embedder)
    Downloads: 7 This Week
    Last Update:
    See Project
  • 9
    NGSEP

    NGSEP

    NGSEP (Next Generation Sequencing Experience Platform)

    ...The current version provides functionalities for both de-novo and reference guided analysis of sequencing data, including genome assembly, read mapping, variants detection and genotyping and de-novo analysis of data generated from reduced representation protocols. NGSEP also provides modules for analysis of genomic variation databases (VCF files), including functional annotation, filtering, format conversion, comparison, clustering, imputation, introgression analysis and different kinds of statistics. Since version 4, we provide functionalities for management of genomes and transcriptomes, including genome alignment and annotation of transposable elements. A complete list of functionalities is available in our wiki (https://sourceforge.net/p/ngsep/wiki/Home/). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 10
    CAMPARI

    CAMPARI

    Software for molecular simulations and trajectory analysis

    We are proud to introduce version 5 of CAMPARI. We have added a number of new features, most notably a Python interface for interpreting user-supplied code (with the help of ForPy), a novel trajectory storage standard (with the help of libpqxx/PostgreSQL), and a module for performing transition path theory. Naturally, CAMPARI continues to provide the reference implementation of the ABSINTH force field paradigm and implicit solvation model. CAMPARI is a joint package for performing and...
    Downloads: 30 This Week
    Last Update:
    See Project
  • 11

    Faum

    Fast Autonomous Unsupervised Multidimiensional Classification

    This is the proof-of-concept implementation of the FAUM Clustering method. This implementation was used to perform the published results and is now released in the hope that it will be useful.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    RAFTS³G

    RAFTS³G

    Rapid Alignment Free Tool for Sequences Similarity Search to Groups

    | RAFTS3G - Rapid Alignment Free Tool for Sequences Similarity Search to Groups| % Clustering algoritm to create homology clusters based on RAFTS3 (VIALLE, R. A.) % More informations about RAFTS3 in: https://sourceforge.net/projects/rafts3/ % Professional and Technical Education Sector from the Federal University of Paraná - Bioinformatic Lab. % Federal of Paraná University - UFPR % Dr. Alcides Vieira Arco-verde street, 1225 % CEP: 81520-260 Jardim das Américas % Curitiba – PR % Brazil % % Developers and colaborators: % Roberto T. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 13
    MLPACK is a C++ machine learning library with emphasis on scalability, speed, and ease-of-use. Its aim is to make machine learning possible for novice users by means of a simple, consistent API, while simultaneously exploiting C++ language features to provide maximum performance and flexibility for expert users. * More info + downloads: https://mlpack.org * Git repo: https://github.com/mlpack/mlpack
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    pycos

    Python framework for concurrent, asynchronous, distributed tasks

    Python framework for concurrent, asynchronous, distributed communicating tasks for broad range of use cases, including public / private / hybrid cloud computing, fog / edge computing.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    dataMAPPs

    dataMAPPs

    R based pipeline for MHC-associated peptide proteomics (MAPPs) data

    dataMAPPs allows routine and efficient processing of data from immunogenicity studies applying the MAPPs peptidomics technology to detect potential MHCI- or MHC-II epitopes as presented by dendritic cells (DC). It features quality control of the raw data, across-sample/across-donor normalization and visualization of results in a heatmap style (heatMAPPs). dataMAPPs' core is a generic R library that can be tailored to specific projects via dedicated control scripts which also allow...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    DynaQ

    DynaQ

    Innovative text document search. http://dynaq.opendfki.de for details.

    The goal of DynaQ is to develop an inquiry system to explore the personal information space, supporting you with the searching paradigm 'orienteering'. DynaQ is a (desktop)search engine with enhanced functionality for file, email and blog search. Look at our GitLab homepage for sourcecode and documentation: http://dynaq.opendfki.de
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    QuantifyPoly(A)

    Quantification of poly(A) sites from 3' end sequencing data

    QuantifyPoly(A) - a tool for quantification of poly(A) sites from 3' end sequencing data. [1] QuantifyPoly(A) user manual Please visit the Wiki page of this website. [2] QuantifyPoly(A) Q&A For Q&A, please visit the Blog page of this website. [3] QuantifyPoly(A) bug report You can report a bug as a Ticket request, or start a topic session in the Discussion webpage of this website.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    NOVA

    NOVA

    Analysis and visualization of complexome profiling data.

    NOVA is a program designed to analysis complexome profiling data (Heide et al., 2012). A graphical user interface (GUI) provides various visualization tools, such as heat maps and 2D plots. Several hierarchical clustering algorithms (e.g., single linkage, average linkage, Wards linkage), different distance measures (e.g., Euclidean distance, Manhattan distance, Pearson distance), and various normalization techniques are implemented. Many additional functions like zooming, searching for proteins, image export, and automatic file format recognition support intuitive handling for biologists. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19

    Spectral Python

    A python module for hyperspectral image processing

    Spectral Python (SPy) is a python package for reading, viewing, manipulating, and classifying hyperspectral image (HSI) data. SPy includes functions for clustering, dimensionality reduction, supervised classification, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    This project has moved to GitHub: https://github.com/emmawahl/facepai
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    slurm-roll

    Slurm Resource Manager for Rocks Clusters

    Slurm is a replacement for other resource management software and schedulers like gridengine or torque. The slurm roll integrates very well into a rocks clusters installation. In the folder addons, there are a lot of useful rolls for rocks clusters 6.1 and 6.2. These rolls do not depend on slurm. Please see Discussion for further information and support
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22

    MarDRe

    MapReduce-based tool to remove duplicate DNA reads

    MarDRe is a de novo MapReduce-based parallel tool to remove duplicate and near-duplicate DNA reads through the clustering of single-end and paired-end sequences from FASTQ/FASTA datasets. This tool allows bioinformatics to avoid the analysis of not necessary reads, reducing the time of subsequent procedures with the dataset. MarDRe is the Big Data counterpart of ParDRe (link above), which employs HPC technologies (i.e., hybrid MPI/multithreading) to reduce runtime on multicore systems. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    GPU,  a Global Processing Unit

    GPU, a Global Processing Unit

    A framework for distributed computing

    An extensible framework for distributed computing on P2P grids. We support peaceful free and open research and build an internet supercomputer. We render movies, solve Eternity puzzles, predict climate and improve a ~30 GHz cluster of clients.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    popt4jlib

    Parallel Optimization Library for Java

    popt4jlib is an open-source parallel optimization library for the Java programming language supporting both shared memory and distributed message passing models. Implements a number of meta-heuristic algorithms for Non-Linear Programming, including Genetic Algorithms, Differential Evolution, Evolutionary Algorithms, Simulated Annealing, Particle Swarm Optimization, Firefly Algorithm, Monte-Carlo Search, Local Search algorithms, Gradient-Descent-based algorithms, as well as some well-known...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    Construct2D

    COmputational fluid dyNamics STRUctured grid CreaTor for 2D airfoils

    ...Now available in version 2.0: hyperbolic grid generation to create higher quality grids in a fraction of the time compared to elliptic grid generation. Elliptic grid generation is still available also. Smooth airfoil surface spacing is handled automatically with user-controlled clustering parameters. Also included is a visualizer written in Python with matplotlib.
    Downloads: 4 This Week
    Last Update:
    See Project
Auth0 Logo