Showing 27 open source projects for "clustering"

View related business solutions
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 1
    Elasticsearch

    Elasticsearch

    A Distributed RESTful Search Engine

    Elasticsearch is a distributed, RESTful search and analytics engine that lets you store, search and analyze with ease at scale. It lets you perform and combine many types of searches; it scales seamlessly, and offers answers incredibly fast with search results you can rank based on a variety of factors. Elasticsearch can be used for a wide variety of use cases, from maps and metrics to site search and workplace search, and with all data types.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    NGSEP

    NGSEP

    NGSEP (Next Generation Sequencing Experience Platform)

    ...The current version provides functionalities for both de-novo and reference guided analysis of sequencing data, including genome assembly, read mapping, variants detection and genotyping and de-novo analysis of data generated from reduced representation protocols. NGSEP also provides modules for analysis of genomic variation databases (VCF files), including functional annotation, filtering, format conversion, comparison, clustering, imputation, introgression analysis and different kinds of statistics. Since version 4, we provide functionalities for management of genomes and transcriptomes, including genome alignment and annotation of transposable elements. A complete list of functionalities is available in our wiki (https://sourceforge.net/p/ngsep/wiki/Home/). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    DynaQ

    DynaQ

    Innovative text document search. http://dynaq.opendfki.de for details.

    The goal of DynaQ is to develop an inquiry system to explore the personal information space, supporting you with the searching paradigm 'orienteering'. DynaQ is a (desktop)search engine with enhanced functionality for file, email and blog search. Look at our GitLab homepage for sourcecode and documentation: http://dynaq.opendfki.de
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    jLDADMM

    A Java package for the LDA and DMM topic models

    ...It provides implementations of the Latent Dirichlet Allocation topic model and the one-topic-per-document Dirichlet Multinomial Mixture model (i.e. mixture of unigrams), using collapsed Gibbs sampling. In addition, jLDADMM supplies a document clustering evaluation to compare topic models. See the usage of jLDADMM in its website at http://jldadmm.sourceforge.net/
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 5
    The Java Data Mining Package (JDMP) is a library that provides methods for analyzing data with the help of machine learning algorithms (e.g. clustering, classification, graphical models, neural networks, Bayesian networks, text processing, optimization).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    Deem

    Analyze time-course data with significance tests, clustering, modeling

    Use statistical methods to analyze time-course data (gene expression microarray and RNA-seq data in particular, but not limited to). Apply significance tests to filter out only significant genes or time series. Cluster time series into similar groups. Generate network models, including linear or non-linear models. Variable selection and optimization routines included. Written in Scala and R. The application is a cross-platform desktop app with a simple GUI and is fully functional...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Weka4OC GUI for Overlapping clustering

    Weka4OC GUI for Overlapping clustering

    Weka4OC: Weka for Overlapping Clustering is a GUI extending WEKA

    This is a GUI application for learning non disjoint groups based on Weka machine learning framework. It offers a variety of learning methods, based on k-means, able to produce overlapping clusters. The application also contains an evaluation framework that calculates several external validation measures. The application offers a visualization tool to discover overlapping groups.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    ktree

    clustering, machine learning, algorithms

    This project has moved to github at http://lmwtree.devries.ninja.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Unsupervised TXT classifier

    Unsupervised TXT classifier

    Classify any two TXT documents, no training required - JAVA

    ...First, over-training and second, shortage of data for a training of categories. Instead, each TXT file is a category on its own, rather than an assigned category. In a way, this is similar to clustering but not really a clustering algorithm since there is some training involved. The summarizer from Classifier4J has been adjusted to accept two inputs (lets call them A and B). Then, the summarizer gets trained with A to summarize a document B, and vice versa. This extracts a relevant structure for both documents (and thus avoids the over-training) which are then compared using the Vector-Space analysis to give a range of belonging of one document to another (and thus avoids the shortage of information). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 10
    DocCO

    DocCO

    Non-disjoint groupping of Documents based on word sequence approach

    This is a GUI for learning non disjoint groups of documents based on Weka machine learning framework. It offers the possibility to make non disjoint clustering of documents using both vectorial and sequential representation (word sequence approach based on WSK kernel). All data format supported by WEKA could be used in DocCO. Data could be loaded from files, from databases or from specified URL. All the preprocessing techniques implemented in WEKA could be used before performing the learning.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    TAXOMO
    Data mining tool for sequences (e.g. trajectories on a map, visited web pages, etc.) that creates a succinct description of the sequences, given a taxonomy (e.g. regions and sub-regions in the map, categories and sub-categories of pages, etc.).
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    SLEDRIDE: Simplified Learning about Expression Data Running in a Desktop Environment. To provide a general workbench for pipe-lining microarray gene expression data from supervised learning results into unsupervised learning methods.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Java package to study a clustering model described in the paper \"Novel Clustering Algorithm Based Upon Games on Evolving Network\" by Q. Li, Z. Chen, Y. He and J-P. Jiang (in arxiv: http://arxiv.org/pdf/0812.5064v1), generalizations and similar issues.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    SONIVIS:Tool aims at analysing social (virtual) information spaces like Wikis. These spaces are investigated by using different network definitions (collaboration/information networks). Clustering algorithms and statistiscal analyses are provided.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Clown is a "clustering" framework. It allows you to cluster datasets (in ARFF) format using a number of different clustering algorithms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    SYRAH si propone di far emergere e rappresentare i concetti espressi per mezzo di un linguaggio naturale. SYRAH aims to discover and represent concepts expressed in natural languages. NLP, lemma, lemmario, italiano, rete, semantica, clustering, semantic
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    GoldenOrb is a java library under the Apache License V2.0 for correlation, summarization and clustering of text information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    The Databionics ESOM Tools offer many data mining tasks using Emergent Self-Organizing Maps. Visualization, clustering, and classification of high-dimensional data using databionics principles can be performed interactively or automatically.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    The Word Vector Tool is a simple but flexible Java library to create word vector representations of text documents. Word vectors can be used for various text processing tasks, as text classification, text clustering or information retrieval.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    JMiner is a (not yet!) complete data mining and artificial intelligence solution written in Java. Support for neural networks, genetic algorithms/programming, decision trees, clustering, market basket analysis, link analysis, data cleansing, and others.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    MicroArray Genome Imaging and Clustering Tool (MAGIC tool) is a platform-independant java program for analyzing MicroArray data (.tiff scans & .txt godlists) via graphs and clustering operations (including QT-clustering). http://www.bio.davidson.edu/magic
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    brCluster is a class library, written in java, that implements generic clustering algorithms carefully designed to allow its aplication in any kind of data. The algorithms implemented are K-means and Hierarchical Clustering (Simple and Complete Link).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    JUDGE (Java Utility for Document Genre Eduction) features automatic classification and clustering of documents, optionally as a webservice. The program is written entirely in Java and makes use of the Weka machine learning toolkit.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    TM4 is a suite of applications for managing and analyzing microarray data. TM4 provides data storage and tracking, image analysis, normalization, data filtering, clustering and statistical analysis capabilities. Includes MADAM, Spotfinder, MIDAS, and MeV.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    The Internet Censor is a multi-platform, Internet clustering program, for which the resulting data will be used in the creation of a non-profit content-filtering Internet Search Engine for children.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Auth0 Logo