Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Scientific/Engineering
Information Analysis Software
Search Results

Search Results for "clustering"

x

Sort By:

Relevance

Clear All Filters

OS

Linux 27
Windows 27
Mac 24
More...
BSD 14
ChromeOS 13
Desktop Operating Systems 1
Embedded Operating Systems 1

Category

Scientific/Engineering 27
Artificial Intelligence 14
Business 12
Internet 6
Software Development 5
System 5
Database 2
Education 1

License

OSI-Approved Open Source 24
Creative Commons Attribution License 1
Public Domain 1

Translations

English 9
Spanish 2
Dutch 1
French 1
More...
German 1
Italian 1
Russian 1

Programming Language

Java 27
C++ 3
C 2
MATLAB 2
Python 2
More...
AspectJ 1
JavaScript 1
Perl 1
S/R 1
Scala 1

Status

Beta 9
Production/Stable 6
Planning 4
Pre-Alpha 4
More...
Alpha 3
Mature 1

Showing 27 open source projects for "clustering"

View related business solutions

Information Analysis Java Clear Filters & Widen Search

Ship Agents Faster
Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free
Build Securely on AWS with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
1

Elasticsearch

A Distributed RESTful Search Engine

Elasticsearch is a distributed, RESTful search and analytics engine that lets you store, search and analyze with ease at scale. It lets you perform and combine many types of searches; it scales seamlessly, and offers answers incredibly fast with search results you can rank based on a variety of factors. Elasticsearch can be used for a wide variety of use cases, from maps and metrics to site search and workplace search, and with all data types.

Downloads: 4 This Week

Last Update: 3 days ago
See Project
2

NGSEP

NGSEP (Next Generation Sequencing Experience Platform)

...The current version provides functionalities for both de-novo and reference guided analysis of sequencing data, including genome assembly, read mapping, variants detection and genotyping and de-novo analysis of data generated from reduced representation protocols. NGSEP also provides modules for analysis of genomic variation databases (VCF files), including functional annotation, filtering, format conversion, comparison, clustering, imputation, introgression analysis and different kinds of statistics. Since version 4, we provide functionalities for management of genomes and transcriptomes, including genome alignment and annotation of transposable elements. A complete list of functionalities is available in our wiki (https://sourceforge.net/p/ngsep/wiki/Home/). ...

1 Review

Downloads: 0 This Week

Last Update: 2026-06-03
See Project
3

DynaQ

Innovative text document search. http://dynaq.opendfki.de for details.

The goal of DynaQ is to develop an inquiry system to explore the personal information space, supporting you with the searching paradigm 'orienteering'. DynaQ is a (desktop)search engine with enhanced functionality for file, email and blog search. Look at our GitLab homepage for sourcecode and documentation: http://dynaq.opendfki.de

Downloads: 0 This Week

Last Update: 2021-08-05
See Project
4

jLDADMM

A Java package for the LDA and DMM topic models

...It provides implementations of the Latent Dirichlet Allocation topic model and the one-topic-per-document Dirichlet Multinomial Mixture model (i.e. mixture of unigrams), using collapsed Gibbs sampling. In addition, jLDADMM supplies a document clustering evaluation to compare topic models. See the usage of jLDADMM in its website at http://jldadmm.sourceforge.net/

1 Review

Downloads: 0 This Week

Last Update: 2016-03-13
See Project
Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud

Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.

Get a free trial
5

Java Data Mining Package

The Java Data Mining Package (JDMP) is a library that provides methods for analyzing data with the help of machine learning algorithms (e.g. clustering, classification, graphical models, neural networks, Bayesian networks, text processing, optimization).

Downloads: 0 This Week

Last Update: 2015-08-19
See Project
6

Deem

Analyze time-course data with significance tests, clustering, modeling

Use statistical methods to analyze time-course data (gene expression microarray and RNA-seq data in particular, but not limited to). Apply significance tests to filter out only significant genes or time series. Cluster time series into similar groups. Generate network models, including linear or non-linear models. Variable selection and optimization routines included. Written in Scala and R. The application is a cross-platform desktop app with a simple GUI and is fully functional...

Downloads: 0 This Week

Last Update: 2015-02-01
See Project
7

Weka4OC GUI for Overlapping clustering

Weka4OC: Weka for Overlapping Clustering is a GUI extending WEKA

This is a GUI application for learning non disjoint groups based on Weka machine learning framework. It offers a variety of learning methods, based on k-means, able to produce overlapping clusters. The application also contains an evaluation framework that calculates several external validation measures. The application offers a visualization tool to discover overlapping groups.

1 Review

Downloads: 0 This Week

Last Update: 2014-02-22
See Project
8

ktree

clustering, machine learning, algorithms

This project has moved to github at http://lmwtree.devries.ninja.

Downloads: 0 This Week

Last Update: 2015-03-15
See Project
9

Unsupervised TXT classifier

Classify any two TXT documents, no training required - JAVA

...First, over-training and second, shortage of data for a training of categories. Instead, each TXT file is a category on its own, rather than an assigned category. In a way, this is similar to clustering but not really a clustering algorithm since there is some training involved. The summarizer from Classifier4J has been adjusted to accept two inputs (lets call them A and B). Then, the summarizer gets trained with A to summarize a document B, and vice versa. This extracts a relevant structure for both documents (and thus avoids the over-training) which are then compared using the Vector-Space analysis to give a range of belonging of one document to another (and thus avoids the shortage of information). ...

Downloads: 0 This Week

Last Update: 2013-12-19
See Project
$300 Free Credits to Build on Google Cloud
New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.

Claim $300 Free
10

DocCO

Non-disjoint groupping of Documents based on word sequence approach

This is a GUI for learning non disjoint groups of documents based on Weka machine learning framework. It offers the possibility to make non disjoint clustering of documents using both vectorial and sequential representation (word sequence approach based on WSK kernel). All data format supported by WEKA could be used in DocCO. Data could be loaded from files, from databases or from specified URL. All the preprocessing techniques implemented in WEKA could be used before performing the learning.

Downloads: 0 This Week

Last Update: 2013-08-17
See Project
11

TAXOMO

Data mining tool for sequences (e.g. trajectories on a map, visited web pages, etc.) that creates a succinct description of the sequences, given a taxonomy (e.g. regions and sub-regions in the map, categories and sub-categories of pages, etc.).

Downloads: 3 This Week

Last Update: 2013-04-24
See Project
12

SLEDRIDE

SLEDRIDE: Simplified Learning about Expression Data Running in a Desktop Environment. To provide a general workbench for pipe-lining microarray gene expression data from supervised learning results into unsupervised learning methods.

Downloads: 0 This Week

Last Update: 2013-05-21
See Project
13

Evolving Game for Unnatural Intelligence

Java package to study a clustering model described in the paper \"Novel Clustering Algorithm Based Upon Games on Evolving Network\" by Q. Li, Z. Chen, Y. He and J-P. Jiang (in arxiv: http://arxiv.org/pdf/0812.5064v1), generalizations and similar issues.

Downloads: 0 This Week

Last Update: 2016-02-03
See Project
14

SONIVIS:Tool

SONIVIS:Tool aims at analysing social (virtual) information spaces like Wikis. These spaces are investigated by using different network definitions (collaboration/information networks). Clustering algorithms and statistiscal analyses are provided.

Downloads: 0 This Week

Last Update: 2013-04-22
See Project
15

Clown

Clown is a "clustering" framework. It allows you to cluster datasets (in ARFF) format using a number of different clustering algorithms.

Downloads: 0 This Week

Last Update: 2013-03-25
See Project
16

SYR<=>SPR

SYRAH si propone di far emergere e rappresentare i concetti espressi per mezzo di un linguaggio naturale. SYRAH aims to discover and represent concepts expressed in natural languages. NLP, lemma, lemmario, italiano, rete, semantica, clustering, semantic

Downloads: 0 This Week

Last Update: 2015-07-15
See Project
17

GoldenOrb

GoldenOrb is a java library under the Apache License V2.0 for correlation, summarization and clustering of text information.

Downloads: 0 This Week

Last Update: 2013-04-19
See Project
18

Databionic ESOM Tools

The Databionics ESOM Tools offer many data mining tasks using Emergent Self-Organizing Maps. Visualization, clustering, and classification of high-dimensional data using databionics principles can be performed interactively or automatically.

Downloads: 0 This Week

Last Update: 2013-06-04
See Project
19

Word Vector Tool

The Word Vector Tool is a simple but flexible Java library to create word vector representations of text documents. Word vectors can be used for various text processing tasks, as text classification, text clustering or information retrieval.

Downloads: 0 This Week

Last Update: 2013-04-08
See Project
20

jMiner

JMiner is a (not yet!) complete data mining and artificial intelligence solution written in Java. Support for neural networks, genetic algorithms/programming, decision trees, clustering, market basket analysis, link analysis, data cleansing, and others.

Downloads: 0 This Week

Last Update: 2016-07-27
See Project
21

MagicTool

MicroArray Genome Imaging and Clustering Tool (MAGIC tool) is a platform-independant java program for analyzing MicroArray data (.tiff scans & .txt godlists) via graphs and clustering operations (including QT-clustering). http://www.bio.davidson.edu/magic

1 Review

Downloads: 0 This Week

Last Update: 2016-05-08
See Project
22

brCluster

brCluster is a class library, written in java, that implements generic clustering algorithms carefully designed to allow its aplication in any kind of data. The algorithms implemented are K-means and Hierarchical Clustering (Simple and Complete Link).

Downloads: 0 This Week

Last Update: 2015-12-02
See Project
23

Judge

JUDGE (Java Utility for Document Genre Eduction) features automatic classification and clustering of documents, optionally as a webservice. The program is written entirely in Java and makes use of the Weka machine learning toolkit.

Downloads: 0 This Week

Last Update: 2015-12-01
See Project
24

TM4 Microarray Software Suite

TM4 is a suite of applications for managing and analyzing microarray data. TM4 provides data storage and tracking, image analysis, normalization, data filtering, clustering and statistical analysis capabilities. Includes MADAM, Spotfinder, MIDAS, and MeV.

Downloads: 0 This Week

Last Update: 2013-04-08
See Project
25

The Internet Censor

The Internet Censor is a multi-platform, Internet clustering program, for which the resulting data will be used in the creation of a non-profit content-filtering Internet Search Engine for children.

Downloads: 0 This Week

Last Update: 2013-03-11
See Project

Previous
You're on page 1
2
Next

Related Searches

k-means clustering

elasticsearch

ngsep

document classification

latent dirichlet allocation

algorithms

summarizer

social network

self organizing maps

wvtool

Related Categories

Scientific/Engineering

Artificial Intelligence

Business

Internet

Software Development

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise