This tool combines Random Forest (RF) and Partitioning Around Medoids (PAM) to cluster observations and to calculate the dissimilarity between them. It supports on-line prediction for new observations without retraining, and it handles datasets that mix continuous features (e.g. CPU load) with categorical ones (e.g. VM instance type). Specifically, an unsupervised formulation of the Random Forest algorithm computes similarities between observations, and these are passed as input to a clustering algorithm. For efficiency, and to meet the dynamism requirement of autonomic clouds, the methodology consists of two steps: (i) off-line clustering and (ii) on-line prediction.
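The unsupervised Random Forest idea can be sketched as follows. This is a minimal illustration, not the project's actual code: it assumes NumPy and scikit-learn are available, handles numeric features only (categorical features would need encoding first), and uses one common formulation in which the forest learns to separate the real observations from a synthetic copy whose columns have been shuffled independently. The function name `rf_dissimilarity` and its parameters are hypothetical.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier


def rf_dissimilarity(X, n_trees=200, seed=0):
    """Return (forest, D), where D[i, j] = 1 - proximity of rows i and j of X."""
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    # Synthetic "class 0" data: permute each column independently, which keeps
    # the marginal distributions but destroys the dependence between features.
    synthetic = np.column_stack(
        [rng.permutation(X[:, j]) for j in range(X.shape[1])]
    )
    data = np.vstack([X, synthetic])
    labels = np.concatenate([np.ones(len(X)), np.zeros(len(synthetic))])

    # Assumption: a plain classifier forest trained on real vs. synthetic rows.
    forest = RandomForestClassifier(n_estimators=n_trees, random_state=seed)
    forest.fit(data, labels)

    # leaves[i, t] is the index of the leaf that observation i reaches in tree t.
    leaves = forest.apply(X)
    # Proximity = fraction of trees in which two observations share a leaf.
    same_leaf = leaves[:, None, :] == leaves[None, :, :]
    proximity = same_leaf.mean(axis=2)
    return forest, 1.0 - proximity
```

The pairwise comparison builds an n x n x n_trees array, so this sketch only suits small datasets; a larger one would need to compute proximities in blocks.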

RF+PAM can:

Cluster observations (Unsupervised Learning)
Calculate the dissimilarity between two or more observations (how different they are)
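The two capabilities above fit together roughly as in the following sketch, which is not taken from the project itself. It assumes a precomputed square dissimilarity matrix `D` (for example one derived from Random Forest proximities) and uses a simplified PAM with a greedy build phase and a first-improvement swap phase. The names `total_cost`, `pam`, and `predict` are hypothetical. The on-line step assigns a new observation to its closest medoid, so no re-clustering is needed.

```python
def total_cost(D, medoids):
    """Sum of each observation's dissimilarity to its closest medoid."""
    return sum(min(D[i][m] for m in medoids) for i in range(len(D)))


def pam(D, k):
    """Cluster n observations into k groups from an n x n dissimilarity matrix D."""
    n = len(D)
    # BUILD: greedily add the medoid that lowers the total cost the most.
    medoids = []
    while len(medoids) < k:
        candidates = [c for c in range(n) if c not in medoids]
        best = min(candidates, key=lambda c: total_cost(D, medoids + [c]))
        medoids.append(best)
    # SWAP: exchange a medoid for a non-medoid while that lowers the cost.
    improved = True
    while improved:
        improved = False
        current = total_cost(D, medoids)
        for pos in range(k):
            for c in range(n):
                if c in medoids:
                    continue
                trial = medoids[:pos] + [c] + medoids[pos + 1:]
                cost = total_cost(D, trial)
                if cost < current:
                    medoids, current, improved = trial, cost, True
    # Label each observation with the position of its closest medoid.
    labels = [min(range(k), key=lambda j: D[i][medoids[j]]) for i in range(n)]
    return medoids, labels


def predict(dissim_to_medoids):
    """On-line step: assign a new observation to its closest medoid."""
    return min(range(len(dissim_to_medoids)), key=lambda j: dissim_to_medoids[j])


# Toy usage: six points on a line, dissimilarity = absolute difference.
points = [0, 1, 2, 10, 11, 12]
D = [[abs(a - b) for b in points] for a in points]
medoids, labels = pam(D, k=2)
```

Here `predict` expects the new observation's dissimilarity to each medoid, in the same order as the `medoids` list returned by `pam`.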

Categories

Machine Learning

Additional Project Details

Operating Systems

Linux

Intended Audience

Developers, System Administrators

Programming Language

Python

Related Categories

Python Machine Learning Software

Registered

2015-05-21