This tool uses Random Forest and PAM to cluster observations and to calculate the dissimilarity between observations. It supports on-line prediction of new observations (no need to retrain); and supports datasets that contain both continuous (e.g. CPU load) and categorical (e.g. VM instance type) features. In particular, we use an unsupervised formulation of the Random Forest algorithm to calculate similarities and provide them as input to a clustering algorithm. For the sake of efficiency and meeting the dynamism requirement of autonomic clouds, our methodology consists of two steps: (i) off-line clustering and (ii) on-line prediction.

RF+PAM can:

Cluster observations (Unsupervised Learning)
Calculate the dissimilarity between 2 or more observations (how different two observations are)

Project Samples

Project Activity

See All Activity >

Categories

Machine Learning

Follow Unsupervised Random Forest

Unsupervised Random Forest Web Site

Other Useful Business Software
Powering the best of the internet | Fastly Icon
Powering the best of the internet | Fastly

Fastly's edge cloud platform delivers faster, safer, and more scalable sites and apps to customers.

Ensure your websites, applications and services can effortlessly handle the demands of your users with Fastly. Fastly’s portfolio is designed to be highly performant, personalized and secure while seamlessly scaling to support your growth.
Try for free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Unsupervised Random Forest!

Additional Project Details

Operating Systems

Linux

Intended Audience

System Administrators, Developers

Programming Language

Python

Related Categories

Python Machine Learning Software

Registered

2015-05-21