This tool combines Random Forest (RF) with Partitioning Around Medoids (PAM) to cluster observations and to calculate the dissimilarity between them. It handles datasets that mix continuous features (e.g. CPU load) and categorical features (e.g. VM instance type). An unsupervised formulation of the Random Forest algorithm computes the similarities between observations, which are then passed as input to the clustering algorithm. To stay efficient and to meet the dynamism requirement of autonomic clouds, the methodology has two steps: (i) off-line clustering and (ii) on-line prediction, in which new observations are assigned to clusters without retraining.
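The two-step workflow described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names `pam` and `predict` are hypothetical, the swap search is a simplified first-improvement variant of PAM, and the absolute difference between 1-D points stands in for the Random Forest dissimilarity purely to keep the example self-contained.

```python
import random


def pam(dissim, k, seed=0, max_iter=100):
    """Simplified PAM (hypothetical helper, not the project's API).

    dissim is a square, symmetric list of lists of pairwise dissimilarities.
    Returns (medoids, labels): medoid indices and, for each observation,
    the position of its nearest medoid in that list.
    """
    n = len(dissim)
    rng = random.Random(seed)
    medoids = rng.sample(range(n), k)

    def cost(meds):
        # Total dissimilarity of every observation to its nearest medoid.
        return sum(min(dissim[i][m] for m in meds) for i in range(n))

    best = cost(medoids)
    for _ in range(max_iter):
        improved = False
        for pos in range(k):
            for cand in range(n):
                if cand in medoids:
                    continue
                # Try replacing one medoid with a non-medoid observation.
                trial = medoids[:pos] + [cand] + medoids[pos + 1:]
                c = cost(trial)
                if c < best:
                    medoids, best, improved = trial, c, True
        if not improved:
            break

    labels = [min(range(k), key=lambda j: dissim[i][medoids[j]])
              for i in range(n)]
    return medoids, labels


def predict(dissim_to_medoids):
    """On-line step: assign a new observation to its nearest medoid."""
    return min(range(len(dissim_to_medoids)),
               key=dissim_to_medoids.__getitem__)


# Off-line step on toy data. Assumption: absolute difference replaces
# the RF-derived dissimilarity that the tool itself would use.
points = [0.0, 0.1, 0.2, 5.0, 5.1, 5.2]
dissim = [[abs(a - b) for b in points] for a in points]
medoids, labels = pam(dissim, k=2)

# On-line step: only dissimilarities to the k medoids are needed,
# so the clustering is not recomputed for a new observation.
new_point = 4.8
new_label = predict([abs(new_point - points[m]) for m in medoids])
```

In this sketch the on-line step costs k dissimilarity evaluations per new observation, which is what makes prediction cheap compared with re-running the clustering.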

RF+PAM can:

Cluster observations (Unsupervised Learning)
Calculate the dissimilarity between two or more observations (how different they are)
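As a rough illustration of how a Random Forest can yield a dissimilarity, the sketch below uses one common convention: the proximity of two observations is the fraction of trees in which they fall into the same leaf, and the dissimilarity is sqrt(1 - proximity). Whether this tool uses exactly this formula is an assumption. The function name `rf_dissimilarity` and the leaf assignments are hypothetical; in practice the leaf indices would come from a trained forest (for example, scikit-learn's `RandomForestClassifier.apply` returns an array of this shape).

```python
import math


def rf_dissimilarity(leaves):
    """Pairwise dissimilarity from leaf assignments (hypothetical helper).

    leaves[i][t] is the id of the leaf that observation i reaches in tree t.
    Proximity is the share of trees where two observations share a leaf;
    dissimilarity is sqrt(1 - proximity), an assumed convention.
    """
    n = len(leaves)
    n_trees = len(leaves[0])
    dissim = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            shared = sum(a == b for a, b in zip(leaves[i], leaves[j]))
            proximity = shared / n_trees
            d = math.sqrt(1.0 - proximity)
            dissim[i][j] = dissim[j][i] = d
    return dissim


# Made-up leaf ids for 3 observations in a forest of 4 trees.
leaves = [
    [1, 4, 2, 7],
    [1, 4, 2, 3],
    [5, 6, 8, 3],
]
D = rf_dissimilarity(leaves)
# Observations 0 and 1 share a leaf in 3 of 4 trees, so they are close;
# observations 0 and 2 never share a leaf, so they are maximally distant.
```

Because the measure depends only on which leaves observations reach, it applies equally to continuous and categorical features once the forest has been trained on them.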

Categories

Machine Learning

Additional Project Details

Operating Systems

Linux

Intended Audience

Developers, System Administrators

Programming Language

Python

Related Categories

Python Machine Learning Software

Registered

2015-05-21