I ran a clustering with the K-Means model, and I would like to understand how the variable importance percentages in the histogram are calculated. What do they measure?
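We fit a simple supervised random forest model that predicts the cluster labels output by K-Means from the input variables. This allows us to derive variable importances using the standard random forest method (as implemented in scikit-learn). The percentages therefore indicate how much each variable helps the random forest tell the clusters apart.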
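To illustrate the idea, here is a minimal sketch in scikit-learn. It is not the exact internal implementation: the synthetic data, the feature names, the number of clusters, and the forest settings are all assumptions made for the example. It also assumes that the "standard method" refers to scikit-learn's impurity-based `feature_importances_`, which are normalized to sum to 1 and can be shown as percentages.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.ensemble import RandomForestClassifier

# Hypothetical data: two informative features that define three blobs,
# plus one pure-noise feature that carries no cluster structure.
centers = [[0, 0], [5, 5], [0, 5]]
X_informative, _ = make_blobs(n_samples=300, centers=centers, random_state=0)
rng = np.random.default_rng(0)
noise = rng.normal(size=(300, 1))
X = np.hstack([X_informative, noise])
feature_names = ["x0", "x1", "noise"]  # illustrative names only

# Step 1: unsupervised clustering with K-Means.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)
labels = kmeans.fit_predict(X)

# Step 2: fit a random forest classifier that predicts the cluster labels
# from the same input variables.
forest = RandomForestClassifier(n_estimators=100, random_state=0)
forest.fit(X, labels)

# Step 3: read the impurity-based importances (they sum to 1)
# and express them as percentages.
importances = forest.feature_importances_
for name, value in zip(feature_names, importances):
    print(f"{name}: {100 * value:.1f}%")
```

In this sketch, a variable gets a high percentage when splitting on it separates the clusters well, so the noise column should end up with the smallest share.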