farsamo Posted November 26, 2012 Posted November 26, 2012 Hi I doing unsupervised machine learning experiments. I used K-means algorithms however my question is how can I evaluate how good the data was grouped. I found something called Validity index. Would you please guide me how to evaluate clusters. Thanks
elnetrino Posted December 4, 2012 Posted December 4, 2012 Try to use datasets where the documents are classifiied into known categories, i.e. datasets used for classification experiments. This is a nice way to examine the results of clustering algorithms. Use a k-means and initialize it with observations from each class. You can also specify number of clusters equal to the number of classes, and see if the clusters that are created match the classes.
Recommended Posts
Create an account or sign in to comment
You need to be a member in order to leave a comment
Create an account
Sign up for a new account in our community. It's easy!
Register a new accountSign in
Already have an account? Sign in here.
Sign In Now