View Single Post
  #1  
Old 06-02-2012, 03:27 PM
leonidr leonidr is offline
Junior Member
 
Join Date: Apr 2012
Posts: 1
Default K-means clustering for RBF centers

Why do we disregard the labels when we pick the centers?

I know that the first step is an example of unsupervised learning. But isn't the overall goal classification? And we measure the error of that classification based off of the labels. So, in practice, why would we disregard them in a typical supervised problem?

I know that one difficulty that immediately comes to mind is how do we combine (and/or map) the x and y into a space where the clustering can be performed easily; where the distance metric between two points weighs the x and the y appropriately (an open question I guess). But perhaps a naive approach such as appending the label (+-1) would have some benefit?


My apologies for asking a question outside the realm of the book, but I thought that it was relevant for the class. Any insight would be interesting.
Reply With Quote