![]() |
|
#1
|
|||
|
|||
![]()
To pick the best set of K-Means clusters centroids for each training set, how many times should be generate the centroids to find the best? 10? 100? 1000? The higher number provides the higher probability of finding the the centroids with the lowest Eout.
Thanks, David |
#2
|
|||
|
|||
![]() Quote:
|
#3
|
|||
|
|||
![]()
Different problem If you start with different initial centroids you get different results. If you run it enough you can materially change Ein and Eout because of a better centroid result.
By itself K-means clustering doesn't guarantee an optimal result although it reaches a stable result quickly (especially with so few data points). |
#4
|
|||
|
|||
![]() Quote:
|
#5
|
|||
|
|||
![]()
It was mentioned in lectures that you in general should try different initilizations.
However, I didn't do this... I just started with K random real data points. This strategy favours the SVM model when we compare classification performance, but since we execute the experiment many times this type of bias should be diminishing. |
#6
|
|||
|
|||
![]()
I'm getting a difference of two letters in the answer between using the first k-means cluster returned and testing 50 clusters per training data set.
If I can't get an official answer....those who submitted the correct answer, was it one iteration or a high number of iterations to choose the centroids for RBF? |
![]() |
Thread Tools | |
Display Modes | |
|
|