View Single Post
  #3  
Old 05-22-2013, 11:57 AM
Elroch Elroch is offline
Invited Guest
 
Join Date: Mar 2013
Posts: 143
Default Re: libsvm random seeding (Q7)

Quote:
Originally Posted by nparslow View Post
I notice that the random division of data for libsvm is not so random:

http://www.csie.ntu.edu.tw/~cjlin/libsvm/faq.html#f421

so for me doing 100 runs just gives the identical answer 100 times.

I was thinking that a way around this might be to permute (randomly) the order of the data sample with each new run - thus meaning the same seed will be used within each run (as advised in the link above) but there would be a change between runs.

Does this seem reasonable? Or should I be doing a random seed for each call of svmtrain even within a single run? or am I going in completely the wrong direction?
If you are doing precisely the same run more than once with the same data with the aim of averaging out noise due to the procedure, then reseeding makes sense to me. The recommendation in the link you gave was not for identical runs: it was saying to fix the seed in order to isolate differences due to changing the parameters. [If you reseed as well as changing parameters this adds noise to the signal due to changing the parameters, so is advised against].
Reply With Quote