LFD Book Forum  

Go Back   LFD Book Forum > General > General Discussion of Machine Learning

Reply
 
Thread Tools Display Modes
  #1  
Old 09-24-2012, 01:51 PM
Andrs Andrs is offline
Member
 
Join Date: Jul 2012
Posts: 47
Default SVM and C-parameter selection

C parameter defines the penalty for violating the SVM margins. The recommendation is to use the CV to find the best C value. What are some typical criteria used to identify a suitable C value??
Here are some statements but I am not sure if they could be used to select a suitable C value???

If we select a very large C value and there is noisy data (+ non linearly separable data), we may select a too narrow margin (hard margins) and we may overfitt if we are using a high dimensional kernel (try to fit noisy). Here we should get a large number of margin support vectors that indicate poor generalization and large E_out.

If we select a too small C, we should get many non-margin support vectors implying a total large number of support vectors(large E_out). Are we underfitting with too small C value???
We should have something in between!
Should we try to minimize the total number of support vectors as the main criteria for selecting C in order to reduce E_out. Or are there other aspects to take into consideration....???
Reply With Quote
  #2  
Old 04-20-2013, 01:26 PM
Elroch Elroch is offline
Invited Guest
 
Join Date: Mar 2013
Posts: 143
Default Re: SVM and C-parameter selection

If I understand correctly, the fact that the selection of C is based on out of sample errors in the cross validation should imply that these problems are avoided, with high probability, if there is enough data.

The questions are what conditions are necessary to ensure this, and how can this statement can be made quantitative? Each value of C is associated with a single hypothesis through the SVM training process, but this mapping is a very complex one.

I presume the size of the data set (and hence the sizes of the training set and the out of sample sets in the cross validation) are key to robust behaviour, but how big they need to be is not so clear to me for a SVM. I suspect this particular issue may be an art rather than a science, but others surely know more.
Reply With Quote
Reply

Tags
c-value, overfitting, svm

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT -7. The time now is 05:00 AM.


Powered by vBulletin® Version 3.8.3
Copyright ©2000 - 2019, Jelsoft Enterprises Ltd.
The contents of this forum are to be used ONLY by readers of the Learning From Data book by Yaser S. Abu-Mostafa, Malik Magdon-Ismail, and Hsuan-Tien Lin, and participants in the Learning From Data MOOC by Yaser S. Abu-Mostafa. No part of these contents is to be communicated or made accessible to ANY other person or entity.