![]() |
#1
|
|||
|
|||
![]()
Hello again dear Professor and all!
I want to determine VC dimension of time series models in order to avoid overfitting and estimate minimum size of data set. 1. First, maybe incorrect question as it does not articulate specific hypothesis space. A model takes input vector ![]() ![]() ![]() 2. Second, concrete time series model I'm working on, based on the article of Liehr and Pawelzik "A trading strategy with variable investment from minimizing risk to profit ratio" published in Physica A 287 (2000) 524-538. Let me explain it briefly. Liehr and Pawelzik compare performance of two related models. Both models construct the series of input vectors by embedding the time series of returns into a space of embedding dimension ![]() ![]() a) discrete state model. Taking signs of ![]() ![]() b) RBF neural network. Training is performed by unsupervised adaptation of centers and subsequent gradient descent to adjust the second layer weights. For comparability, number of centers, Gaussians, is chosen equal to number of states ![]() Now, to my question. Liehr and Pawelzik do not use term 'VC dimension' but urge to avoid overfitting by using only a small number of Gaussians. In our terms ![]() Am I correct, is VC dimension of the two models is approximately ![]() ![]() |
#2
|
||||
|
||||
![]()
Interesting problem. First, as you anticipated, just specifying the input as
![]() ![]() Now with the two models (both are similarity-based models), I assume that the forecast is binary. You have grouped inputs into 32 categories, with all inputs in the same category necessarily mapping to the same ![]() The equality of number of centers and number of support vectors in Lecture 16 was a forced assumption for comparison, but they need not be equal. The number of support vectors comes out of the process of solving the SVM kernel problem, whereas the number of clusters is a parameter under our control that we decide on before running Lloyd's.
__________________
Where everyone thinks alike, no one thinks very much |
#3
|
|||
|
|||
![]()
Thank you very much, Professor.
I have overlooked this apparent fact that number of centers in RBF model equals to estimated parameters, weights, and hence VC-dimension. For the sake of brevity I haven't explained what the models forecast. It does not affect you conclusion but it's an interesting part on its own. Forecast isn't binary, they forecast 2 real valued outputs: return and variance of return. For discrete state model these 2 values are just a constant average of similar states ( ![]() Dear Professor, in this model I was going to substitute RBF network with support vector regression. But I remember you noticed ones that support vector regression isn't that good as SVM for classification is. Is it worth of trying? Stay with RBF? |
#4
|
||||
|
||||
![]()
Worth a try. You never know which model will work best in which real-life problem.
__________________
Where everyone thinks alike, no one thinks very much |
#5
|
|||
|
|||
![]()
Thank you!
|
#6
|
|||
|
|||
![]()
thank you for this all information because they are helpfull for me a lot.
![]() |
#7
|
|||
|
|||
![]()
Awesome thanks for this info. I love this community
![]() |
![]() |
Thread Tools | |
Display Modes | |
|
|