Quote:
Originally Posted by doneit
I've checked my understanding of set-notation but am I correct in thinking the question says "H' is-a-subset of H", ie, less complex?
To me the most "general" case is having a large, possibly infinite, H where the bulk would be overly complex. So my (wrong) answer assumes that by reducing complexity of the set I'm reducing the chance of over-fitting. So "in general" I'm *reducing* the chance of deterministic noise.
|
Your understanding is correct in that a subset will be generally less complex than the original set. I think the source of confusion is that overfitting is affected by both the level of noise and the complexity of the model, and for deterministic noise the complexity in turn affects the level of noise. The question, however, addresses deterministic noise only, not necessarily the overfitting effect that results from it. The part which is not accurate in the above quote is the last sentence that mixes the two.
Quote:
Btw, I'm enjoying the video lectures immensely. It's like watch a soap opera with twists & turns, plots & sub-plots, and just when the situation seems impossible the magician doffs his top hat and pulls out another white rabbit I actually giggled at the cross-validation trick
|
If I had thought of that, I would have inserted some commercial breaks at the right moments.