LFD Book Forum  

Book Feedback - Learning From Data > Chapter 4 - Overfitting

  #1  
10-21-2014, 03:10 PM
physicsme (Junior Member)
Figure 4.3(b)

The title of Figure 4.3(b) is "Deterministic noise". However, the trend in which the overfit level first decreases, hits a sweet spot, and then increases as Q_f grows is the result of not only deterministic noise but stochastic noise as well. The reason the fixed hypothesis set overfits the data when the target function is extremely simple is the presence of stochastic noise. If we eliminated it, a 10th-order polynomial hypothesis set would fit a 2nd-order polynomial target function exactly.
In this sense, the title "Deterministic noise" of Figure 4.3(b) is a bit misleading.

Actually, I became aware of this while doing Exercise 4.3. At first I thought the answer should be "deterministic noise goes up all the way as target complexity increases", then I looked at Fig. 4.3(b) and thought "hey, it says deterministic noise first goes down and then goes up!" But come to think of it, the figure really plots E_out(H10) - E_out(H2), which includes the effects of both stochastic and deterministic noise, hence this post.
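To make the point concrete, here is a minimal simulation sketch, not the code behind the book's figure. It assumes inputs uniform on [-1, 1], Gaussian noise, and least-squares fits in a Legendre basis; the quadratic target coefficients, N = 30, and the function name `overfit_measure` are hypothetical choices for illustration. It estimates E_out(H10) - E_out(H2) for a 2nd-order target with and without stochastic noise.

```python
import numpy as np
from numpy.polynomial import legendre as L

def overfit_measure(target_coeffs, n_points, sigma, rng, trials=200):
    """Average E_out(H10) - E_out(H2) over `trials` random data sets.

    Assumes the target has degree <= 10, so the 16-node Gauss-Legendre
    rule below integrates every squared error exactly.
    """
    nodes, weights = L.leggauss(16)
    f_nodes = L.legval(nodes, target_coeffs)
    total = 0.0
    for _ in range(trials):
        x = rng.uniform(-1.0, 1.0, n_points)
        y = L.legval(x, target_coeffs) + sigma * rng.standard_normal(n_points)
        e_out = {}
        for degree in (2, 10):
            g = L.legfit(x, y, degree)  # least-squares fit in the Legendre basis
            # Mean squared error against the noiseless target, x uniform on [-1, 1]
            e_out[degree] = 0.5 * np.sum(weights * (L.legval(nodes, g) - f_nodes) ** 2)
        total += e_out[10] - e_out[2]
    return total / trials

rng = np.random.default_rng(0)
target = [0.3, -0.5, 0.8]  # hypothetical 2nd-order target (Legendre coefficients)
clean = overfit_measure(target, n_points=30, sigma=0.0, rng=rng)
noisy = overfit_measure(target, n_points=30, sigma=0.5, rng=rng)
print("sigma = 0.0:", clean)
print("sigma = 0.5:", noisy)
```

With sigma = 0 both models recover the target up to round-off, so the measure is essentially zero; with sigma > 0 the 10th-order fit pays for its extra freedom.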
  #2  
12-17-2014, 11:13 AM
ypeels (Member)
Re: Figure 4.3(b)

An explanation by @yaser was posted on the 2014 edX forum:

Quote:
Q. Why is the bottom part of this figure behaving differently?

A. This is an artifact and it has to do with our choice of the models (2nd and 10th order polynomials). For Qf≤10 there is no deterministic noise for H10 (we can perfectly fit them).

Q. Why did we add stochastic noise to the target when generating the above figure; aren't we just analyzing deterministic noise?

A. We wanted to compare the two figures fairly. When plotting the impact of the stochastic noise, we already had some built-in deterministic noise in our target function as well.
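The statement that H10 has no deterministic noise for Qf ≤ 10 can be checked with a short sketch. It assumes the target is a Legendre series and x is uniform on [-1, 1], so the best approximation in H10 is simply the series truncated at degree 10. The function names and the random coefficients are hypothetical, not taken from the book's experiment.

```python
import numpy as np
from numpy.polynomial import legendre as L

def deterministic_noise(coeffs, order=10):
    """Mean squared residual of the best degree-`order` approximation.

    Assumes x is uniform on [-1, 1], where E[L_q(x)^2] = 1 / (2q + 1)
    and the Legendre polynomials are mutually orthogonal.
    """
    coeffs = np.asarray(coeffs, dtype=float)
    q = np.arange(len(coeffs))
    tail = q > order  # terms the hypothesis set cannot represent
    return float(np.sum(coeffs[tail] ** 2 / (2 * q[tail] + 1)))

def deterministic_noise_quadrature(coeffs, order=10):
    """Same quantity, obtained by numerically integrating the squared residual."""
    residual = np.array(coeffs, dtype=float)
    residual[: order + 1] = 0.0  # drop the part H_order captures exactly
    nodes, weights = L.leggauss(len(residual) + 1)
    return float(0.5 * np.sum(weights * L.legval(nodes, residual) ** 2))

rng = np.random.default_rng(0)
for q_f in (5, 10, 15, 30):
    a = rng.standard_normal(q_f + 1)  # hypothetical target coefficients
    print(q_f, deterministic_noise(a), deterministic_noise_quadrature(a))
```

For Qf = 5 and Qf = 10 both numbers are exactly zero; they become positive only once the target has terms above degree 10.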
  #3  
12-17-2014, 11:22 AM
ypeels (Member)
Re: Figure 4.3(b)

Another question about this figure: is there any intuition as to why regions of a given color (fixed overfit measure) are roughly linear in the stochastic N-sigma^2 graph, but non-linear (exponential?) in the deterministic N-Qf graph (for Qf > 10)?

Thank you!
  #4  
12-17-2014, 01:09 PM
yaser (Caltech)
Re: Figure 4.3(b)

Quote:
Originally Posted by ypeels
Another question about this figure: is there any intuition as to why regions of a given color (fixed overfit measure) are roughly linear in the stochastic N-sigma^2 graph, but non-linear (exponential?) in the deterministic N-Qf graph (for Qf > 10)?

Thank you!

The analysis of the stochastic noise figure may be doable given the clean analytic components of the simulation (Legendre, pseudo-inverse, Gaussian noise). In the deterministic noise figure, the noise value is quantified by the complexity of the target Q_f. While deterministic noise (the part of f that cannot be captured by \mathcal H) is indeed related to Q_f, it is not necessarily linearly related to it, so a direct parallel with what happens with stochastic noise does not hold.
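As a rough illustration of the non-linearity, here is a worked expression under assumptions that are not stated in this thread: x uniform on [-1, 1], a target f(x) = \sum_{q=0}^{Q_f} a_q L_q(x) with i.i.d. zero-mean coefficients, and a normalization such that E_{a,x}[f^2] = 1. Here h^* denotes the best approximation to f in \mathcal H_{10}.

```latex
% Sketch under the stated assumptions, not a derivation from the book.
% Orthogonality of Legendre polynomials for x uniform on [-1, 1]:
\[
  \mathbb{E}_x\!\left[ L_q(x)\, L_k(x) \right] = \frac{\delta_{qk}}{2q+1}
\]
% Expected deterministic noise energy for H_10 when Q_f > 10:
\[
  \mathbb{E}_{a,x}\!\left[ \bigl(f(x) - h^*(x)\bigr)^2 \right]
  = \frac{\displaystyle\sum_{q=11}^{Q_f} \frac{1}{2q+1}}
         {\displaystyle\sum_{q=0}^{Q_f} \frac{1}{2q+1}}
\]
```

Under these assumptions the quantity is zero for Q_f ≤ 10, then grows sublinearly and saturates toward 1, whereas σ² enters the stochastic noise figure directly as the noise energy. This only describes the noise level as a function of Q_f; it does not by itself derive the shape of the contours.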

BTW, the LaTeX stuff is done by delimiters [ math ] and [ /math ] (without the spaces) instead of $ and $. A bit cumbersome when you use a lot of math.
__________________
Where everyone thinks alike, no one thinks very much