- **Chapter 1 - The Learning Problem**
(*http://book.caltech.edu/bookforum/forumdisplay.php?f=108*)

- - **question target distribution**
(*http://book.caltech.edu/bookforum/showthread.php?t=357*)

In the lectures, we take a deterministic target function and add noise to it to make a target distribution. We did the same in the programming assignment. But is it advisable to introduce noise into a data set that is handed to you? That data set may or may not already contain noise.
One more question: in the lecture, we said noise $= y - f(x)$, and

Noisy target = deterministic target $f(x) = \mathbb{E}(y \mid x)$ + noise $(y - f(x))$

I do not get the above equation. Shouldn't it be:

Noisy target = deterministic target + noise
Noisy target = $f(x) + (y - f(x))$
Noisy target = $y$

This is really confusing me.

Re: question target distribution

My understanding is that the noisy target is $y$, but we want to decompose this noisy target into the deterministic component and the noise. The equations from the lectures simply measure the noise component as the difference between the noisy target and the deterministic target (i.e. $y - f(x)$). We can't really say much more about what the noise is because it depends on the situation in which you're applying the learning algorithm.
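To spell the decomposition out, here is a short derivation using only the definitions quoted in this thread, $f(x) = \mathbb{E}(y \mid x)$ and noise $= y - f(x)$. The symbol $\varepsilon$ for the noise is my own shorthand, not notation taken from the lectures.

$$
\begin{aligned}
\varepsilon &:= y - f(x), \qquad f(x) := \mathbb{E}(y \mid x) \\
y &= f(x) + \varepsilon \\
\mathbb{E}(\varepsilon \mid x) &= \mathbb{E}(y \mid x) - f(x) = 0
\end{aligned}
$$

So the second line is just the definition of the noise rearranged, and the third line says that, with this choice of $f$, the noise averages to zero at every $x$.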
As for the examples in the lectures and the assignment where we have to add noise to a deterministic target function: we do so only because the sample data we generated come from the deterministic target function and therefore contain no noise. I don't think any real-world situation would present a training set with no noise, so we manually add noise to the "ideal" data we generated ourselves in order to mimic real-world data.
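A minimal sketch of that procedure, in case it helps. The target `f`, the noise level `sigma`, the Gaussian noise model, and the helper name `make_noisy_targets` are all assumptions I picked for illustration; none of them come from the lectures or the assignment.

```python
import numpy as np


def f(x):
    """Hypothetical deterministic target, chosen only for illustration."""
    return np.sin(np.pi * x)


def make_noisy_targets(x, sigma, rng):
    """Return y = f(x) + noise, with zero-mean Gaussian noise (an assumption)."""
    return f(x) + rng.normal(0.0, sigma, size=x.shape)


rng = np.random.default_rng(0)

# Repeat a handful of x values many times so E(y|x) can be estimated directly.
x_values = np.linspace(-1.0, 1.0, 5)
x = np.repeat(x_values, 20_000)
y = make_noisy_targets(x, sigma=0.1, rng=rng)

# The noise is defined as whatever is left after removing the deterministic part.
noise = y - f(x)

# Estimate E(y|x) by averaging y over all samples that share the same x.
cond_mean = np.array([y[x == v].mean() for v in x_values])
max_gap = np.max(np.abs(cond_mean - f(x_values)))

print("largest |estimated E(y|x) - f(x)|:", max_gap)
print("sample mean of the noise:", noise.mean())
```

With this setup the estimated conditional mean should sit close to `f(x)` at each of the five points, which is the sense in which the deterministic target equals $\mathbb{E}(y \mid x)$. For a data set that is handed to you, the same decomposition applies to whatever noise it already contains; the sketch only covers the case where we generate the data ourselves.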


The contents of this forum are to be used ONLY by readers of the Learning From Data book by Yaser S. Abu-Mostafa, Malik Magdon-Ismail, and Hsuan-Tien Lin, and participants in the Learning From Data MOOC by Yaser S. Abu-Mostafa. No part of these contents is to be communicated or made accessible to ANY other person or entity.