LFD Book Forum (http://book.caltech.edu/bookforum/index.php)

 rohanag 04-17-2012 03:39 AM

question target distribution

In the lectures, we take a deterministic target function and add noise to it to make a target distribution. This is what we have done in the programming assignment too. But is it advisable to introduce noise into a data set that is handed to you? The data set may or may not already contain noise.

One more question: in the lecture, we said noise = y - f(x), and
Noisy target = deterministic target f(x) = E(y|x) + noise (y - f(x))

I do not get the above equation. Shouldn't it be:
Noisy target = deterministic target + noise
Noisy target = f(x) + noise (y - f(x))
Noisy target = y

This is really confusing me.

 cnknd 04-17-2012 02:02 PM

Re: question target distribution

My understanding is that the noisy target is y itself; the lecture simply decomposes it into a deterministic component and noise. The equation defines the noise as the difference between the noisy target and the deterministic target (i.e. y - f(x)). We can't say much more about what the noise is in general, because it depends on the situation in which you're applying the learning algorithm.
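
One way to read the lecture's line, sketched under the assumption that the deterministic target is defined as the conditional mean of y given x (the symbol \epsilon for the noise is introduced here only for illustration and is not from the lecture):

```latex
\begin{align*}
f(x) &= \mathbb{E}[y \mid x] && \text{(deterministic target)} \\
\epsilon &= y - f(x) && \text{(noise)} \\
y &= f(x) + \epsilon && \text{(noisy target)} \\
\mathbb{E}[\epsilon \mid x] &= \mathbb{E}[y \mid x] - f(x) = 0
\end{align*}
```

Read this way, "f(x) = E(y|x)" only names the deterministic part; it does not say that the whole noisy target equals E(y|x). The noisy target is still y, and the noise averages to zero at each x.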

As for the examples in the lectures and the assignment where we have to add noise to a deterministic target function, we're only doing so because the sample data that we generated come from the deterministic target function and would therefore not contain any noise. But I don't think any real-world situation would present a training set with no noise. We need to manually add noise to the "ideal" data we generated ourselves in order to mimic real-world data.
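
A minimal sketch of what "manually adding noise" to self-generated data could look like. Everything specific here is an assumption for illustration, not the actual assignment: the target function (the sign of x2 - x1), the uniform inputs on [-1, 1], the label-flipping noise model, and the names `target` and `make_noisy_dataset` are all hypothetical.

```python
import random


def target(x1, x2):
    """Hypothetical deterministic target: +1 above the line x2 = x1, else -1."""
    return 1 if x2 > x1 else -1


def make_noisy_dataset(n, flip_prob, seed=0):
    """Generate n points; each clean label is flipped with probability flip_prob."""
    rng = random.Random(seed)  # local generator so runs are reproducible
    data = []
    for _ in range(n):
        x1, x2 = rng.uniform(-1, 1), rng.uniform(-1, 1)
        clean = target(x1, x2)  # noiseless label f(x)
        # Assumed noise model: flip the sign of the label with probability flip_prob.
        y = -clean if rng.random() < flip_prob else clean
        data.append((x1, x2, clean, y))
    return data


data = make_noisy_dataset(n=1000, flip_prob=0.1)
flipped = sum(1 for _, _, clean, y in data if y != clean)
print(f"fraction of flipped labels: {flipped / len(data):.3f}")
```

With a data set that is handed to you, only y is available; the `clean` column exists here only because the data were generated from a known target.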
