View Single Post
Old 10-02-2014, 08:17 PM
magdon's Avatar
magdon magdon is offline
Join Date: Aug 2009
Location: Troy, NY, USA.
Posts: 597
Default Re: Page 63 and excercise 2.8

\overline{g}(\textbf{x}) \approx \frac{1}{K} \sum\limits_{k = 1}^{K} g_k(\textbf{x}) because \overline{g}(\textbf{x}) is defined as an expectation with respect to data sets of g(x). The average over data sets approximates this expectation.

Yes, \overline{g}(\textbf{x}) is not a valid hypothesis: it may not be in your hypothesis set; it may not even be binary. It is never used as a classifier. It is just used to represent "what would happen on average after learning", and this abstract function plays a role in defining the bias in the bias variance decomposition.

Originally Posted by Newbrict View Post
I think \overline{g}(\textbf{x}) \approx \frac{1}{K} \sum\limits_{k = 1}^{K} g_k(\textbf{x}) because it's computed over a finite set of points, whereas the actual value for \overline{g}(\textbf{x}) is an exact solution
Have faith in probability
Reply With Quote