LFD Book Forum  

#1 (09-28-2013, 08:36 AM)
Tobias, Junior Member
Subject: Online homework 4, question 4

Hi there.

I have some understanding of how to find $\bar g(x)$. After many attempts I arrived at the following solution, but I am far from sure it is valid.
$\bar g(x)$ must be the $h(x)=ax$ that minimizes the expected squared error over all points, i.e. the expected value of $(ax - \sin(\pi x))^2$. Since $x$ is uniformly distributed on $[-1,1]$, this is the same as minimizing $\int_{-1}^{1} (ax - \sin(\pi x))^2\,dx$, which yields $a = 3/\pi \approx 0.955$.
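
For reference, the minimization can be sketched as follows. This assumes the target is $f(x)=\sin(\pi x)$ with $x$ uniform on $[-1,1]$; the thread does not state the target explicitly, but it is the one consistent with the value $3/\pi$. Setting the derivative with respect to $a$ to zero gives:

```latex
\[
\frac{d}{da}\int_{-1}^{1}\bigl(ax-\sin(\pi x)\bigr)^{2}\,dx = 0
\quad\Longrightarrow\quad
a \;=\; \frac{\int_{-1}^{1} x\,\sin(\pi x)\,dx}{\int_{-1}^{1} x^{2}\,dx}
\;=\; \frac{2/\pi}{2/3}
\;=\; \frac{3}{\pi}\;\approx\;0.955
\]
```

The constant density $1/2$ of the uniform distribution appears in both numerator and denominator and cancels.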

I have a few questions about this:
  1. Am I correct?
  2. Does g-bar depend on the size of the sample?
  3. Is there a general approach to find g-bar?
#2 (09-28-2013, 09:11 PM)
yaser, Caltech
Subject: Re: Online homework 4, question 4

Close. What you have calculated is the best approximation of the target using the model, but it is based on knowing the entire target function. If you assume you know only two points at a time (the data set given in the example), then you should fit a line to the two points and average the resulting lines as you vary the two points. You will get something close, but not identical, to the slope you got.

This answers your second question in the affirmative as well. Doing this exercise with two points at a time is not the same as with three points at a time, so $\bar g$ does depend on the size of the training set in general.

The general approach to finding $\bar g$ is exactly following the definition. In integral form, it will be a double integral if the data set has two points, a triple integral if it has three points, etc., but in general it is done with Monte Carlo so no actual integration is needed.
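
A minimal Monte Carlo sketch of that procedure is given below. It is only an illustration under stated assumptions: the target is taken to be sin(pi*x) on [-1, 1] (not stated explicitly in this thread), "fitting a line" is taken to mean the least-squares fit of h(x) = a*x, and the function names `fit_slope` and `estimate_g_bar_slope` are hypothetical.

```python
import math
import random


def fit_slope(points):
    """Least-squares slope a for h(x) = a*x (a line through the origin)."""
    num = sum(x * y for x, y in points)
    den = sum(x * x for x, _ in points)
    return num / den


def estimate_g_bar_slope(n_datasets=100_000, n_points=2, seed=0):
    """Average the fitted slope over many randomly drawn data sets."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_datasets):
        # Draw one data set: n_points inputs, uniform on [-1, 1].
        xs = [rng.uniform(-1.0, 1.0) for _ in range(n_points)]
        # ASSUMPTION: the target function is sin(pi * x).
        data = [(x, math.sin(math.pi * x)) for x in xs]
        total += fit_slope(data)
    # Because h is linear in a, averaging the slopes gives the slope of g-bar.
    return total / n_datasets


if __name__ == "__main__":
    print("two-point data sets:  ", estimate_g_bar_slope(n_points=2))
    print("three-point data sets:", estimate_g_bar_slope(n_points=3))
```

Running it with different values of `n_points` is one way to check numerically how the average hypothesis changes with the size of the training set.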
The contents of this forum are to be used ONLY by readers of the Learning From Data book by Yaser S. Abu-Mostafa, Malik Magdon-Ismail, and Hsuan-Tien Lin, and participants in the Learning From Data MOOC by Yaser S. Abu-Mostafa. No part of these contents is to be communicated or made accessible to ANY other person or entity.