LFD Book Forum  

  #1  
04-10-2013, 12:14 PM
Katie C.
Member
 
Join Date: Apr 2013
Posts: 17
Hard to understand what is wanted in HW2, Q6

I am confused about what to do in Question 6. Reading the previous posts on the forum only made me more confused.

To clarify, I am using the following variable names:
N = # of points in the training set
Ntest = # of points in the testing set
Nexp = # of runs of the experiment

In Question 6, Ntest = 1000 and Nexp = 1000, and we use N = 100 from Question 5.

Am I correct in assuming that for each of the Nexp runs we:
1. Generate f(x)
2. Generate N training points
3. Estimate g(x)
4. Generate Ntest testing points
5. Evaluate g(x) on the Ntest testing points and record the Eout for that run.

After completing Nexp runs, average the Eout values to get a final estimate for Eout.

*OR* do we only perform steps 1-3 one time and then repeat steps 4 and 5 Nexp times?
  #2  
04-10-2013, 12:53 PM
yaser
Caltech
 
Join Date: Aug 2009
Location: Pasadena, California, USA
Posts: 1,477
Re: Hard to understand what is wanted in HW2, Q6

Quote:
Originally Posted by Katie C. View Post
N = # of points in the training set
Ntest = # of points in the testing set
Nexp = # of runs of the experiment

In Question 6, Ntest = 1000 and Nexp = 1000, and we use N = 100 from Question 5.

Am I correct in assuming that for each of the Nexp runs we:
1. Generate f(x)
2. Generate N training points
3. Estimate g(x)
4. Generate Ntest testing points
5. Evaluate g(x) on the Ntest testing points and record the Eout for that run.

After completing Nexp runs, average the Eout values to get a final estimate for Eout.
Hi,

You got it.
__________________
Where everyone thinks alike, no one thinks very much
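
A minimal sketch of the five steps confirmed above, repeated Nexp times and averaged. The thread does not restate the Question 5 setup, so several details here are assumptions: inputs are drawn uniformly from [-1, 1] x [-1, 1], the target is the sign of a line through two random points, g is fit by least-squares linear regression, and Eout is the fraction of misclassified test points. All function names are hypothetical.

```python
import random

def random_target(rng):
    """Assumed target: weights of a line through two random points in [-1, 1]^2."""
    (x1, y1), (x2, y2) = [(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(2)]
    # The line w0 + w1*x + w2*y = 0 passes through both points.
    return (x1 * y2 - x2 * y1, y1 - y2, x2 - x1)

def label(w, p):
    """Classify point p = (x, y) as +1 or -1 by the sign of w0 + w1*x + w2*y."""
    return 1 if w[0] + w[1] * p[0] + w[2] * p[1] >= 0 else -1

def sample(rng, w_f, n):
    """Draw n uniform points (assumed input distribution) and label them with f."""
    pts = [(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(n)]
    return pts, [label(w_f, p) for p in pts]

def solve3(a, b):
    """Solve a 3x3 linear system by Gauss-Jordan elimination with partial pivoting."""
    m = [list(row) + [bi] for row, bi in zip(a, b)]
    for c in range(3):
        piv = max(range(c, 3), key=lambda r: abs(m[r][c]))
        m[c], m[piv] = m[piv], m[c]
        for r in range(3):
            if r != c:
                f = m[r][c] / m[c][c]
                m[r] = [v - f * u for v, u in zip(m[r], m[c])]
    return [m[i][3] / m[i][i] for i in range(3)]

def linear_regression(pts, ys):
    """Least-squares weights from the normal equations (X^T X) w = X^T y."""
    rows = [(1.0, x1, x2) for x1, x2 in pts]
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * y for r, y in zip(rows, ys)) for i in range(3)]
    return solve3(xtx, xty)

def error(w_g, pts, ys):
    """Fraction of points where the sign of g disagrees with the label."""
    return sum(label(w_g, p) != y for p, y in zip(pts, ys)) / len(pts)

def one_run(rng, n=100, n_test=1000):
    w_f = random_target(rng)                      # 1. generate f
    train_pts, train_ys = sample(rng, w_f, n)     # 2. generate N training points
    w_g = linear_regression(train_pts, train_ys)  # 3. estimate g
    test_pts, test_ys = sample(rng, w_f, n_test)  # 4. generate Ntest fresh points
    return error(w_g, test_pts, test_ys)          # 5. Eout for this run

def estimate_eout(n_exp=1000, n=100, n_test=1000, seed=0):
    """Average the per-run Eout over n_exp independent runs."""
    rng = random.Random(seed)
    return sum(one_run(rng, n, n_test) for _ in range(n_exp)) / n_exp

if __name__ == "__main__":
    print(estimate_eout())
```

Because `random_target` is called inside `one_run`, every run gets its own f, training set, and test set, which matches the first reading in the question rather than the "steps 1-3 only once" alternative.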
  #3  
04-14-2013, 02:04 PM
jtsengcr
Junior Member
 
Join Date: Apr 2013
Posts: 3
Re: Hard to understand what is wanted in HW2, Q6

Are you sure? My thought for questions 5 and 6 was:

1. Generate the target function f(x1, x2) only once.
2. Generate a large data set D of (x1, x2, y), where y = f(x1, x2).

Loop 1000 times:
1. Take 100 points from the part of D set aside for training.
2. Run linear regression on the 100 training points to get g(x1, x2).
3. Check whether g(x1, x2) = y on the 100 training points to get Ein.
4. Take 1000 points from the part of D reserved for testing.
5. Check whether g(x1, x2) = y on the 1000 testing points to get Eout.

Average Ein.
Average Eout.

I did not get the right answer, so please correct my logic.
  #4  
04-14-2013, 03:02 PM
yaser
Caltech
 
Join Date: Aug 2009
Location: Pasadena, California, USA
Posts: 1,477
Re: Hard to understand what is wanted in HW2, Q6

Quote:
Originally Posted by jtsengcr View Post
Are you sure? My thought for questions 5 and 6 was:

1. Generate target function f(x1, x2) only once.
2. Generate a large data set D (x1, x2, y) where y = f(x1, x2).
When you do this, the results will depend on which target function you have. By generating a random target function every time, you take out that dependency. Of course you can be lucky in the above scenario and get a target function that is typical rather than odd, so you end up with the same answer.
__________________
Where everyone thinks alike, no one thinks very much
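
One way to write down the point about the target function, offered as a sketch and not as notation from the course: let g_D be the hypothesis learned from a training set D. The symbols for the two averages are assumptions made here for illustration.

```latex
\[
\hat{E}_{\text{fixed } f} \;\approx\; \mathbb{E}_{\mathcal{D}}\!\left[ E_{\text{out}}(g_{\mathcal{D}}) \,\middle|\, f \right],
\qquad
\hat{E}_{\text{random } f} \;\approx\; \mathbb{E}_{f}\!\left[ \mathbb{E}_{\mathcal{D}}\!\left[ E_{\text{out}}(g_{\mathcal{D}}) \,\middle|\, f \right] \right].
\]
```

With a single fixed f, the loop average estimates the left-hand quantity, which still depends on which f was drawn. Regenerating f in every run estimates the right-hand quantity, where that dependence is averaged out. The two agree only when the fixed f happens to be typical.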
The contents of this forum are to be used ONLY by readers of the Learning From Data book by Yaser S. Abu-Mostafa, Malik Magdon-Ismail, and Hsuan-Tien Lin, and participants in the Learning From Data MOOC by Yaser S. Abu-Mostafa. No part of these contents is to be communicated or made accessible to ANY other person or entity.