LFD Book Forum Relative size of data set v/s test set
#1
10-29-2014, 08:51 AM
 AnonymousMachine Junior Member Join Date: Oct 2014 Posts: 2
Relative size of data set v/s test set

Hi -

Based on assignment 9, I wanted to understand the significance of the relative sizes of the data set and the test set.

As per the assignment, from a total of 9298 data points we use only 300 for the data set and reserve the rest for the test set.

Is there any principle or logic behind the ratio of data set size to test set size? Having a data set far smaller than the test set kind of baffles me.

Any thoughts?

Thanks.
#2
10-29-2014, 06:46 PM
 magdon RPI Join Date: Aug 2009 Location: Troy, NY, USA. Posts: 595
Re: Relative size of data set v/s test set

This is just to simulate learning with a small data set; the huge test set then provides an accurate way to measure Eout.
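
To put rough numbers on "accurate", here is a minimal sketch using the Hoeffding inequality for a single fixed hypothesis evaluated on an independent test set. It assumes a bounded error measure in [0, 1] (e.g. 0/1 classification error) and i.i.d. test points. The function name `hoeffding_margin` and the confidence level `delta = 0.05` are illustrative choices, not something from the assignment.

```python
import math


def hoeffding_margin(n_test: int, delta: float = 0.05) -> float:
    """Return eps such that |E_test - E_out| <= eps with probability >= 1 - delta.

    Obtained by solving 2 * exp(-2 * eps**2 * n_test) = delta for eps.
    Assumes one fixed hypothesis and an error measure bounded in [0, 1].
    """
    return math.sqrt(math.log(2.0 / delta) / (2.0 * n_test))


total_points = 9298          # total size quoted in the thread
n_train = 300                # points kept for learning
n_test = total_points - n_train  # the remaining points form the test set

# Margin with the large test set from the assignment
print(n_test, hoeffding_margin(n_test))

# For comparison: the margin if the test set had only 300 points
print(300, hoeffding_margin(300))
```

Under these assumptions, the 8998-point test set pins Eout down to within roughly 0.014 at 95% confidence, while a 300-point test set would only give a margin of about 0.08. The margin shrinks like 1/sqrt(N), which is why a large test set is useful when the goal is to measure Eout rather than to learn well.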

__________________
Have faith in probability
#3
04-08-2017, 02:04 AM
 diethealthcare Banned Join Date: Apr 2017 Posts: 14
Re: Relative size of data set v/s test set

Great information, thanks a lot.

All times are GMT -7.
