LFD Book Forum

LFD Book Forum (http://book.caltech.edu/bookforum/index.php)
-   Homework 1 (http://book.caltech.edu/bookforum/forumdisplay.php?f=130)
-   -   Probablity distribution on X (http://book.caltech.edu/bookforum/showthread.php?t=836)

goodrm 07-10-2012 09:26 PM

Probablity distribution on X
 
I was looking over lecture #2. The following statement was made:

"You use probablity to assume points x(1) up to x(n)"

From slide 9 (lecture 2), this seems to infer that the training samples are generated from the probability distribution. Am I correct in my understanding?

Before proability was introduced, it seemed that we were discussing training samples as representing actual customer data such as (salary, years in residence, Years at job,etc).

I am having some trouble understanding how the probability assumes points on x and what this really means. Is this sort of like a random generator?

Any thoughts would be earnestly appreciated.

yaser 07-10-2012 10:23 PM

Re: Probablity distribution on X
 
Quote:

Originally Posted by goodrm (Post 3365)
From slide 9 (lecture 2), this seems to infer that the training samples are generated from the probability distribution. Am I correct in my understanding?

Before proability was introduced, it seemed that we were discussing training samples as representing actual customer data such as (salary, years in residence, Years at job,etc).

I am having some trouble understanding how the probability assumes points on x and what this really means. Is this sort of like a random generator?

Your understanding is correct. The inputs of the data set {\bf x}_1, \cdots , {\bf x}_N are now assumed to be generated by some probability distribution. This does not change the nature of the {\bf x}_n's; they are still customer data in the the credit example. The only change is that those customers are now assumed to be pulled from the population according to some distribution that will also be used to generate new customers to test the learned hypothesis.

The assumption is benign since no restrictions are made about what kind of distribution it is, and we don't even need to know what the distribution is. It is just a mechanism that allows us to invoke probabilistic analysis, which is needed to establish the feasibility of learning as discussed in the lecture.

goodrm 07-11-2012 09:52 AM

Re: Probablity distribution on X
 
This clears things up!! Thanks for the quick response. I guess I was stuck in the mode of thinking in terms of real data based on actual applicants. Many Thanks!!

defopuvo 06-17-2019 05:51 AM

Re: Probablity distribution on X
 
thank you so much.: Clueless:


All times are GMT -7. The time now is 06:38 PM.

Powered by vBulletin® Version 3.8.3
Copyright ©2000 - 2019, Jelsoft Enterprises Ltd.
The contents of this forum are to be used ONLY by readers of the Learning From Data book by Yaser S. Abu-Mostafa, Malik Magdon-Ismail, and Hsuan-Tien Lin, and participants in the Learning From Data MOOC by Yaser S. Abu-Mostafa. No part of these contents is to be communicated or made accessible to ANY other person or entity.