LFD Book Forum question about probability

#1
04-08-2012, 01:32 AM
 canon1230 Junior Member Join Date: Apr 2012 Posts: 2

If an urn contains 100 green or red marbles, and you sample 10 and they are all green, what is the probability that they are all green?
#2
04-08-2012, 03:30 AM
 htlin NTU Join Date: Aug 2009 Location: Taipei, Taiwan Posts: 601

Quote:
 Originally Posted by canon1230 If an urn contains 100 green or red marbles, and you sample 10 and they are all green, what is the probability that they are all green?
The short answer is that probability alone cannot tell us this unless we make more assumptions.

Take the simplest case: if you flip a coin once and get a head, what is the coin's probability of heads? The answer can be "any non-zero number", and there is no further information to pin it down.
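
To illustrate what "more assumptions" could look like, here is a minimal sketch that assumes a uniform prior over the number of green marbles in the urn (0 to 100) and sampling without replacement. Both the prior and the function name `posterior_all_green` are assumptions made for illustration; they are not part of the original question, and a different prior would give a different answer.

```python
from fractions import Fraction
from math import comb


def posterior_all_green(total=100, drawn=10):
    """Posterior P(urn is all green | every drawn marble was green).

    Assumes (hypothetically) a uniform prior over the number of green
    marbles, 0..total, and sampling without replacement.
    """
    def likelihood(k):
        # Chance of an all-green sample when the urn holds k green marbles.
        # math.comb returns 0 when k < drawn, so those urns drop out.
        return Fraction(comb(k, drawn), comb(total, drawn))

    # The uniform prior weight 1/(total + 1) cancels in the ratio.
    evidence = sum(likelihood(k) for k in range(total + 1))
    return likelihood(total) / evidence


print(posterior_all_green())  # 11/101 under this particular prior
```

Under this prior the all-green urn gets a posterior of about 11%, but that number comes from the prior as much as from the data.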

Hope this helps.
__________________
When one teaches, two learn.
#3
04-08-2012, 04:26 AM
 GraceLAX Junior Member Join Date: Apr 2012 Location: LAX Posts: 4

Quote:
 Originally Posted by canon1230 If an urn contains 100 green or red marbles, and you sample 10 and they are all green, what is the probability that they are all green?
A physics professor friend wrote this up to help clarify a common statistical misconception.
Do these help? Read through the second one for a similar example.

#4
04-08-2012, 09:44 AM
 canon1230 Junior Member Join Date: Apr 2012 Posts: 2

Does the Hoeffding Inequality allow us to say something about this probability?

P[|Ein - Eout| > epsilon] <= 2e^(-2 * epsilon^2 * N)

Since Ein = 0, N = 10, setting epsilon to 0.5, the inequality gives us:

P[Eout > 0.5] <= 2e^(-5) = 0.013+

This seems to be saying something nontrivial about Eout.
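
The arithmetic above can be checked with a short sketch. The helper name `hoeffding_bound` is hypothetical; it only evaluates the right-hand side of the inequality as quoted in this post.

```python
from math import exp


def hoeffding_bound(epsilon, n):
    """Right-hand side of P[|Ein - Eout| > epsilon] <= 2e^(-2 * epsilon^2 * N)."""
    return 2 * exp(-2 * epsilon ** 2 * n)


bound = hoeffding_bound(0.5, 10)
print(round(bound, 4))  # 0.0135
```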
#5
04-08-2012, 04:01 PM
 htlin NTU Join Date: Aug 2009 Location: Taipei, Taiwan Posts: 601

Quote:
 Originally Posted by canon1230 Does the Hoeffding Inequality allow us to say something about this probability? P[|Ein - Eout| > epsilon] <= 2e^(-2 * epsilon^2 * N) Since Ein = 0, N = 10, setting epsilon to 0.5, the inequality gives us: P[Eout > 0.5] <= 2e^(-5) = 0.013+ This seems to be saying something nontrivial about Eout.
The probability P in Hoeffding is taken over the process of generating the sample (i.e. over Ein), not over Eout. Indeed it tells us something nontrivial (and that's how we use it in the learning context), but it does not answer your original question.

The question that got answered by Hoeffding is roughly

"What is the probability of a big-Eout urn (many red) for generating such an Ein (all green)?"

not

"What is the probability of Eout being small in the first place?"

The answer to the latter question remains unknown, but even so, we know that having a big Eout is unlikely because of Hoeffding.
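
A rough sketch of the first question, assuming independent draws (sampling with replacement) and treating Eout as the fraction of red marbles. The function name `prob_all_green` and the Eout values tried are illustrative only. It computes the chance that a given urn produces an all-green sample; it says nothing about how likely that urn was to begin with, which would require a prior.

```python
from math import exp

N = 10
EPSILON = 0.5
bound = 2 * exp(-2 * EPSILON ** 2 * N)  # Hoeffding bound for these settings


def prob_all_green(eout, n=N):
    """P(all n draws are green) when a fraction `eout` of the urn is red.

    Assumes independent draws (sampling with replacement).
    """
    return (1 - eout) ** n


# Every "big-Eout" urn (more than half red) rarely produces an all-green sample.
for eout in (0.51, 0.6, 0.8, 1.0):
    p = prob_all_green(eout)
    print(f"Eout={eout:.2f}  P(all green)={p:.6f}  within bound: {p <= bound}")
```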

Hope this helps.
#6
04-10-2012, 08:19 AM
 rukacity Member Join Date: Apr 2012 Posts: 21

I would think this way:

If p is the probability of a favorable outcome on a single trial, then over 10 independent trials the probability that all outcomes are favorable is p^10.
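
A small sketch of this calculation, assuming independent trials with the same p each time (for an urn, that means drawing with replacement). The function name `prob_all_favorable` is made up for the example.

```python
def prob_all_favorable(p, trials=10):
    """Probability that every one of `trials` independent draws is favorable."""
    return p ** trials


for p in (0.5, 0.9, 0.99):
    print(p, prob_all_favorable(p))
```

Even a fairly high p leaves a noticeable chance of seeing at least one unfavorable outcome in 10 draws, so an all-green sample does carry some information about p.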
#7
04-14-2012, 12:00 AM
 jsarrett Member Join Date: Apr 2012 Location: Sunland, CA Posts: 13

Hoeffding's inequality has a free parameter in your question, namely epsilon. It lets you say that since you have 10 samples, if you want to be 80% sure of the distribution of marbles in the jar (P[|Ein - Eout| > epsilon] <= 0.2), then the jar differs from the sample by at most epsilon.

2e^(-2 * epsilon^2 * N) = 0.2

where N = 10

simplifying:

epsilon = sqrt(ln(10) / 20) = 0.339+

so we think that with 80% confidence, the jar is at least 66% green.

We probably need a bigger N!
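
To see how much bigger, here is a sketch that solves the two-sided bound quoted earlier in the thread, 2e^(-2 * epsilon^2 * N) = delta, for either epsilon or N. The function names `epsilon_for` and `samples_needed` and the target epsilon of 0.1 are assumptions chosen for illustration.

```python
from math import ceil, log, sqrt


def epsilon_for(delta, n):
    """Solve 2e^(-2 * epsilon^2 * N) = delta for epsilon."""
    return sqrt(log(2 / delta) / (2 * n))


def samples_needed(delta, epsilon):
    """Smallest N for which 2e^(-2 * epsilon^2 * N) <= delta."""
    return ceil(log(2 / delta) / (2 * epsilon ** 2))


print(round(epsilon_for(0.2, 10), 3))  # 0.339
print(samples_needed(0.2, 0.1))        # 116
```

With delta = 0.2, tightening epsilon from about 0.34 to 0.1 takes the sample size from 10 to 116 under this bound.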
