Quote:
Originally Posted by ilya19
If I understand the problem correctly, P(X=1) is independent on P(h=f). Correct?
|
P(X=1) is defined over the full joint distribution, i.e.

. The h is marginalized out by the summation. However it doesn't mean that X and h are independent.
The reason why you can ignore P(X=1) is because in Bayesian analysis you usually don't care about the absolute probability of the dataset since it is just a constant that all of your hypotheses are divided by, equally, so it doesn't affect which hypothesis is a-posteriori most probable.