Quote:
Originally Posted by ilya19
If I understand the problem correctly, P(X=1) is independent on P(h=f). Correct?

P(X=1) is defined over the full joint distribution, i.e.
. The h is marginalized out by the summation. However it doesn't mean that X and h are independent.
The reason why you can ignore P(X=1) is because in Bayesian analysis you usually don't care about the absolute probability of the dataset since it is just a constant that all of your hypotheses are divided by, equally, so it doesn't affect which hypothesis is aposteriori most probable.