Choice of Error measure for Logistic Regression
From the textbook (page 91), it seems intuitive that the likelyhood function is the hypothesis.
However, for the case of logistic regression in another book, the choice of the likelyhood function is the bernoulli distribution where y_n is {0, 1}.
In the case of heart attacks as specified in the book, I could have changed the label of y_n from {-1, 1} to {0, 1} and used the bernoulli distribution as the likelyhood function instead.
eg. (h(x)^y_n)*(1-h(x)^(1-y_n)
Therefore, what are the factors involved in the consideration of choosing the error measure and what are the pros and cons in this case?
|