Binary Classification Error
Is the binary classification the F-score or just h(x) ≠y? In the one vs. all classifications, some of the positive classifications are half the size of others (e.g. 5 in train is 7.4%, 0 in test is 17.9%). The simple comparison can miss a huge error in false negatives.
|