Quote:
Originally Posted by Andrs
...But if we have a "difficult data", the classification margins may be very narrow and if we handle the data in different sequences we may get different results.
|
That also sounds reasonable to me as an explanation.
Keith can you confirm that you shuffled your data first? I can understand why the output could be considerably different when cross validation is applied due to what data gets put into each fold. However for a straight learn and predict I hadn't anticipated a "considerable" difference in Ein and Eout when the data is the same but in a different order.