![]() |
#1
|
|||
|
|||
![]()
In hw5, Q8 Q9. I've done the SGD, then I tried not to shuffle the 100 samples, instead, calculate from 1 to 100 in each epoch, and they come up with similar result.
|
#2
|
||||
|
||||
![]()
The shuffle is done anew for every epoch to get the benefit of randomness. For some data sets randomness doesn't make that much of a difference, but for some it does.
__________________
Where everyone thinks alike, no one thinks very much |
![]() |
Thread Tools | |
Display Modes | |
|
|