Re: Conflicting Lessons?
Actually, it seems that the two do not quite conflict.
The first lesson: match model complexity to data resources, means that you vary the complexity of your model based on how much data you have [perhaps employing a heuristic such as N = 10*dvc or dvc = N/10]
The second says that you should not look at the data itself, not merely the amount of data, but the fine structure, distribution, features, etc.
(I'd be glad to get feedback from course staff as to how well my attempted explanation gets at the meaning of the two sayings.)
|