View Single Post
Old 06-09-2012, 03:01 PM
dbl001 dbl001 is offline
Join Date: Apr 2012
Posts: 11
Default Data Snooping, Classifiers


This is an excerpt from 'Mahout in Action' chapter 14 on building a classifier:

Preliminary analysis of data is critical to successful classification. Itís sometimes fun because the analysis often turns up Easter eggs like the Moon-Phase header line in table 14.2. These surprises can also be important in building a classifier, because they can uncover problems in the data or give you a key insight that simplifies the classification problem. Visualize early and visualize often.

Sean Owen, Robin Anil, Ted Dunning, Ellen Friedman (2012-01-16 18:35:04.792000-06:00). Mahout in Action (Kindle Locations 6297-6300). Manning Publications. Kindle Edition.

Would this be considered 'Data Snooping'?

Thanks in Advance
Reply With Quote