View Single Post
  #3  
Old 04-16-2012, 03:07 PM
htlin's Avatar
htlin htlin is offline
NTU
 
Join Date: Aug 2009
Location: Taipei, Taiwan
Posts: 601
Default Re: Coping with errors in training set

Quote:
Originally Posted by student322 View Post
In Lecture 3, it was mentioned that when a human classifies hand-written digits, the error rate is around 2%. It seems then that training sets will also have errors (perhaps even introduced deliberately by mischevious individuals), and that the learning algorithm will thus be able to do no better than the error rate in the training set. Is this true? Are there any methods to efficiently detect and/or correct errors in the training set, aside from reviewing the whole set numerous times manually?
In addition to the PhD theses, the following paper is on the issue and may be of interest to you.

Ling Li, Amrit Pratap, Hsuan-Tien Lin and Yaser S. Abu-Mostafa. Improving Generalization by Data Categorization. In A. Jorge et al., eds., Knowledge Discovery in Databases: PKDD '05, vol. 3721 of Lecture Notes in Artificial Intelligence, 157-168, Springer-Verlag, 2005.

http://www.csie.ntu.edu.tw/~htlin/pa...ingerprint.pdf

The methods "automatically" review the whole set.
__________________
When one teaches, two learn.
Reply With Quote