LFD Book Forum  

Go Back   LFD Book Forum > Book Feedback - Learning From Data > Chapter 1 - The Learning Problem

Reply
 
Thread Tools Display Modes
  #1  
Old 08-24-2015, 07:28 AM
henry2015 henry2015 is offline
Member
 
Join Date: Aug 2015
Posts: 31
Default Intuition of the step of PLA

According to the book, the update rule for PLA is w(t+1) = w(t) + y(t)x(t), and the book mentions "this rule moves the boundary in the direction of classifying x(t) correctly".

I understand that there is a convergence proof for PLA. But it is hard for me to see why such rule (or step) moves the boundary in the direction of classifying x(t) correctly. The formula just adds actual outcome (i.e. y(t)) times the misclassified point (i.e. x(t)) to the current weight matrix (which is just a vector of coefficient of hypothesis equation).

Any pointer will help.

Thanks in advance!
Reply With Quote
  #2  
Old 08-24-2015, 09:34 PM
yaser's Avatar
yaser yaser is offline
Caltech
 
Join Date: Aug 2009
Location: Pasadena, California, USA
Posts: 1,477
Default Re: Intuition of the step of PLA

The point {\bf x}_n would be correctly classified if {\bf w}^{\rm T}{\bf x}_n agreed in sign with y_n. Therefore, moving {\bf w}^{\rm T}{\bf x}_n in the direction of agreeing with that sign would be moving it in the right direction.

Adding {\bf x}_n y_n to {\bf w} will indeed achieve that, since it will add {\bf x}_n^{\rm T}{\bf x}_n y_n to the quantity {\bf w}^{\rm T}{\bf x}_n and what it adds agrees with y_n in sign since the {\bf x}_n^{\rm T}{\bf x}_n part is always positive.
__________________
Where everyone thinks alike, no one thinks very much
Reply With Quote
  #3  
Old 08-27-2015, 03:04 AM
elyakim elyakim is offline
Junior Member
 
Join Date: Aug 2015
Posts: 2
Default Re: Intuition of the step of PLA

Quote:
Originally Posted by yaser View Post
The point {\bf x}_n would be correctly classified if {\bf w}^{\rm T}{\bf x}_n agreed in sign with y_n. Therefore, moving {\bf w}^{\rm T}{\bf x}_n in the direction of agreeing with that sign would be moving it in the right direction.

Adding {\bf x}_n y_n to {\bf w} will indeed achieve that, since it will add {\bf x}_n^{\rm T}{\bf x}_n y_n to the quantity {\bf w}^{\rm T}{\bf x}_n and what it adds agrees with y_n in sign since the {\bf x}_n^{\rm T}{\bf x}_n part is always positive.
Earlier I indicated having difficulty with reading the equations in problem 1.3.
It works but I'm concerned I'm updating weights with a rule that is "not so smart":
  • the difference between the target function value and x2 (or the y-value in a visual simulation) for a misclassified point.
Especially I don't recognize the product of "x transpose and x" part.

To summarize my questions:
  1. would insight into vector computation make 'everything easier'?
  2. what is p in the equation? A random symbol?
Thanks again.
Reply With Quote
  #4  
Old 08-27-2015, 07:43 AM
henry2015 henry2015 is offline
Member
 
Join Date: Aug 2015
Posts: 31
Default Re: Intuition of the step of PLA

Quote:
Originally Posted by yaser View Post
The point {\bf x}_n would be correctly classified if {\bf w}^{\rm T}{\bf x}_n agreed in sign with y_n. Therefore, moving {\bf w}^{\rm T}{\bf x}_n in the direction of agreeing with that sign would be moving it in the right direction.

Adding {\bf x}_n y_n to {\bf w} will indeed achieve that, since it will add {\bf x}_n^{\rm T}{\bf x}_n y_n to the quantity {\bf w}^{\rm T}{\bf x}_n and what it adds agrees with y_n in sign since the {\bf x}_n^{\rm T}{\bf x}_n part is always positive.
Now, mathematically, I can see why adding y(t) * transpose of x(t) * x(t) to transpose of w(t) * x(t) makes it get closer to the solution. Thanks!

Just a bit hard to visualize it.
Reply With Quote
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT -7. The time now is 07:41 AM.


Powered by vBulletin® Version 3.8.3
Copyright ©2000 - 2019, Jelsoft Enterprises Ltd.
The contents of this forum are to be used ONLY by readers of the Learning From Data book by Yaser S. Abu-Mostafa, Malik Magdon-Ismail, and Hsuan-Tien Lin, and participants in the Learning From Data MOOC by Yaser S. Abu-Mostafa. No part of these contents is to be communicated or made accessible to ANY other person or entity.