#1
10-13-2013, 09:59 PM
 yaser Caltech Join Date: Aug 2009 Location: Pasadena, California, USA Posts: 1,477
LR and PLA with scaled input space

A post at another forum:
Quote:
 While answering question 7, I mistakenly took my test points from , which resulted in the number of iterations being approximately double of what it was when I corrected it to . I thought that this might simply be because doubling the interval provided more area, making the points more loosely scattered and allowing for a larger number of lines to satisfy the test cases. However, when I tested my hypothesis by taking intervals of and , etc., I found that the number of iterations was least for and went up in either direction from there. The points which defined my line were also taken from the larger intervals. Can anyone help explain why this might be the case?
If you scale $\mathbf{x}$, then the linear regression solution $\mathbf{w}$ scales in the opposite direction (other things being equal) since it is trying to make $\mathbf{w}^T\mathbf{x}$ match the same value ($+1$ or $-1$). Now if you take the LR solution and use it as the initial condition for PLA, the impact of each PLA iteration scales up with $\mathbf{x}$ since you are adding $y\,\mathbf{x}$ to the weight vector at each iteration.
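
The scaling argument above can be written out as a short derivation. This is only a sketch under simplifying assumptions that are not in the original post: every input coordinate is multiplied by the same factor $a > 0$ (the constant bias coordinate is ignored), and $X$ denotes the matrix whose rows are the inputs $\mathbf{x}_n^T$.

```latex
\begin{align*}
X' &= aX \\
\mathbf{w}'_{\text{lr}} &= (X'^T X')^{-1} X'^T \mathbf{y}
   = (a^2 X^T X)^{-1}\, a X^T \mathbf{y}
   = \tfrac{1}{a}\,\mathbf{w}_{\text{lr}} \\
\text{PLA update:}\quad \mathbf{w} &\leftarrow \mathbf{w} + y_n \mathbf{x}'_n
   = \mathbf{w} + a\, y_n \mathbf{x}_n \\
\frac{\lVert a\, y_n \mathbf{x}_n \rVert}{\lVert \mathbf{w}'_{\text{lr}} \rVert}
   &= a^2\, \frac{\lVert \mathbf{x}_n \rVert}{\lVert \mathbf{w}_{\text{lr}} \rVert}
\end{align*}
```

So under these assumptions the size of one PLA step relative to the LR starting point grows like $a^2$, which is why the effect is significant in both directions.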

Put these together and you conclude that, as $\mathbf{x}$ scales up and down, the impact of the LR solution vector on PLA goes down and up, respectively, and significantly so. On the large extreme, the LR solution behaves like the vector $\mathbf{0}$, so you get the original PLA iterations. As $\mathbf{x}$ gets smaller, $\mathbf{w}_{\text{lr}}$ kicks in as a good initial condition (with non-trivial size) and you gain some PLA iterations. As $\mathbf{x}$ diminishes further, PLA will take longer to correct the misclassified points that the LR didn't get, simply because each PLA iteration becomes relatively smaller in the movement that it creates.
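
To check this on your own, here is a minimal Python sketch of the experiment. Several things are my assumptions rather than part of the original question: the function names, the default of 10 training points, picking a random misclassified point at each step, the iteration cap, and keeping the bias coordinate fixed at 1 while only the two input coordinates are drawn from the scaled interval `[-scale, scale]`.

```python
import numpy as np


def pla_iterations_after_lr(scale, n_points=10, rng=None, max_iters=10_000):
    """One run: count PLA updates when PLA starts from the LR weights."""
    rng = np.random.default_rng() if rng is None else rng

    # Target line through two random points in the scaled square.
    p, q = rng.uniform(-scale, scale, size=(2, 2))

    # Training points; the bias coordinate x0 = 1 is NOT scaled (assumption).
    pts = rng.uniform(-scale, scale, size=(n_points, 2))
    X = np.column_stack([np.ones(n_points), pts])

    # Label each point by the side of the target line it falls on.
    d = q - p
    side = d[0] * (pts[:, 1] - p[1]) - d[1] * (pts[:, 0] - p[0])
    y = np.where(side >= 0, 1.0, -1.0)

    # Linear regression solution via the pseudo-inverse.
    w = np.linalg.pinv(X) @ y

    # PLA starting from the LR weights: pick a random misclassified point
    # and add y * x to the weight vector until nothing is misclassified.
    for updates in range(max_iters):
        wrong = np.flatnonzero(np.sign(X @ w) != y)
        if wrong.size == 0:
            return updates
        i = rng.choice(wrong)
        w = w + y[i] * X[i]
    return max_iters  # cap reached without confirming convergence


def average_iterations(scale, n_runs=100, seed=0):
    """Average number of PLA updates over n_runs independent runs."""
    rng = np.random.default_rng(seed)
    counts = [pla_iterations_after_lr(scale, rng=rng) for _ in range(n_runs)]
    return float(np.mean(counts))
```

Calling `average_iterations(s)` for a range of `s` values (say a few powers of 10 on either side of 1) lets you compare the averages across scales. Very small or very large scales may need many updates per run, so the cap `max_iters` can bias those averages downward.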
